Movatterモバイル変換

Abstract

This document describes PNG (Portable Network Graphics), an extensible file format for thelossless, portable, well-compressed storage of static and animated raster images. PNG provides a patent-free replacement for GIF and can also replace many common uses of TIFF.Indexed-color,greyscale, andtruecolor images are supported, plus an optional alpha channel. Sample depths range from 1 to 16 bits.

PNG is designed to work well in online viewing applications, such as the World Wide Web, so it is fully streamable with a progressive display option. PNG is robust, providing both full file integrity checking and simple detection of common transmission errors. Also, PNG can store color space data for improved color matching on heterogeneous platforms.

This specification defines two Internet Media Types, image/png and image/apng.

Status of This Document

This section describes the status of this document at the time of its publication. A list of currentW3C publications and the latest revision of this technical report can be found in the W3C standards and drafts index at https://www.w3.org/TR/.

This specification is intended to become an International Standard, but is not yet one. It is inappropriate to refer to this specification as an International Standard.

This document was published by thePortable Network Graphics (PNG) Working Group as a Recommendation using theRecommendation track.

A W3C Recommendation is a specification that, after extensive consensus-building, is endorsed byW3C and its Members, and has commitments from Working Group members to royalty-free licensing for implementations.

W3C recommends the wide deployment of this specification as a standard for the Web.

This document was produced by a group operating under the W3C Patent Policy.W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes containsEssential Claim(s) must disclose the information in accordance withsection 6 of the W3C Patent Policy.

This document is governed by the03 November 2023 W3C Process Document.

1.Introduction

The design goals for this specification were:

Portability: encoding, decoding, and transmission should be software and hardware platform independent.
Completeness: it should be possible to representtruecolor,indexed-color, andgreyscale images, in each case with the option of transparency, color space information, and ancillary information such as textual comments.
Serial encode and decode: it should be possible for datastreams to be generated serially and read serially, allowing the datastream format to be used for on-the-fly generation and display of images across a serial communication channel.
Progressive presentation: it should be possible to transmit datastreams so that an approximation of the whole image can be presented initially, and progressively enhanced as the datastream is received.
Robustness to transmission errors: it should be possible to detect datastream transmission errors reliably.
Losslessness: filtering and compression should preserve all information.
Performance: any filtering, compression, and progressive image presentation should be aimed at efficient decoding and presentation. Fast encoding is a less important goal than fast decoding. Decoding speed may be achieved at the expense of encoding speed.
Compression: images should be compressed effectively, consistent with the other design goals.
Simplicity: developers should be able to implement the standard easily.
Interchangeability: any standard-conforming PNG decoder shall be capable of reading all conforming PNG datastreams.
Flexibility: future extensions and private additions should be allowed for without compromising the interchangeability of standard PNG datastreams.
Freedom from legal restrictions: no algorithms should be used that are not freely available.

2.Scope

This specification specifies a datastream and an associated file format, Portable Network Graphics (PNG, pronounced "ping"), for alossless, portable, compressed individual computer graphics image or frame-based animation, transmitted across the Internet.

3.Terms, definitions, and abbreviated terms

For the purposes of this specification the following definitions apply.

byte

octet

8-bit binary integer in the range [0, 255] where the most significant bit is bit 7 and the least significant bit is bit 0.

byte order

ordering ofbytes for multi-byte data values.

chromaticity

pair ofx andy values in thexyY space specified at [COLORIMETRY].

Note

Chromaticity is a measure of the quality of a color regardless of its luminance.

composite (verb)

form an image by merging a foreground image and a background image, using transparency information to determine where and to what extent the background should be visible.

Note

The foreground image is said to becomposited against the background.

datastream

sequence ofbytes.

deflate

member of theLZ77 family of compression methods.

SOURCE: [RFC1951]

frame

For static PNG, thestatic image is considered to be the first (and only) frame. For animated PNG, each image that forms part of theframe-based animation sequence is a frame. Thus, for animated PNG, when the static image is not the first frame, the static image is not considered to be a frame.

frame buffer

the final digital storage area for the image shown by most types of computer display.

Note

Software causes an image to appear on screen by loading the image into theframe buffer.

fully transparent black

pixel where the red, green, blue and alpha components are all equal to zero.

gamma value

value of the exponent of agamma transfer function.

gamma

power-lawtransfer function.

high dynamic range (HDR)

an image format capable of storing images with a relatively high dynamic range similar to or in excess of the human visual system's instantaneous dynamic range (~12-14stops). PNG allows the use of twoHDR formats,HLG andPQ [ITU-R-BT.2100].

hybrid log-gamma (HLG)

transfer function defined in [ITU-R-BT.2100] Table 5. (A relative scene-referred system.)

full-range image

image where reference black and white correspond, respectively, to sample values0 and2^{bit depth} - 1.

image data

1-dimensional array ofscanlines within an image.

interlaced PNG image

sequence ofreduced images generated from thePNG image bypass extraction.

lossless

method of data compression that permits reconstruction of the original data exactly, bit-for-bit.

luminance

an objective measurement of the visible light intensity, taking into account the sensitivity of the human eye to different wavelengths.

Note

Luminance andchromaticity together fully define a measured color. For a formal definition, see [COLORIMETRY].

LZ77

data compression algorithm described in [Ziv-Lempel].

narrow-range image

Image where reference black and white do not correspond, respectively, to sample values0 and2^{bit depth} - 1.

network byte order

byte order in which the most significant byte comes first, then the less significant bytes in descending order of significance (MSB LSB for two-byte integers,MSB B2 B1 LSB for four-byte integers).

perceptual quantizer (PQ)

transfer function defined in [ITU-R-BT.2100] Table 4. (An absolute display-referred system.)

Note

Only RGB may be used in PNG, ICtCp is NOT supported.

PNG decoder

process or device that reconstructs thereference image from aPNG datastream and generates a correspondingdelivered image.

PNG editor

process or device that creates a modification of an existingPNG datastream, preserving unmodified ancillary information wherever possible, and obeying thechunk ordering rules, even for unknown chunk types.

PNG encoder

process or device which constructs areference image from asource image, and generates aPNG datastream representing the reference image.

PNG file

PNG datastream stored as a file.

PNG four-byte unsigned integer

a four-byte unsigned integer limited to the range 0 to 2³¹-1.

Note

The restriction is imposed in order to accommodate languages that have difficulty with unsigned four-byte values.

PNG two-byte unsigned integer

a two-byte unsigned integer in network byte order.

sample

intersection of achannel and apixel in an image.

sample depth

number of bits used to represent asample value.

scanline

row ofpixels within an image orinterlaced PNG image.

standard dynamic range (SDR)

an image format capable of storing images with a relatively low dynamic range of 5-8stops. Examples include [SRGB], [Display-P3], [ITU-R-BT.709].

Note

Standard dynamic range is independent of the primaries and hence, gamut. Wide color gamutSDR formats are supported by PNG.

stop

a change in scene light luminance of a factor of 2.

transfer function

function relating image luminance with image samples.

white point

chromaticity of a computer display's nominal white value.

zlib

deflate-style compression method.

SOURCE: [rfc1950]

Note

Also refers to the name of a library containing a sample implementation of this method.

Cyclic Redundancy Code

CRC

type of check value designed to detect most transmission errors.

Note

A decoder calculates the CRC for the received data and checks by comparing it to the CRC calculated by the encoder and appended to the data. A mismatch indicates that the data or the CRC were corrupted in transit.

Cathode Ray Tube

CRT

vacuum tube containing one or more electron guns, which emit electron beams that are manipulated to display images on a phosphorescent screen.

Electro-Optical Transfer Function

EOTF

The transfer function between the electrical or digital domain and light energy. It defines the amount of light emitted by a display for a given input signal.

Least Significant Byte

LSB

Least significant byte of a multi-byte value.

Most Significant Byte

MSB

Most significant byte of a multi-byte value.

Opto-Electrical Transfer Function

OETF

The transfer function between light energy and the electrical or digital domain. It defines the amount of light in a scene required to produce a given output signal.

4.Concepts

4.1Static and Animated images

All PNG images contain a singlestatic image.

Some PNG images — calledAnimated PNG (APNG) — also contain a frame-based animation sequence, theanimated image. The first frame of this may be — but need not be — the static image. Non-animation-capable displays (such as printers) will display thestatic image rather than the animation sequence.

Thestatic image, and each individual frame of ananimated image, corresponds to areference image and is stored as aPNG image.

4.2Images

This specification specifies the PNG datastream, and places some requirements on PNG encoders, which generate PNG datastreams, PNG decoders, which interpret PNG datastreams, andPNG editors, which transform one PNG datastream into another. It does not specify the interface between an application and either a PNG encoder, decoder, or editor. The precise form in which an image is presented to an encoder or delivered by a decoder is not specified. Four kinds of image are distinguished.

Source image: Thesource image is the image presented to a PNG encoder.
Reference image: Thereference image, which only exists conceptually, is a rectangular array of rectangularpixels, all having the same width and height, and all containing the same number of unsigned integer samples, either three (red, green, blue) or four (red, green, blue, alpha). The array of all samples of a particular kind (red, green, blue, or alpha) is called achannel. Each channel has a sample depth in the range 1 to 16, which is the number of bits used by every sample in the channel. Different channels may have different sample depths. The red, green, and blue samples determine the intensities of the red, green, and blue components of the pixel's color; if they are all zero, the pixel is black, and if they all have their maximum values (2^sampledepth-1), the pixel is white. Thealpha sample determines a pixel's degree of opacity, where zero means fully transparent and the maximum value means fully opaque. In a three-channel reference image all pixels are fully opaque. (It is also possible for a four-channel reference image to have all pixels fully opaque; the difference is that the latter has a specific alpha sample depth, whereas the former does not.) Each horizontal row of pixels is called a scanline. Pixels are ordered from left to right within each scanline, and scanlines are ordered from top to bottom. Every reference image can be represented exactly by aPNG datastream and everyPNG datastream can be converted into a reference image. A PNG encoder may transform the source image directly into a PNG image, but conceptually it first transforms the source image into a reference image, then transforms the reference image into a PNG image. Depending on the type of source image, the transformation from the source image to a reference image may require the loss of information. That transformation is beyond the scope of this International Standard. The reference image, however, can always be recovered exactly from a PNG datastream.
PNG image: ThePNG image is obtained from the reference image by a series of transformations:alpha separation,indexing,RGB merging,alpha compaction, andsample depth scaling. Five types of PNG image are defined (see6.1Color types and values). (If the PNG encoder actually transforms the source image directly into the PNG image, and the source image format is already similar to the PNG image format, the encoder may be able to avoid doing some of these transformations.) Although not all sample depths in the range 1 to 16 bits are explicitly supported in the PNG image, the number of significant bits in each channel of the reference image may be recorded. All channels in the PNG image have the same sample depth. A PNG encoder generates a PNG datastream from the PNG image. A PNG decoder takes the PNG datastream and recreates the PNG image.
Delivered image: Thedelivered image is constructed from the PNG image obtained by decoding a PNG datastream. No specific format is specified for thedelivered image. A viewer presents an image to the user as close to the appearance of the original source image as it can achieve.

The relationships between the four kinds of image are illustrated inFigure1.

Figure1 Relationships between source, reference, PNG, and display images

The relationships between samples, channels, pixels, and sample depth are illustrated inFigure2.

Figure2 Relationships between sample, sample depth, pixel, and channel

4.3Color spaces

The RGB color space in which color samples are situated may be specified in one of four ways:

by CICP image format signaling metadata;
by an ICC profile;
by specifying explicitly that the color space is sRGB when the samples conform to this color space;
by specifying agamma value and the 1931 CIEx,y chromaticities of the red, green, and blue primaries used in the image and the referencewhite point.

For high-end applications the first two methods provides the most flexibility and control. The third method enables one particular, but extremely common, color space to be indicated. The fourth method, which was standardized before ICC profiles were widely adopted, enables the exact chromaticities of the RGB data to be specified, along with thegamma correction to be applied (seeC.Gamma and chromaticity). However, color-aware applications will prefer one of the first three methods, while color-unaware applications will typically ignore all four methods.

Table1 is a list of chunk types that provide color space information, each with an associated Priority number. If a single image contains more than one of these chunk types, the chunk with the lowest Priority number should take precedence and any higher-numbered chunk types should be ignored.

Table1 Color Chunk Priority
Chunk Type	Priority
cICP	1
iCCP	2
sRGB	3
cHRM andgAMA	4

Gamma correction is not applied to the alpha channel, if present. Alpha samples are always full-range and represent a linear fraction of full opacity.

Mastering metadata may also be provided.

4.4Reference image to PNG image transformation

Introduction

A number of transformations are applied to the reference image to create the PNG image to be encoded (seeFigure3). The transformations are applied in the following sequence, where square brackets mean the transformation is optional:

[alpha separation]indexing or ([RGB merging][alpha compaction] )sample depth scaling

When every pixel is either fully transparent or fully opaque, thealpha separation,alpha compaction, andindexing transformations can cause the recovered reference image to have an alpha sample depth different from the original reference image, or to have no alpha channel. This has no effect on the degree of opacity of any pixel. The two reference images are considered equivalent, and the transformations are considered lossless. Encoders that nevertheless wish to preserve the alpha sample depth may elect not to perform transformations that would alter the alpha sample depth.

Figure3 Reference image to PNG image transformation

4.4.1Alpha separation

If all alpha samples in a reference image have the maximum value, then the alpha channel may be omitted, resulting in an equivalent image that can be encoded more compactly.

4.4.2Indexing

If the number of distinct pixel values is 256 or less, and the RGB sample depths are not greater than 8, and the alpha channel is absent or exactly 8 bits deep or every pixel is either fully transparent or fully opaque, then the alternativeindexed-color representation, achieved through anindexing transformation, may be more efficient for encoding. In theindexed-color representation, each pixel is replaced by an index into a palette. Thepalette is a list of entries each containing three 8-bit samples (red, green, blue). If an alpha channel is present, there is also a parallel table of 8-bit alpha samples, called thealpha table.

Figure4 Indexed-color image

A suggested palette or palettes may be constructed even when the PNG image is notindexed-color in order to assist viewers that are capable of displaying only a limited number of colors.

Forindexed-color images, encoders can rearrange the palette so that the table entries with the maximum alpha value are grouped at the end. In this case the table can be encoded in a shortened form that does not include these entries.

Encoders creating indexed-color PNG must not insert index values greater than the actual length of the palette table; to do so is an error, and decoders will vary in their handling of this error.

4.4.3RGB merging

If the red, green, and blue channels have the same sample depth, and, for each pixel, the values of the red, green, and blue samples are equal, then these three channels may be merged into a single greyscale channel.

4.4.4Alpha compaction

For non-indexed images, if there exists an RGB (or greyscale) value such that all pixels with that value are fully transparent while all other pixels are fully opaque, then the alpha channel can be represented more compactly by merely identifying the RGB (or greyscale) value that is transparent.

4.4.5Sample depth scaling

In the PNG image, not all sample depths are supported (see6.1Color types and values), and all channels shall have the same sample depth. All channels of the PNG image use the smallest allowable sample depth that is not less than any sample depth in the reference image, and the possible sample values in the reference image are linearly mapped into the next allowable range for the PNG image.Figure5 shows how samples of depth 3 might be mapped into samples of depth 4.

Figure5 Scaling sample values

Allowing only a few sample depths reduces the number of cases that decoders have to cope with.Sample depth scaling is reversible with no loss of data, because the reference image sample depths can be recorded in the PNG datastream. In the absence of recorded sample depths, the reference image sample depth equals the PNG image sample depth. See12.4Sample depth scaling and13.12Sample depth rescaling.

Figure6 Possible PNG image pixel types

4.5PNG image

The transformation of the reference image results in one of five types of PNG image (seeFigure6) :

Truecolor with alpha: Eachpixel consists of foursamples: red, green, blue andalpha.
Greyscale with alpha: Eachpixel consists of twosamples: agrey sample and analpha sample.
Truecolor: Eachpixel consists of a triplet ofsamples: red, green, blue. An optional alpha channel can be specified as a single triplet of red, green, bluesamples:pixels of the image whose red, green, bluesamples are identical to the red, green, bluesamples of the alpha channel are fully transparent; others are fully opaque. If the alpha channel is not present, allpixels are fully opaque.
Greyscale: Each pixel consists of a singlegrey sample, which represents overallluminance (on a scale from black to white). An optional alpha channel can be specified as a singlegrey sample:pixels of the image whosegrey sample is identical to thegrey sample of the alpha channel are fully transparent; others are fully opaque. If the alpha channel is not present, allpixels are fully opaque.
Indexed-color: Each pixel consists of an index into a palette (and into an associated table of alpha values, if present).

The format of each pixel depends on the PNG image type and the bit depth. For PNG image types other than indexed-color, the bit depth specifies the number of bits per sample, not the total number of bits per pixel. Forindexed-color images, the bit depth specifies the number of bits in each palette index, not the sample depth of the colors in the palette or alpha table. Within the pixel the samples appear in the following order, depending on the PNG image type.

Truecolor with alpha: red, green, blue, alpha.
Greyscale with alpha: grey, alpha.
Truecolor: red, green, blue.
Greyscale: grey.
Indexed-color: palette index.

4.6Encoding the PNG image

Introduction

A conceptual model of the process of encoding a PNG image is given inFigure7. The steps refer to the operations on the array of pixels or indices in the PNG image. Thepalette andalpha table are not encoded in this way.

Pass extraction: to allow for progressive display, the PNG image pixels can be rearranged to form several smaller images called reduced images or passes.
Scanline serialization: the image is serialized a scanline at a time. Pixels are ordered left to right in a scanline and scanlines are ordered top to bottom.
Filtering: each scanline is transformed into a filtered scanline using one of the defined filter types to prepare the scanline for image compression.
Compression: occurs on all the filtered scanlines in the image.
Chunking: the compressed image is divided into conveniently sized chunks. An error detection code is added to each chunk.
Datastream construction: the chunks are inserted into the datastream.

4.6.1Pass extraction

Pass extraction (seeFigure7) splits a PNG image into a sequence ofreduced images where the first image defines a coarse view and subsequent images enhance this coarse view until the last image completes the PNG image. The set of reduced images is also called an interlaced PNG image. Two interlace methods are defined in this specification. The first method is a null method; pixels are stored sequentially from left to right and scanlines from top to bottom. The second method makes multiple scans over the image to produce a sequence of seven reduced images. The seven passes for a sample image are illustrated inFigure7. See8.Interlacing and pass extraction.

Figure7 Encoding the PNG image

Figure8 Pass extraction

4.6.2Scanline serialization

Each row of pixels, called a scanline, is represented as a sequence of bytes.

4.6.3Filtering

PNG allowsimage data to be filtered before it is compressed. Filtering can improve the compressibility of the data. The filter operation is deterministic, reversible, and lossless. This allows the decompressed data to be reverse-filtered in order to obtain the original data. See7.3Filtering.

4.6.4Compression

The sequence of filtered scanlines in the pass or passes of the PNG image is compressed (seeFigure9) by one of the defined compression methods. The concatenated filtered scanlines form the input to the compression stage. The output from the compression stage is a single compressed datastream. See10.Compression.

4.6.5Chunking

Chunking provides a convenient breakdown of the compressed datastream into manageable chunks (seeFigure9). Each chunk has its own redundancy check. See11.Chunk specifications.

Figure9 Compression and chunking

4.7Additional information

Ancillary information may be associated with an image. Decoders may ignore all or some of the ancillary information. The types of ancillary information provided are described inTable2.

Table2 Types of ancillary information
Type of information	Description
Animation information	An animated image, defined as a series of frames with associated timing, position and handling information, to be displayed if the viewer is capable of doing so. For other cases such as printers, thestatic image will be displayed instead.
Background color	Solid background color to be used when presenting the image if no better option is available.
Coding-independent code points	Identifies the color space by enumerating metadata such as thetransfer function and color primaries. Originally forSDR andHDR video, also used for still and animated images.
Content Light Level Information	Luminance of the brightest pixel in the image (or image sequence) and the average luminance level of the brightest frame in the sequence.
EXIF information	Exchangeable image file format metadata such as shutter speed, aperture, and orientation
Gamma and chromaticity	Gamma value of the image with respect to the desired output intensity, andchromaticity characteristics of the RGB values used in the image.
ICC profile	Description of the color space (in the form of an International Color Consortium (ICC) profile) to which the samples in the image conform.
Image histogram	Estimates of how frequently the image uses each palette entry.
Mastering Display Color Volume	Describes the absolute three-dimensional color gamut volume of the display used to prepare the content, including the lightest and darkest colors the mastering display can reproduce. This helps to present the image on the display device.
Physical pixel dimensions	Intended pixel size and aspect ratio to be used in presenting the PNG image.
Significant bits	The number of bits that are significant in the samples.
sRGB color space	A rendering intent (as defined by the International Color Consortium) and an indication that the image samples conform to this color space.
Suggested palette	A reduced palette that may be used when the display device is not capable of displaying the full range of colors in the image.
Textual data	Textual information (which may be compressed) associated with the image.
Time	The time when the PNG image was last modified.
Transparency	Alpha information that allows the reference image to be reconstructed when the alpha channel is not retained in the PNG image.

4.8PNG datastream

4.8.1Chunks

The PNG datastream consists of a PNG signature (see5.2PNG signature) followed by a sequence of chunks (see11.Chunk specifications). Each chunk has a chunk type which specifies its function.

4.8.2Chunk types

Chunk types are four-byte sequences chosen so that they correspond to readable labels when interpreted in the ISO 646.IRV:1991 [ISO646] character set. The first four are termed critical chunks, which shall be understood and correctly interpreted according to the provisions of this specification. These are:

IHDR: image header, which is the first chunk in a PNG datastream.
PLTE: palette table associated with indexed PNG images.
IDAT: image data chunks.
IEND: image trailer, which is the last chunk in a PNG datastream.

The remaining chunk types are termedancillary chunk types, which encoders may generate and decoders may interpret.

Transparency information:tRNS (see11.3.1Transparency information).
Color space information:cHRM,gAMA,iCCP,sBIT,sRGB,cICP,mDCV (see11.3.2Color space information).
Textual information:iTXt,tEXt,zTXt (see11.3.3Textual information).
Miscellaneous information:bKGD,hIST,pHYs,sPLT,eXIf (see11.3.4Miscellaneous information).
Time information:tIME (see11.3.5Time stamp information).
Animation information:acTL,fcTL,fdAT (see11.3.6Animation information).

4.9APNG: frame-based animation

Introduction

Animated PNG (APNG) extends the original, static-only PNG format, adding support for frame-based animated images. It is intended to be a replacement for simple animated images that have traditionally used the GIF format [GIF], while adding support for 24-bit images and 8-bit transparency, which GIF lacks.

APNG is backwards-compatible with earlier versions of PNG; a non-animated PNG decoder will ignore the ancillary APNG-specific chunks and display the static image.

4.9.1Structure

AnAPNG stream is a normal PNG stream as defined in previous versions of the PNG Specification, with three additional chunk types describing the animation and providing additional frame data.

To be recognized as an APNG, an acTL chunk must appear in the stream before anyIDAT chunks. TheacTL structure isdescribed below.

Conceptually, at the beginning of each play theoutput buffer shall be completely initialized to afully transparent black rectangle, with width and height dimensions from theIHDR chunk.

The static image may be included as the first frame of the animation by the presence of a singlefcTL chunk beforeIDAT. Otherwise, the static image is not part of the animation.

Subsequent frames are encoded infdAT chunks, which have the same structure asIDAT chunks, except preceded by asequence number. Information for each frame about placement and rendering is stored infcTL chunks. The full layout offdAT andfcTL chunks isdescribed below.

The boundaries of the entire animation are specified by the width and height parameters of theIHDR chunk, regardless of whether the default image is part of the animation. The default image should be appropriately padded withfully transparent black pixels if extra space will be needed for later frames.

Each frame is identical for each play, therefore it is safe for applications to cache the frames.

4.9.2Sequence numbers

ThefcTL andfdAT chunks have a zero-based, 4 byte sequence number. Both chunk types share the sequence. The purpose of this number is to detect (and optionally correct) sequence errors in an Animated PNG, since this specification does not impose ordering restrictions on ancillary chunks.

The firstfcTL chunk shall contain sequence number 0, and the sequence numbers in the remainingfcTL andfdAT chunks shall be in ascending order, with no gaps or duplicates.

The tables below illustrate the use of sequence numbers for images with more than one frame, and more than onefdAT chunk for the second frame. (IHDR andIEND chunks omitted in these tables, for clarity).

Table3 If the static image is also the first frame
Sequence number	Chunk
(none)	acTL
0	fcTL first frame
(none)	IDAT first frame / static image
1	fcTL second frame
2	firstfdAT for second frame
3	secondfdAT for second frame

Table4 If the static image is not part of the animation
Sequence number	Chunk
(none)	acTL
(none)	IDAT static image
0	fcTL first frame
1	firstfdAT for first frame
2	secondfdAT for first frame
3	fcTL second frame
4	firstfdAT for second frame
5	secondfdAT for second frame

4.9.3Output buffer

Theoutput buffer is a pixel array with dimensions specified by the width and height parameters of the PNGIHDR chunk. Conceptually, each frame is constructed in the output buffer before beingcomposited onto thecanvas. The contents of the output buffer are available to the decoder. The corners of the output buffer are mapped to the corners of thecanvas.

4.9.4Canvas

Thecanvas is the area on the output device on which the frames are to be displayed. The contents of the canvas are not necessarily available to the decoder. If abKGD chunk exists, it may be used to fill the canvas if there is no preferable background.

4.10Error handling

Errors in a PNG datastream fall into two general classes:

transmission errors or damage to a computer file system, which tend to corrupt much or all of the datastream;
syntax errors, which appear as invalid values in chunks, or as missing or misplaced chunks. Syntax errors can be caused not only by encoding mistakes, but also by the use of registered or private values, if those values are unknown to the decoder.

PNG decoders should detect errors as early as possible, recover from errors whenever possible, and fail gracefully otherwise. The error handling philosophy is described in detail in13.1Error handling.

4.11Extensions

This section is non-normative.

The PNG format exposes several extension points:

Some of these extension points are reserved byW3C, while others are available for private use.

5.Datastream structure

5.1PNG datastream

ThePNG datastream consists of a PNG signature followed by a sequence of chunks. It is the result of encoding aPNG image.

Note

The term datastream is used rather than "file" to describe a byte sequence that may be only a portion of a file. It is also used to emphasize that the sequence of bytes might be generated and consumed "on the fly", never appearing in a stored file at all.

5.2PNG signature

The first eight bytes of a PNG datastream always contain the following hexadecimal values:

89 50 4E 47 0D 0A 1A 0A

This signature indicates that the remainder of the datastream contains a single PNG image, consisting of a series of chunks beginning with anIHDR chunk and ending with anIEND chunk.

This signature differentiates a PNG datastream from other types ofdatastream and allows early detection of some transmission errors.

5.3Chunk layout

Eachchunk consists of three or four fields (seeFigure10). The meaning of the fields is described inTable5. The chunk data field may be empty.

Figure10 Chunk parts

Table5 Chunk fields
Name	Description
Length	APNG four-byte unsigned integer giving the number of bytes in the chunk's data field. The length countsonly the data field,not itself, the chunk type, or theCRC. Zero is a valid length. Although encoders and decoders should treat the length as unsigned, its value shall not exceed 2³¹-1 bytes.
Chunk Type	A sequence of four bytes defining the chunk type. Each byte of a chunk type is restricted to the hexadecimal values 41 to 5A and 61 to 7A. These correspond to the uppercase and lowercase ISO 646 [ISO646] letters (`A`-`Z` and`a`-`z`) respectively for convenience in description and examination of PNG datastreams. Encoders and decoders shall treat the chunk types as fixed binary values, not character strings. For example, it would not be correct to represent the chunk typeIDAT by the equivalents of those letters in the UCS 2 character set. Additional naming conventions for chunk types are discussed in5.4Chunk naming conventions.
Chunk Data	The data bytes appropriate to the chunk type, if any.This field can be of zero length.
CRC	A four-byteCRC calculated on the preceding bytes in the chunk, including the chunk type field and chunk data fields, butnot including the length field. The CRC can be used to check for corruption of the data. The CRC is always present, even for chunks containing no data. See 5.5 CRC algorithm.

The chunk data length may be any number of bytes up to the maximum; therefore, implementors cannot assume that chunks are aligned on any boundaries larger than bytes.

5.4Chunk naming conventions

Chunk types are chosen to be meaningful names when the bytes of the chunk type are interpreted as ISO 646 letters [ISO646]. Chunk types are assigned so that a decoder can determine some properties of a chunk even when the type is not recognized. These rules allow safe, flexible extension of the PNG format, by allowing a PNG decoder to decide what to do when it encounters an unknown chunk.

The naming rules are normally of interest only when the decoder does not recognize the chunk's type, as specified at13.PNG decoders and viewers.

Four bits of the chunk type, the property bits, namely bit 5 (value 32) of each byte, are used to convey chunk properties. This choice means that a human can read off the assigned properties according to whether the letter corresponding to each byte of the chunk type is uppercase (bit 5 is 0) or lowercase (bit 5 is 1).

The property bits are an inherent part of the chunk type, and hence are fixed for any chunk type. Thus,CHNK andcHNk would be unrelated chunk types, not the same chunk with different properties.

The semantics of the property bits are defined inTable6.

Table6 Semantics of property bits
Name & location	Definition	Description
Ancillary bit: first byte	0 (uppercase) = critical, 1 (lowercase) = ancillary.	Critical chunks are necessary for successful display of the contents of the datastream, for example the image header chunk (IHDR). A decoder trying to extract the image, upon encountering an unknown chunk type in which the ancillary bit is 0, shall indicate to the user that the image contains information it cannot safely interpret. Ancillary chunks are not strictly necessary in order to meaningfully display the contents of the datastream, for example the time chunk (tIME). A decoder encountering an unknown chunk type in which the ancillary bit is 1 can safely ignore the chunk and proceed to display the image.
Private bit: second byte	0 (uppercase) = public, 1 (lowercase) = private.	Public chunks are reserved for definition by theW3C. The definition of private chunks is specified at 12.10.1Use of private chunks. The names of private chunks have a lowercase second letter, while the names of public chunks have uppercase second letters.
Reserved bit: third byte	0 (uppercase) in this version of PNG. If the reserved bit is 1, the datastream does not conform to this version of PNG.	The significance of the case of the third letter of the chunk name is reserved for possible future extension. In this International Standard, all chunk names shall have uppercase third letters.
Safe-to-copy bit: fourth byte	0 (uppercase) = unsafe to copy, 1 (lowercase) = safe to copy.	This property bit is not of interest to pure decoders, but it is needed byPNG editors. This bit defines the proper handling of unrecognized chunks in a datastream that is being modified. Rules forPNG editors are discussed further in14.2Behavior of PNG editors.

The hypothetical chunk type "cHNk" has the property bits:

cHNk  <--32 bit chunk type representedin text form|||||||+-Safe-to-copy bit is1 (lowercase letter; bit5 is1)||+--Reserved bit is0     (uppercase letter; bit5 is0)|+---Private bit is0      (uppercase letter; bit5 is0)+----Ancillary bit is1    (lowercase letter; bit5 is1)

Therefore, this name represents an ancillary, public, safe-to-copy chunk.

5.5CRC algorithm

CRC fields are calculated using standardized CRC methods with pre and post conditioning, as defined by [ISO-3309] and [ITU-T-V.42]. TheCRC polynomial employed— which is identical to that used in the GZIP file format specification [RFC1952]— is

x³² + x²⁶ + x²³ + x²² + x¹⁶ + x¹² + x¹¹ + x¹⁰ + x⁸ + x⁷ + x⁵ + x⁴ + x² + x + 1

In PNG, the 32-bitCRC is initialized to all 1's, and then the data from each byte is processed from the least significant bit (1) to the most significant bit (128). After all the data bytes are processed, the CRC is inverted (its ones complement is taken). This value is transmitted (stored in the datastream)MSB first. For the purpose of separating into bytes and ordering, the least significant bit of the 32-bit CRC is defined to be the coefficient of thex³¹ term.

Practical calculation of the CRC often employs a precalculated table to accelerate the computation. See D.Sample CRC implementation.

5.6Chunk ordering

The constraints on the positioning of the individual chunks are listed inTable7 and illustrated diagrammatically for static images inFigure11 andFigure12, for animated images where the static image forms the first frame inFigure13 andFigure14, and for animated images where the static image is not part of the animation inFigure15 andFigure16. These lattice diagrams represent the constraints on positioning imposed by this specification. The lines in the diagrams define partial ordering relationships. Chunks higher up shall appear before chunks lower down. Chunks which are horizontally aligned and appear between two other chunk types (higher and lower than the horizontally aligned chunks) may appear in any order between the two higher and lower chunk types to which they are connected. The superscript associated with the chunk type is defined inTable8. It indicates whether the chunk is mandatory, optional, or may appear more than once. A vertical bar between two chunk types indicates alternatives.

Table7 Chunk ordering rules
Critical chunks (shall appear in this order, except PLTE is optional)
Chunk name	Multiple allowed	Ordering constraints
IHDR	No	Shall be first
PLTE	No	Before firstIDAT
IDAT	Yes	MultipleIDAT chunks shall be consecutive
IEND	No	Shall be last
Ancillary chunks (need not appear in this order)
Chunk name	Multiple allowed	Ordering constraints
acTL	No	BeforeIDAT
cHRM	No	BeforePLTE andIDAT
cICP	No	BeforePLTE andIDAT
gAMA	No	BeforePLTE andIDAT
iCCP	No	BeforePLTE andIDAT. If theiCCP chunk is present, thesRGB chunk should not be present.
mDCV	No	BeforePLTE andIDAT.
cLLI	No	BeforePLTE andIDAT.
sBIT	No	BeforePLTE andIDAT
sRGB	No	BeforePLTE andIDAT. If thesRGB chunk is present, theiCCP chunk should not be present.
bKGD	No	AfterPLTE; beforeIDAT
hIST	No	AfterPLTE; beforeIDAT
tRNS	No	AfterPLTE; beforeIDAT
eXIf	No	BeforeIDAT
fcTL	Yes	One may occur beforeIDAT; all others shall be afterIDAT
pHYs	No	BeforeIDAT
sPLT	Yes	BeforeIDAT
fdAT	Yes	AfterIDAT
tIME	No	None
iTXt	Yes	None
tEXt	Yes	None
zTXt	Yes	None

Table8 Meaning of symbols used in lattice diagrams
Symbol	Meaning
+	One or more
1	Only one
?	Zero or one
*	Zero or more
\|	Alternative

Figure11 Lattice diagram: Static PNG images withPLTE

Figure12 Lattice diagram: Static PNG images withoutPLTE

Figure13 Lattice diagram: Animated PNG images withPLTE, static image forms the first frame

Figure14 Lattice diagram: Animated PNG images withoutPLTE, static image forms the first frame

Figure15 Lattice diagram: Animated PNG images withPLTE, static image not part of animation

Figure16 Lattice diagram: Animated PNG images withoutPLTE, static image not part of animation

5.7Defining chunks

5.7.1General

All chunks, private and public,SHOULD be listed at [PNG-EXTENSIONS].

5.7.2Defining public chunks

Public chunks are reserved for definition by theW3C.

Public chunks are intended for broad use consistent with the philosophy of PNG.

Organizations and applications are encouraged to submit any chunk that meet the criteria above for definition as a public chunk by the PNG Working Group.

The definition as a public chunk is neither automatic nor immediate. A proposed public chunk typeSHALL not be used in publicly available software or datastreams until defined as such.

The definition of new critical chunk types is discouraged unless necessary.

5.7.3Defining private chunks

Organizations and applicationsMAY define private chunks for private and experimental use.

A private chunkSHOULD NOT be defined merely to carry textual information of interest to a human user. InsteadiTXt chunkSHOULD BE used and corresponding keywordSHOULD BE used and a suitable keyword defined.

Listing private chunks at [PNG-EXTENSIONS] reduces, but does not eliminate, the chance that the same private chunk is used for incompatible purposes by different applications. If a private chunk type is used, additional identifying informationSHOULD BE be stored at the beginning of the chunk data to further reduce the risk of conflicts.

An ancillary chunk type, not a critical chunk type,SHOULD be used for all private chunks that store information that is not absolutely essential to view the image.

Private critical chunksSHOULD NOT be defined because PNG datastreams containing such chunks are not portable, andSHOULD NOT be used in publicly available software or datastreams. If a private critical chunk is essential for an application, itSHOULD appear near the start of the datastream, so that a standard decoder need not read very far before discovering that it cannot handle the datastream.

SeeB.Guidelines for private chunk types for additional guidelines on defining private chunks.

5.8Private field values

Values greater than or equal to 128 in the following fields areprivate field values:

Theseprivate field values are neither defined nor reserved by this specification.

Private field valuesMAY be used for experimental or private semantics.

Private field valuesSHOULD NOT appear in publicly available software or datastreams since they can result in datastreams that are unreadable by PNG decoders as detailed at13.PNG decoders and viewers.

6.Reference image to PNG image transformation

6.1Color types and values

As explained in4.5PNG image there are five types of PNG image. Corresponding to each type is acolor type, which is the sum of the following values: 1 (palette used), 2 (truecolor used) and 4 (alpha used).greyscale andtruecolor images may have an explicit alpha channel. The PNG image types and correspondingcolor types are listed inTable9.

Table9 PNG image types and color types
PNG image type	Color type
Greyscale	0
Truecolor	2
Indexed-color	3
Greyscale with alpha	4
Truecolor with alpha	6

The allowed bit depths and sample depths for each PNG image type are listed inImage header.

Greyscale samples represent luminance if the transfer curve is indicated (bygAMA,sRGB,iCCP) orcICP; or device-dependent greyscale if not. RGB samples represent calibrated color information if the color space is indicated (bygAMA andcHRM,sRGB,iCCP, orcICP; or uncalibrated device-dependent color if not.

Sample values are not necessarily proportional to light intensity; thegAMA chunk specifies the relationship between sample values and display output intensity. Viewers are strongly encouraged to compensate properly. See4.3Color spaces,13.13Decoder gamma handling andC.Gamma and chromaticity.

6.2Alpha representation

In a PNG datastream transparency may be represented in one of four ways, depending on the PNG image type (see4.4.1Alpha separation and4.4.4Alpha compaction).

Truecolor with alpha,greyscale with alpha: an alpha channel is part of the image array.
truecolor,greyscale: AtRNS chunk contains a single pixel value distinguishing the fully transparent pixels from the fully opaque pixels.
Indexed-color: AtRNS chunk contains the alpha table that associates an alpha sample with each palette entry.
truecolor,greyscale,indexed-color: there is notRNS chunk present and all pixels are fully opaque.

An alpha channel included in the image array has 8-bit or 16-bit samples, the same size as the other samples. The alpha sample for each pixel is stored immediately following the greyscale or RGB samples of the pixel. An alpha value of zero represents full transparency, and a value of 2^sampledepth - 1 represents full opacity. Intermediate values indicate partially transparent pixels that can becomposited against a background image to yield the delivered image.

The color values in a pixel are not premultiplied by the alpha value assigned to the pixel. This rule is sometimes called "unassociated" or "non-premultiplied" alpha. (Another common technique is to store sample values premultiplied by the alpha value; in effect, such an image is alreadycomposited against a black background. PNG doesnot use premultiplied alpha. In consequence an image editor can take a PNG image and easily change its transparency.) See12.3Alpha channel creation and13.16Alpha channel processing.

7.Encoding the PNG image as a PNG datastream

7.1Integers and byte order

All integers that require more than one byte shall be innetwork byte order (as illustrated inFigure17 ): the most significant byte comes first, then the less significant bytes in descending order of significance (MSB LSB for two-byte integers,MSB B2 B1 LSB for four-byte integers). The highest bit (value 128) of a byte is numbered bit 7; the lowest bit (value 1) is numbered bit 0. Values are unsigned unless otherwise noted. Values explicitly noted as signed are represented in two's complement notation.

PNG four-byte unsigned integers are limited to the range 0 to 2³¹-1 to accommodate languages that have difficulty with unsigned four-byte values.

Figure17 Integer representation in PNG

7.2Scanlines

A PNG image (or pass, see8.Interlacing and pass extraction) is a rectangular pixel array, with pixels appearing left-to-right within each scanline, and scanlines appearing top-to-bottom. The size of each pixel is determined by the number of bits per pixel.

Pixels within a scanline are always packed into a sequence of bytes with no wasted bits between pixels. Scanlines always begin on byte boundaries. Permitted bit depths andcolor types are restricted so that in all cases the packing is simple and efficient.

In PNG images ofcolor type 0 (greyscale) each pixel is a single sample, which may have precision less than a byte (1, 2, or 4 bits). These samples are packed into bytes with the leftmost sample in the high-order bits of a byte followed by the other samples for the scanline.

In PNG images ofcolor type 3 (indexed-color) each pixel is a single palette index. These indices are packed into bytes in the same way as the samples forcolor type 0.

When there are multiple pixels per byte, some low-order bits of the last byte of a scanline may go unused. The contents of these unused bits are not specified.

PNG images that are notindexed-color images may have sample values with a bit depth of 16. Such sample values are innetwork byte order (MSB first,LSB second). PNG permits multi-sample pixels only with 8 and 16-bit samples, so multiple samples of a single pixel are never packed into one byte.

7.3Filtering

Afilter method is a transformation applied to an array ofscanlines with the aim of improving their compressibility.

PNG standardizes onefilter method and several filter types that may be used to prepareimage data for compression. It transforms the byte sequence into an equal length sequence of bytes preceded by a filter type byte (seeFigure18 for an example).

The encoder shall use only a singlefilter method for an interlaced PNG image, but may use different filter types for each scanline in a reduced image. An intelligent encoder can switch filters from one scanline to the next. The method for choosing which filter to employ is left to the encoder.

The filter type byte is not considered part of theimage data, but it is included in the datastream sent to the compression step. See9.Filtering.

Figure18 Serializing and filtering a scanline

8.Interlacing and pass extraction

Introduction

Pass extraction (seeFigure 4.8) splits a PNG image into a sequence of reduced images (the interlaced PNG image) where the first image defines a coarse view and subsequent images enhance this coarse view until the last image completes the PNG image. This allows progressive display of the interlaced PNG image by the decoder and allows images to "fade in" when they are being displayed on-the-fly. On average, interlacing slightly expands the datastream size, but it can give the user a meaningful display much more rapidly.

8.1Interlace methods

Two interlace methods are defined in this International Standard, methods 0 and 1. Other values of interlace method are reserved for future standardization.

With interlace method 0, the null method, pixels are extracted sequentially from left to right, and scanlines sequentially from top to bottom. The interlaced PNG image is a single reduced image.

Interlace method 1, known as Adam7, defines seven distinct passes over the image. Each pass transmits a subset of the pixels in the reference image. The pass in which each pixel is transmitted (numbered from 1 to 7) is defined by replicating the following 8-by-8 pattern over the entire image, starting at the upper left corner:

1 6 4 6 2 6 4 67 7 7 7 7 7 7 75 6 5 6 5 6 5 67 7 7 7 7 7 7 73 6 4 6 3 6 4 67 7 7 7 7 7 7 75 6 5 6 5 6 5 67 7 7 7 7 7 7 7

Figure 4.8 shows the seven passes of interlace method 1. Within each pass, the selected pixels are transmitted left to right within a scanline, and selected scanlines sequentially from top to bottom. For example, pass 2 contains pixels 4, 12, 20, etc. of scanlines 0, 8, 16, etc. (where scanline 0, pixel 0 is the upper left corner). The last pass contains all of scanlines 1, 3, 5, etc. The transmission order is defined so that all the scanlines transmitted in a pass will have the same number of pixels; this is necessary for proper application of some of the filters. The interlaced PNG image consists of a sequence of seven reduced images. For example, if the PNG image is 16 by 16 pixels, then the third pass will be a reduced image of two scanlines, each containing four pixels (seeFigure 4.8).

Scanlines that do not completely fill an integral number of bytes are padded as defined in7.2Scanlines.

Note

NOTE If the reference image contains fewer than five columns or fewer than five rows, some passes will be empty.

9.Filtering

9.1Filter methods and filter types

Filtering transforms the PNG image with the goal of improving compression. The overall process is depicted inFigure7 while the specifics of serializing and filtering a scanline are shown inFigure18.

PNG allows for a number offilter methods. All the reduced images in an interlaced image shall use a singlefilter method. Onlyfilter method 0 is defined by this specification. Otherfilter methods are reserved for future standardization.Filter method 0 provides a set of five filter types, and individual scanlines in each reduced image may use different filter types.

PNG imposes no additional restriction on which filter types can be applied to an interlaced PNG image. However, the filter types are not equally effective on all types of data. See12.7Filter selection.

Filtering transforms the byte sequence in a scanline to an equal length sequence of bytes preceded by the filter type. Filter type bytes are associated only with non-empty scanlines. No filter type bytes are present in an empty pass. See13.10Interlacing and progressive display.

9.2Filter types for filter method 0

Filters are applied tobytes, not to pixels, regardless of the bit depth orcolor type of the image. The filters operate on the byte sequence formed by a scanline that has been represented as described in7.2Scanlines. If the image includes an alpha channel, the alpha data is filtered in the same way as theimage data.

Filters may use the original values of the following bytes to generate the new byte value:

Table10 Named filter bytes
Name	Definition
`x`	the byte being filtered;
`a`	the byte corresponding to x in the pixel immediately before the pixel containing x (or the byte immediately before x, when the bit depth is less than 8);
`b`	the byte corresponding to x in the previous scanline;
`c`	the byte corresponding to b in the pixel immediately before the pixel containing b (or the byte immediately before b, when the bit depth is less than 8).

Figure19 Positions of filter bytes a, b and c relative to x

Figure19 shows the relative positions of the bytesx,a,b, andc.

Filter method 0 defines five basic filter types as listed inTable11.Orig(y) denotes the original (unfiltered) value of bytey.Filt(y) denotes the value after a filter type has been applied.Recon(y) denotes the value after the corresponding reconstruction function has been applied. The Paeth filter typePaethPredictor [Paeth] is defined below.

Filter method 0 specifies exactly this set of five filter types and this shall not be extended. This ensures that decoders need not decompress the data to determine whether it contains unsupported filter types: it is sufficient to check thefilter method in11.2.1IHDR Image header.

Table11 Filter types
Type	Name	Filter Function	Reconstruction Function
0	None	`Filt(x) = Orig(x)`	`Recon(x) = Filt(x)`
1	Sub	`Filt(x) = Orig(x) - Orig(a)`	`Recon(x) = Filt(x) + Recon(a)`
2	Up	`Filt(x) = Orig(x) - Orig(b)`	`Recon(x) = Filt(x) + Recon(b)`
3	Average	`Filt(x) = Orig(x) - floor((Orig(a) + Orig(b)) / 2)`	`Recon(x) = Filt(x) + floor((Recon(a) + Recon(b)) / 2)`
4	Paeth	`Filt(x) = Orig(x) - PaethPredictor(Orig(a), Orig(b), Orig(c))`	`Recon(x) = Filt(x) + PaethPredictor(Recon(a), Recon(b), Recon(c))`

For all filters, the bytes "to the left of" the first pixel in a scanline shall be treated as being zero. For filters that refer to the prior scanline, the entire prior scanline and bytes "to the left of" the first pixel in the prior scanline shall be treated as being zeroes for the first scanline of a reduced image.

To reverse the effect of a filter requires the decoded values of the prior pixel on the same scanline, the pixel immediately above the current pixel on the prior scanline, and the pixel just to the left of the pixel above.

Unsigned arithmetic modulo 256 is used, so that both the inputs and outputs fit into bytes. Filters are applied to each byte regardless of bit depth. The sequence ofFilt values is transmitted as the filtered scanline.

9.3Filter type 3: Average

The sumOrig(a) + Orig(b) shall be performed without overflow (using at least nine-bit arithmetic).floor() indicates that the result of the division is rounded to the next lower integer if fractional; in other words, it is an integer division or right shift operation.

9.4Filter type 4: Paeth

The Paeth filter type computes a simple linear function of the three neighboring pixels (left, above, upper left), then chooses as predictor the neighboring pixel closest to the computed value. The algorithm used in this specification is an adaptation of the technique due to Alan W. Paeth [Paeth].

The PaethPredictor function is defined in the code below. The logic of the function and the locations of the bytesa,b,c, andx are shown inFigure20.Pr is the predictor for bytex.

p = a + b - cpa =abs(p - a)pb =abs(p - b)pc =abs(p - c)if pa <= pb and pa <= pc thenPr = aelseif pb <= pc thenPr = belsePr = creturnPr

Figure20 The PaethPredictor function

The calculations within the PaethPredictor function shall be performed exactly, without overflow.

The order in which the comparisons are performed is critical and shall not be altered. The function tries to establish in which of the three directions (vertical, horizontal, or diagonal) the gradient of the image is smallest.

Exactly the same PaethPredictor function is used by both encoder and decoder.

10.Compression

10.1Compression method 0

Only PNG compression method 0 is defined by this International Standard. Other values of compression method are reserved for future standardization. PNG compression method 0 isdeflate compression with a sliding window (which is an upper bound on the distances appearing in thedeflate stream) of at most 32768 bytes.Deflate compression is derived fromLZ77.

Deflate-compressed datastreams within PNG are stored in thezlib format, which has the structure:

zlib compression method/flags code	1 byte
Additional flags/check bits	1 byte
Compressed data blocks	n bytes
Check value	4 bytes

zlib is specified at [rfc1950].

For PNG compression method 0, thezlib compression method/flags code shall specify method code 8 (deflate compression) and anLZ77 window size of not more than 32768 bytes. Thezlib compression method number is not the same as the PNG compression method number in theIHDR chunk. The additional flags shall not specify a preset dictionary.

If the data to be compressed contain 16384 bytes or fewer, the PNG encoder may set the window size by rounding up to a power of 2 (256 minimum). This decreases the memory required for both encoding and decoding, without adversely affecting the compression ratio.

The compressed data within thezlib datastream are stored as a series of blocks, each of which can represent raw (uncompressed) data,LZ77-compressed data encoded with fixed Huffman codes, orLZ77-compressed data encoded with custom Huffman codes. A marker bit in the final block identifies it as the last block, allowing the decoder to recognize the end of the compressed datastream. Further details on the compression algorithm and the encoding are given in thedeflate specification [rfc1951].

The check value stored at the end of thezlib datastream is calculated on the uncompressed data represented by the datastream. The algorithm used to calculate this is not the same as theCRC calculation used for PNG chunk CRC field values. The zlib check value is useful mainly as a cross-check that thedeflate algorithms are implemented correctly. Verifying the individual PNG chunk CRCs provides confidence that the PNG datastream has been transmitted undamaged.

10.2Compression of the sequence of filtered scanlines

The sequence of filtered scanlines is compressed and the resulting data stream is split intoIDAT chunks. The concatenation of the contents of all theIDAT chunks makes up azlib datastream. This datastream decompresses to filteredimage data.

It is important to emphasize that the boundaries betweenIDAT chunks are arbitrary and can fall anywhere in thezlib datastream. There is not necessarily any correlation betweenIDAT chunk boundaries anddeflate block boundaries or any other feature of thezlib data. For example, it is entirely possible for the terminatingzlib check value to be split acrossIDAT chunks.

Similarly, there is no required correlation between the structure of theimage data (i.e., scanline boundaries) anddeflate block boundaries orIDAT chunk boundaries. The complete filtered PNG image is represented by a singlezlib datastream that is stored in a number ofIDAT chunks.

10.3Other uses of compression

PNG also uses compression method 0 iniTXt,iCCP, andzTXt chunks. Unlike theimage data, such datastreams are not split across chunks; each such chunk contains an independentzlib datastream (see10.1Compression method 0).

11.Chunk specifications

11.1General

This clause defines chunk used in this specification.

11.2Critical chunks

Introduction

Acritical chunk is a chunk that is absolutely required in order to successfully decode a PNG image from a PNG datastream. Extension chunks may be defined as critical chunks (see14.Editors), though this practice is strongly discouraged.

A valid PNG datastream shall begin with a PNG signature, immediately followed by anIHDR chunk, then one or moreIDAT chunks, and shall end with anIEND chunk. Only oneIHDR chunk and oneIEND chunk are allowed in a PNG datastream.

11.2.1IHDR Image header

The four-byte chunk type field contains the hexadecimal values

49 48 44 52

TheIHDR chunk shall be the first chunk in the PNG datastream. It contains:

Width	4 bytes
Height	4 bytes
Bit depth	1 byte
Color type	1 byte
Compression method	1 byte
Filter method	1 byte
Interlace method	1 byte

Width and height give the image dimensions in pixels. They arePNG four-byte unsigned integers. Zero is an invalid value.

Bit depth is a single-byte integer giving the number of bits per sample or per palette index (not per pixel). Valid values are 1, 2, 4, 8, and 16, although not all values are allowed for allcolor types. See6.1Color types and values.

Color type is a single-byte integer.

Bit depth restrictions for eachcolor type are imposed to simplify implementations and to prohibit combinations that do not compress well. The allowed combinations are defined inTable12.

Table12 Allowed combinations ofcolor type and bit depth
PNG image type	Color type	Allowed bit depths	Interpretation
Greyscale	0	1, 2, 4, 8, 16	Each pixel is a greyscale sample
Truecolor	2	8, 16	Each pixel is an R,G,B triple
Indexed-color	3	1, 2, 4, 8	Each pixel is a palette index; aPLTE chunk shall appear.
Greyscale with alpha	4	8, 16	Each pixel is a greyscale sample followed by an alpha sample.
Truecolor with alpha	6	8, 16	Each pixel is an R,G,B triple followed by an alpha sample.

The sample depth is the same as the bit depth except in the case ofindexed-color PNG images (color type 3), in which the sample depth is always 8 bits (see4.5PNG image).

Compression method is a single-byte integer that indicates the method used to compress theimage data. Only compression method 0 (deflate compression with a sliding window of at most 32768 bytes) is defined in this specification. All conforming PNG images shall be compressed with this scheme.

Filter method is a single-byte integer that indicates the preprocessing method applied to theimage data before compression. Onlyfilter method 0 (adaptive filtering with five basic filter types) is defined in this specification. See9.Filtering for details.

Interlace method is a single-byte integer that indicates the transmission order of theimage data. Two values are defined in this specification: 0 (no interlace) or 1 (Adam7 interlace). See8.Interlacing and pass extraction for details.

11.2.2PLTE Palette

The four-byte chunk type field contains the hexadecimal values

50 4C 54 45

ThePLTE chunk contains from 1 to 256 palette entries, each a three-byte series of the form:

Red	1 byte
Green	1 byte
Blue	1 byte

The number of entries is determined from the chunk length. A chunk length not divisible by 3 is an error.

This chunk shall appear forcolor type 3, and may appear forcolor types 2 and 6; it shall not appear forcolor types 0 and 4. There shall not be more than onePLTE chunk.

Forcolor type 3 (indexed-color), thePLTE chunk is required. The first entry inPLTE is referenced by pixel value 0, the second by pixel value 1, etc. The number of palette entries shall not exceed the range that can be represented in the image bit depth (for example, 2⁴ = 16 for a bit depth of 4). It is permissible to have fewer entries than the bit depth would allow. In that case, any out-of-range pixel value found in theimage data is an error.

Forcolor types 2 and 6 (truecolor andtruecolor with alpha), thePLTE chunk is optional. If present, it provides a suggested set of colors (from 1 to 256) to which thetruecolor image can be quantized if it cannot be displayed directly. It is, however, recommended that thesPLT chunk be used for this purpose, rather than thePLTE chunk. If neitherPLTE norsPLT chunks are present and the image cannot be displayed directly, quantization has to be done by the viewing system. However, it is often preferable for the selection of colors to be done once by the PNG encoder. (See12.5Suggested palettes.)

Note that the palette uses 8 bits (1 byte) per sample regardless of the image bit depth. In particular, the palette is 8 bits deep even when it is a suggested quantization of a 16-bittruecolor image.

There is no requirement that the palette entries all be used by the image, nor that they all be different.

11.2.3IDAT Image data

The four-byte chunk type field contains the hexadecimal values

49 44 41 54

TheIDAT chunk contains the actualimage data which is the output stream of the compression algorithm. See9.Filtering and10.Compression for details.

There may be multipleIDAT chunks; if so, they shall appear consecutively with no other intervening chunks. The compressed datastream is then the concatenation of the contents of the data fields of all theIDAT chunks (noting that data fieldsmay be of zero length).

Some images have unused trailing bytes at the end of the final IDAT chunk. This could happen when an entire buffer is stored rather than just the portion of the buffer which is used. This is undesirable. Preferably, an encoder would not include these unused bytes. If it must, setting the bytes to zero will prevent accidental data sharing. A decoder should ignore these trailing bytes.

11.2.4IEND Image trailer

The four-byte chunk type field contains the hexadecimal values

49 45 4E 44

TheIEND chunk marks the end of the PNG datastream. The chunk's data field is empty.

11.3Ancillary chunks

Introduction

The ancillary chunks defined in this specification are listed in the order in4.8.2Chunk types. This is not the order in which they appear in a PNG datastream. Ancillary chunks may be ignored by a decoder. For each ancillary chunk, the actions described are under the assumption that the decoder is not ignoring the chunk.

11.3.1Transparency information

11.3.1.1tRNS Transparency

The four-byte chunk type field contains the hexadecimal values

74 52 4E 53

ThetRNS chunk specifies either alpha values that are associated with palette entries (forindexed-color images) or a single transparent color (forgreyscale andtruecolor images). ThetRNS chunk contains:

Color type 0
Grey sample value	2 bytes
Color type 2
Red sample value	2 bytes
Green sample value	2 bytes
Blue sample value	2 bytes
Color type 3
Alpha for palette index 0	1 byte
Alpha for palette index 1	1 byte
...etc...	1 byte

Forcolor type 3 (indexed-color), thetRNS chunk contains a series of one-byte alpha values, corresponding to entries in thePLTE chunk. Each entry indicates that pixels of the corresponding palette index shall be treated as having the specified alpha value. Alpha values have the same interpretation as in an 8-bit full alpha channel: 0 is fully transparent, 255 is fully opaque, regardless of image bit depth. ThetRNS chunk shall not contain more alpha values than there are palette entries, but atRNS chunk may contain fewer values than there are palette entries. In this case, the alpha value for all remaining palette entries is assumed to be 255. In the common case in which only palette index 0 need be made transparent, only a one-bytetRNS chunk is needed, and when all palette indices are opaque, thetRNS chunk may be omitted.

Forcolor types 0 or 2, two bytes per sample are used regardless of the image bit depth (see7.1Integers and byte order). Pixels of the specified grey sample value or RGB sample values are treated as transparent (equivalent to alpha value 0); all other pixels are to be treated as fully opaque (alpha value 2^bitdepth-1). If the image bit depth is less than 16, the least significant bits are used. Encoders should set the other bits to 0, and decoders must mask the other bits to 0 before the value is used.

AtRNS chunk shall not appear forcolor types 4 and 6, since a full alpha channel is already present in those cases.

Note

NOTE For 16-bitgreyscale ortruecolor data, as described in13.12Sample depth rescaling, onlypixels matching the entire 16-bit values intRNS chunks are transparent. Decoders have to postpone any sample depth rescaling until after the pixels have been tested for transparency.

11.3.2Color space information

11.3.2.1cHRM Primary chromaticities and white point

The four-byte chunk type field contains the hexadecimal values

63 48 52 4D

ThecHRM chunk may be used to specify the 1931 CIEx,y chromaticities of the red, green, and blue display primaries used in thePNG image, and the referencedwhite point. SeeC.Gamma and chromaticity for more information. TheiCCP, andsRGB chunks provide more sophisticated support for color management and control.

ThecHRM chunk contains:

Table13 cHRM chunk components
Name	Size
White point x	4 bytes
White point y	4 bytes
Red x	4 bytes
Red y	4 bytes
Green x	4 bytes
Green y	4 bytes
Blue x	4 bytes
Blue y	4 bytes

Each value is encoded as aPNG four-byte unsigned integer, representing thex ory value times 100000.

A value of 0.3127 would be stored as the integer 31270.

ThecHRM chunk is allowed in all PNG datastreams, although it is of little value forgreyscale images.

This chunk is ignored unless it is thehighest-precedence color chunk understood by the decoder.

11.3.2.2gAMA Image gamma

The four-byte chunk type field contains the hexadecimal values

67 41 4D 41

ThegAMA chunk specifies agamma value.

In fact specifying the desired display output intensity is insufficient. It is also necessary to specify the viewing conditions under which the output is desired. ForgAMA these are the reference viewing conditions of the sRGB specification [SRGB]. Adjustment for different viewing conditions is normally handled by a Color Management System. If the adjustment is not performed, the error is usually small. Applications desiring high color fidelity may wish to use ansRGB,iCCP chunk.

ThegAMA chunk contains:

Image gamma

4 bytes

The value is encoded as aPNG four-byte unsigned integer, representing thegamma value times 100000.

Agamma value of 1/2.2 would be stored as the integer 45455.

See12.1Encoder gamma handling and13.13Decoder gamma handling for more information.

This chunk is ignored unless it is thehighest-precedence color chunk understood by the decoder.

11.3.2.3iCCP Embedded ICC profile

The four-byte chunk type field contains the hexadecimal values

69 43 43 50

TheiCCP chunk contains:

Profile name	1-79 bytes (character string)
Null separator	1 byte (null character)
Compression method	1 byte
Compressed profile	n bytes

The profile name may be any convenient name for referring to the profile. It is case-sensitive. Profile names shall contain only printable Latin-1 characters and spaces (only code points 0x20-7E and 0xA1-FF are allowed). Leading, trailing, and consecutive spaces are not permitted. The only compression method defined in this specification is method 0 (zlib datastream withdeflate compression, see10.3Other uses of compression). The compression method entry is followed by a compressed profile that makes up the remainder of the chunk. Decompression of this datastream yields the embedded ICC profile.

If theiCCP chunk is present, the image samples conform to the color space represented by the embedded ICC profile as defined by the International Color Consortium [ICC][ISO_15076-1]. The color space of the ICC profile shall be an RGB color space for color images (color types 2, 3, and 6), or a greyscale color space forgreyscale images (color types 0 and 4). A PNG encoder that writes theiCCP chunk is encouraged to also writegAMA andcHRM chunks that approximate the ICC profile, to provide compatibility with applications that do not use theiCCP chunk.

This chunk is ignored unless it is thehighest-precedence color chunk understood by the decoder.

Unless acICP chunk exists, a PNG datastream should contain at most one embedded profile, whether specified explicitly with aniCCP or implicitly with ansRGB chunk.

11.3.2.4sBIT Significant bits

The four-byte chunk type field contains the hexadecimal values

73 42 49 54

To simplify decoders, PNG specifies that only certain sample depths may be used, and further specifies that sample values should be scaled to the full range of possible values at the sample depth. ThesBIT chunk defines the original number of significant bits (which can be less than or equal to the sample depth). This allows PNG decoders to recover the original data losslessly even if the data had a sample depth not directly supported by PNG.

ThesBIT chunk contains:

Table14 sBIT chunk contents
Color type 0
significant greyscale bits	1 byte
Color types 2 and 3
significant red bits	1 byte
significant green bits	1 byte
significant blue bits	1 byte
Color type 4
significant greyscale bits	1 byte
significant alpha bits	1 byte
Color type 6
significant red bits	1 byte
significant green bits	1 byte
significant blue bits	1 byte
significant alpha bits	1 byte

Each depth specified insBIT shall be greater than zero and less than or equal to the sample depth (which is 8 forindexed-color images, and the bit depth given inIHDR for othercolor types). Note thatsBIT does not provide a sample depth for the alpha channel that is implied by atRNS chunk; in that case, all of the sample bits of the alpha channel are to be treated as significant. If thesBIT chunk is not present, then all of the sample bits of all channels are to be treated as significant.

11.3.2.5sRGB Standard RGB color space

The four-byte chunk type field contains the hexadecimal values

73 52 47 42

If thesRGB chunk is present, the image samples conform to the sRGB color space [SRGB] and should be displayed using the specified rendering intent defined by the International Color Consortium [ICC] or [ICC-2].

ThesRGB chunk contains:

Table15 sRGB chunk contents
Name	Size
Rendering intent	1 byte

The following values are defined for rendering intent:

Table16 Rendering intent values
Value	Name	Description
0	Perceptual	for images preferring good adaptation to the output device gamut at the expense of colorimetric accuracy, such as photographs.
1	Relative colorimetric	for images requiring color appearance matching (relative to the output devicewhite point), such as logos.
2	Saturation	for images preferring preservation of saturation at the expense of hue and lightness, such as charts and graphs.
3	Absolute colorimetric	for images requiring preservation of absolute colorimetry, such as previews of images destined for a different output device (proofs).

It is recommended that a PNG encoder that writes thesRGB chunk also write agAMA chunk (and optionally acHRM chunk) for compatibility with decoders that do not use thesRGB chunk. Only the following values shall be used.

Table17 gAMA and cHRM values for sRGB
gAMA
Gamma	45455
cHRM
White point x	31270
White point y	32900
Red x	64000
Red y	33000
Green x	30000
Green y	60000
Blue x	15000
Blue y	6000

This chunk is ignored unless it is thehighest-precedence color chunk understood by the decoder.

It is recommended that thesRGB andiCCP chunks do not appear simultaneously in a PNG datastream.

11.3.2.6cICP Coding-independent code points for video signal type identification

The four-byte chunk type field contains the hexadecimal values

63 49 43 50

If present, thecICP chunk specifies the color space (primaries), transfer function, matrix coefficients and scaling factor of the image using the code points specified in [ITU-T-H.273]. The video format signalingSHOULD be used when processing the image, including by a decoder or when rendering the image.

The cICP chunk consists of four 1-byte unsigned integers to identify the characteristics described above.

The following specifies the syntax of thecICP chunk:

Table18 cICP chunk components
Name	Size
Color Primaries	1 byte
Transfer Function	1 byte
Matrix Coefficients	1 byte
Video Full Range Flag	1 byte

Each of the fields of thecICP chunk corresponds to the parameter of the same name in [ITU-T-H.273].

RGB is currently the only supported color model in PNG, and as suchMatrix Coefficients shall be set to0.

Note

IfVideo Full Range Flag value is1, then the image is a full-range image. Typically, images in the RGB color representation are stored in the full-range signal quantization, therefore the vast majority of computer graphics and web images, including those used in traditional PNG workflows, arefull-range images. IfVideo Full Range Flag value is0, then the image is anarrow-range image. Narrow range images are found in video workflows where there are sample values below reference black (0% signal level) or above nominal peak (100% signal level). For example, [ITU-R-BT.709] specifies that, for 10-bit coding, reference black (called black level) corresponds to a code value of 64 and nominal peak to a code value of 940. In narrow range, momentary excursions defined as overshoots and undershoots exist below reference black and above nominal peak in order to preserve processing artifacts caused by filtering/compression or by uncontrolled lighting without clipping. This can improve image quality during additional stages of processing and compression. The use of undershoot/overshoot has also been used to preserve additional color volume (both light and color) as described in [EBU-R-103]. [SMPTE-RP-2077] describes full range in more detail and includes the mapping fromfull-range images tonarrow-range images and describes protected code values for SDI (baseband) carriage.

IfVideo Full Range Flag is0 (anarrow-range image), recommended practice is to define transfer functions such as EOTF or inverse OETF over the extended range, so as to include negative values. This is done as follows:

out= sign(in) * TransferFunction(abs(in))

ThecICP chunkMUST come before thePLTE andIDAT chunks.

This chunk, if understood by the decoder, is thehighest-precedence color chunk.

Example 1

cICP chunk field values for afull-range image that uses the color primaries and thePQ transfer function specified at [ITU-R-BT.2100]:

09 10 00 01

(Four 1-byte unsigned integers, in hexadecimal)

Example 2

cICP chunk field values for afull-range image that uses the color primaries and theHLG transfer function specified at [ITU-R-BT.2100]:

09 12 00 01

(Four 1-byte unsigned integers, in hexadecimal)

Example 3

cICP chunk field values for anarrow-range image that uses the color primaries and thetransfer function defined at [ITU-R-BT.709]:

01 01 00 00

(Four 1-byte unsigned integers, in hexadecimal)

In a similar way to the use of thesRGB chunk to compactly signal an sRGB image,cICP can be used to compactly signal a Display P3 image [Display-P3].

Example 4

cICP chunk field values for afull-range image that uses the color primaries and thetransfer function defined by [Display-P3]:

0C 0D 00 01

(Four 1-byte unsigned integers, in hexadecimal)

11.3.2.7mDCV Mastering Display Color Volume

The four-byte chunk type field contains the hexadecimal values

6D 44 43 56

If present, themDCV chunk characterizes the Mastering Display Color Volume (mDCV) used at the point of content creation, as specified in [SMPTE-ST-2086]. The mDCV chunk provides informative static metadata which allows a target (consumer) display to potentially optimize its tone mapping decisions on a comparison of its inherent capabilities versus the original mastering display's capabilites.

mDCV is typically used with thePQ [ITU-R-BT.2100] transfer function and additionalcLLI metadata and is commonly then called [HDR10] (PQ with ST 2086 static metadata, MaxFALL and MaxCLL). The mDCV chunk may also be included withHLG [ITU-R-BT.2100] andSDR image formats (for example [ITU-R-BT.709]).

Since mDCV was originally created as supplemental static metadata meant to optimize the tone-mapping of images on a video display target, a cICP chunk must accompany the use of mDCV in order to establish the basic characteristics of the image content. Color Primaries and White Point characteristics can be derived from cICP chunk formats. Specific examples of its most common use-cases for images using both HDR [ITU-R-BT.2100] and SDR [ITU-R-BT.709] are available in [ITU-T-Series-H-Supplement-19]. The basic (cICP) characteristics plus the supplemental (mDCV) static metadata may provide valuable information to optimize tone-mapping decisions.

Note

Issue #319 discusses tone-mapping behavior when themDCV chunk is present.

ForSDR images, if mDCV display min/max luminance are unknown, the default characteristics can be derived from the values in [ITU-T-Series-H-Supplement-19] Table 11 or from the relevantSDR specification. At present, there is no published, standardized method for translating an SDR image signal from its default viewing condition (display luminance and ambient illumination) to that signalled in the mDCV chunk.

Note

The HLG [ITU-R-BT.2100] image format does have published methods for translating the image for both changes in display luminance (within [ITU-R-BT.2100]) and ambient illumination (within the accompanying report [ITU-R-BT.2390]). This may be used with SDR images.

The following specifies the syntax of themDCV chunk:

Table19 mDCV chunk components
Name	Size	Divisor value
Mastering display color primary chromaticities (CIE 1931x,y of R,G,B )	12 bytes	0.00002
Mastering display white point chromaticity (CIE 1931x,y)	4 bytes	0.00002
Mastering display maximum luminance (measured in cd/m²)	4 bytes	0.0001
Mastering display minimum luminance (measured in cd/m²)	4 bytes	0.0001

The color primaries are encoded as three pairs ofPNG two-byte unsigned integers, in the orderx and theny, each representing the x or y primary chromaticity value divided by the divisor value. They are ordered starting with the primary with the largest x chromaticity, followed by the primary with the largest y chromaticity, followed by the remaining primary. For RGB color spaces, this corresponds to the order R, G, B.

The white point is encoded as a pair ofPNG two-byte unsigned integers, in the orderx and theny, each representing the x or y whie chromaticity value divided by the divisor value.

The maximum and minimum luminance values are encoded asPNG four-byte unsigned integers, representing the absolute luminance value in cd/m² divided by the divisor value.

The divisor maps from actual value to stored value. For example, the unitless divisor of 0.00002 for the primaries and white point would store the chromaticity (0.6800, 0.3200) as {34000, 16000}.

ThemDCV chunkMUST come before thePLTE andIDAT chunks.

Below are mDCV examples for [ITU-R-BT.2100]HDR.

Example 5

ExamplemDCV chunk mastering display color primaries forHDR [ITU-R-BT.2100]:

Name	Actual values	Stored Decimal values	Stored Hexadecimal values
Color primaries specified in [ITU-R-BT.2020]	(0.708, 0.292)	{ 35400, 14600 }	{ 8A 48, 39 08 }
	(0.170, 0.797)	{ 8500, 39850 }	{ 21 34, 9B AA }
	(0.131, 0.046)	{ 6550, 2300 }	{ 19 96, 08 FC }

Example 6

ExamplemDCV chunk mastering display white point forHDR [ITU-R-BT.2100]:

Name	Actual values	Stored Decimal values	Stored Hexadecimal values
Illuminant D65 specified in [SMPTE-RP-177]	(0.3127, 0.3290)	{ 15635, 16450 }	{ 3C 05, 40 42 }

Example 7

ExamplemDCV chunk mastering display maximum luminance forHDR [ITU-R-BT.2100]:

Actual value	Stored Decimal value	Stored Hexadecimal value
4000 cd/m²	40000000	02 62 5A 00

Example 8

ExamplemDCV chunk mastering display minimum luminance:

Actual value	Stored Decimal value	Stored Hexadecimal value
0.0005 cd/m²	5	00 00 00 05

Below are mDCV examples for [Display-P3]SDR.

Example 9

ExamplemDCV chunk mastering display color primaries for [Display-P3]:

Name	Actual values	Stored Decimal values	Stored Hexadecimal values
Color primaries specified in [Display-P3]	(0.68, 0.32)	{ 34000, 16000 }	{ 84 D0, 3E 80 }
	(0.265, 0.69)	{ 13520, 34500 }	{ 34 D0, 86 C4 }
	(0.15, 0.06)	{ 7500, 3000 }	{ 1D 4C, 0B B8 }

Example 10

ExamplemDCV chunk mastering display white point for [Display-P3]:

Name	Actual values	Stored Decimal values	Stored Hexadecimal values
Illuminant D65 specified in [SMPTE-RP-177]	(0.3127, 0.3290)	{ 15635, 16450 }	{ 3D 13, 40 42 }

Example 11

ExamplemDCV chunk mastering display maximum luminance for [Display-P3]:

Actual value	Stored Decimal values	Stored Hexadecimal values
80 cd/m²	800000	00 0C 35 00

Example 12

Example mDCV chunk mastering display minimum luminance for [Display-P3]:

Actual value	Stored Decimal values	Stored Hexadecimal values
0.05 cd/m²	500	00 00 01 F4

11.3.2.8cLLI Content Light Level Information

The four-byte chunk type field contains the hexadecimal values

63 4C 4C 49

If present, thecLLI chunk identifies two characteristics ofHDR content:

ThecLLI chunk adds static metadata which provides an opportunity to optimize tone mapping of the associated content to a specific target display. This is accomplished by tailoring the tone mapping of the content itself to the specific peak brightness capabilities of the target display to prevent clipping. The method of tone-mapping optimization is currently subjective.

MaxCLL (Maximum Content Light Level) uses a static metadata value to indicate the maximum light level of any single pixel (in cd/m², also known as nits) of the entire playback sequence. There is often an algorithmic filter to eliminate false values occurring from processing or noise that could adversely affect intended downstream tone mapping.

MaxFALL (Maximum Frame Average Light Level) uses a static metadata value to indicate the maximum value of theframe average light level (in cd/m², also known as nits) of the entire playback sequence. MaxFALL is calculated by first averaging the decoded luminance values of all the pixels in each frame, and then using the value for the frame with the highest value.

The MaxCLL and MaxFALL values are encoded asPNG four-byte unsigned integers.

Note

[CTA-861.3-A] describes the method of calculation for generating thecLLI values, but does not specify any filtering. [HDR-Static-Meta] describes an improved method which rejects extreme values from statistical outliers, noise or ringing from resampling filters, and is recommended for practical implementations.

Note

[SMPTE-ST-2067-21] Section 7.5 adds additional information in Section 7.5 in the case where thecLLI values are unknown and have not been calculated.

Note

Issue #319 discusses tone-mapping behavior when thecLLI chunk is present.

Eachframe is analyzed.

A value of zero for either MaxCLL or MaxFALL means that the value is unknown or not currently calculable.

Note

An example where this will not be calculable is when creating a live animated PNG stream, when not all frames will be available to compute the values until the stream ends. The encoder may wish to use the value zero initially and replace this with the calculated value when the stream ends.

The following specifies the syntax of thecLLI chunk:

Table20cLLI chunk components
Name	Size	Divisor value
Maximum Content Light Level (MaxCLL)	4 bytes	0.0001 cd/m²
Maximum Frame-Average Light Level (MaxFALL)	4 bytes	0.0001 cd/m²

Example 13

ExamplecLLI chunk Maximum Content Light Level:

Actual value	Stored Decimal values	Stored Hexadecimal values
1000 cd/m²	10000000	00 98 96 80

Example 14

ExamplecLLI chunk Maximum Frame-Average Light Level:

Actual value	Stored Decimal values	Stored Hexadecimal values
250 cd/m²	2500000	00 26 25 A0

11.3.3Textual information

Introduction

PNG provides thetEXt,iTXt, andzTXt chunks for storing text strings associated with the image, such as an image description or copyright notice. Keywords are used to indicate what each text string represents. Any number of such text chunks may appear, and more than one with the same keyword is permitted.

11.3.3.1Keywords and text strings

The following keywords are predefined and should be used where appropriate.

Table21 Predefined keywords
Keyword value	Description
Title	Short (one line) title or caption for image
Author	Name of image's creator
Description	Description of image (possibly long)
Copyright	Copyright notice
Creation Time	Time of original image creation
Software	Software used to create the image
Disclaimer	Legal disclaimer
Warning	Warning of nature of content
Source	Device used to create the image
Comment	Miscellaneous comment
XML:com.adobe.xmp	Extensible Metadata Platform (XMP) information, formatted as required by the XMP specification [XMP]. The use ofiTXt, with Compression Flag set to 0, and both Language Tag and Translated Keyword set to the null string, are recommended for XMP compliance.
Collection	Name of a collection to which the image belongs. An image may belong to one or more collections, each named by a separate text chunk.

Other keywordsMAY be defined by any application for private or general interest.

KeywordsSHOULD be .

reasonably self-explanatory, since the aim is to let other human users understand what the chunk contains; and
chosen to minimize the chance that the same keyword is used for incompatible purposes by different applications.

Keywords of general interestSHOULD be listed in [PNG-EXTENSIONS].

Keywords shall contain only printable Latin-1 [ISO_8859-1] characters and spaces; that is, only code points 0x20-7E and 0xA1-FF are allowed. To reduce the chances for human misreading of a keyword, leading spaces, trailing spaces, and consecutive spaces are not permitted in keywords, nor is U+00A0 NON-BREAKING SPACE since it is visually indistinguishable from an ordinary space.

Keywords shall be spelled exactly as registered, so that decoders can use simple literal comparisons when looking for particular keywords. In particular, keywords are considered case-sensitive. Keywords are restricted to 1 to 79 bytes in length.

For the Creation Time keyword, the date formatSHOULD be in the RFC 3339 [rfc3339] date-time format or in the date format defined in section 5.2.14 of RFC 1123 [rfc1123]. The RFC3339 date-time format is preferred. The actual format of this field is undefined.

TheiTXt chunk uses the UTF-8 encoding [rfc3629] and can be used to convey characters in any language. There is an option to compress text strings in theiTXt chunk.iTXt is recommended for all text strings, as it supports Unicode. There are alsotEXt andzTXt chunks, whose content is restricted to the printable Latin-1 character set plus U+000A LINE FEED (LF). Text strings inzTXt are compressed intozlib datastreams usingdeflate compression (see10.3Other uses of compression).

11.3.3.2tEXt Textual data

The four-byte chunk type field contains the hexadecimal values

74 45 58 74

EachtEXt chunk contains a keyword and a text string, in the format:

Keyword	1-79 bytes (character string)
Null separator	1 byte (null character)
Text string	0 or more bytes (character string)

The keyword and text string are separated by a zero byte (null character). Neither the keyword nor the text string may contain a null character. The text string isnot null-terminated (the length of the chunk defines the ending). The text string may be of any length from zero bytes up to the maximum permissible chunk size less the length of the keyword and null character separator.

The keyword indicates the type of information represented by the text string as described in11.3.3.1Keywords and text strings.

Text is interpreted according to the Latin-1 character set [ISO_8859-1]. The text string may contain any Latin-1 character. Newlines in the text string should be represented by a single linefeed character (decimal 10). Characters other than those defined in Latin-1 plus the linefeed character have no defined meaning intEXt chunks. Text containing characters outside the repertoire of ISO/IEC 8859-1 should be encoded using theiTXt chunk.

11.3.3.3zTXt Compressed textual data

The four-byte chunk type field contains the hexadecimal values

7A 54 58 74

ThezTXt andtEXt chunks are semantically equivalent, but thezTXt chunk is recommended for storing large blocks of text.

AzTXt chunk contains:

Keyword	1-79 bytes (character string)
Null separator	1 byte (null character)
Compression method	1 byte
Compressed text datastream	n bytes

The keyword and null character are the same as in thetEXt chunk. The keyword is not compressed. The compression method entry defines the compression method used. The only value defined in this International Standard is 0 (deflate compression). Other values are reserved for future standardization. The compression method entry is followed by the compressed text datastream that makes up the remainder of the chunk. For compression method 0, this datastream is azlib datastream with deflate compression (see10.3Other uses of compression). Decompression of this datastream yields Latin-1 text that is identical to the text that would be stored in an equivalenttEXt chunk.

11.3.3.4iTXt International textual data

The four-byte chunk type field contains the hexadecimal values

69 54 58 74

AniTXt chunk contains:

Keyword	1-79 bytes (character string)
Null separator	1 byte (null character)
Compression flag	1 byte
Compression method	1 byte
Language tag	0 or more bytes (character string)
Null separator	1 byte (null character)
Translated keyword	0 or more bytes
Null separator	1 byte (null character)
Text	0 or more bytes

The keyword is described in11.3.3.1Keywords and text strings.

The compression flag is 0 for uncompressed text, 1 for compressed text. Only the text field may be compressed. The compression method entry defines the compression method used. The only compression method defined in this specification is 0 (zlib datastream withdeflate compression, see10.3Other uses of compression). For uncompressed text, encoders shall set the compression method to 0, and decoders shall ignore it.

The language tag is a well-formed language tag defined by [BCP47]. Unlike the keyword, the language tag is case-insensitive. Subtags must appear in theIANA language subtag registry. If the language tag is empty, the language is unspecified. Examples of language tags include:en,en-GB,es-419,zh-Hans,zh-Hans-CN,tlh-Cyrl-AQ,ar-AE-u-nu-latn, andx-private.

The translated keyword and text both use the UTF-8 encoding [rfc3629], and neither shall contain a zero byte (null character). The text, unlike other textual data in this chunk, is not null-terminated; its length is derived from the chunk length.

Line breaks should not appear in the translated keyword. In the text, a newline should be represented by a single linefeed character (hexadecimal 0A). The remaining control characters (01-09, 0B-1F, 7F-9F) are discouraged in both the translated keyword and text. In UTF-8 there is a difference between the characters 80-9F (which are discouraged) and the bytes 80-9F (which are often necessary).

The translated keyword, if not empty, should contain a translation of the keyword into the language indicated by the language tag, and applications displaying the keyword should display the translated keyword in addition.

11.3.4Miscellaneous information

11.3.4.1bKGD Background color

The four-byte chunk type field contains the hexadecimal values

62 4B 47 44

ThebKGD chunk specifies a default background color to present the image against. If there is any other preferred background, either user-specified or part of a larger page (as in a browser), thebKGD chunk should be ignored. ThebKGD chunk contains:

Table22 bKGD chunk contents
Color types 0 and 4
Greyscale	2 bytes
Color types 2 and 6
Red	2 bytes
Green	2 bytes
Blue	2 bytes
Color type 3
Palette index	1 byte

Forcolor type 3 (indexed-color), the value is the palette index of the color to be used as background.

Forcolor types 0 and 4 (greyscale,greyscale with alpha), the value is the grey level to be used as background in the range 0 to (2^bitdepth)-1. Forcolor types 2 and 6 (truecolor,truecolor with alpha), the values are the color to be used as background, given as RGB samples in the range 0 to (2^bitdepth)-1. In each case, for consistency, two bytes per sample are used regardless of the image bit depth. If the image bit depth is less than 16, the least significant bits are used. Encoders should set the other bits to 0, and decoders must mask the other bits to 0 before the value is used.

11.3.4.2hIST Image histogram

The four-byte chunk type field contains the hexadecimal values

68 49 53 54

ThehIST chunk contains a series of two-byte unsigned integers:

Frequency	2 bytes (unsigned integer)
...etc...

ThehIST chunk gives the approximate usage frequency of each color in the palette. A histogram chunk can appear only when aPLTE chunk appears. If a viewer is unable to provide all the colors listed in the palette, the histogram may help it decide how to choose a subset of the colors for display.

There shall be exactly one entry for each entry in thePLTE chunk. Each entry is proportional to the fraction of pixels in the image that have that palette index; the exact scale factor is chosen by the encoder.

Histogram entries are approximate, with the exception that a zero entry specifies that the corresponding palette entry is not used at all in the image. A histogram entry shall be nonzero if there are any pixels of that color.

Note

NOTE When the palette is a suggested quantization of atruecolor image, the histogram is necessarily approximate, since a decoder may map pixels to palette entries differently than the encoder did. In this situation, zero entries should not normally appear, because any entry might be used.

11.3.4.3pHYs Physical pixel dimensions

The four-byte chunk type field contains the hexadecimal values

70 48 59 73

ThepHYs chunk specifies the intended pixel size or aspect ratio for display of the image. It contains:

Table23 pHYs chunk contents
Name	Size
Pixels per unit, X axis	4 bytes (PNG four-byte unsigned integer)
Pixels per unit, Y axis	4 bytes (PNG four-byte unsigned integer)
Unit specifier	1 byte

The following values are defined for the unit specifier:

Table24 Unit specifier values
Value	Description
0	unit is unknown
1	unit is the metre

When the unit specifier is 0, thepHYs chunk defines pixel aspect ratio only; the actual size of the pixels remains unspecified.

If thepHYs chunk is not present, pixels are assumed to be square, and the physical size of each pixel is unspecified.

11.3.4.4sPLT Suggested palette

The four-byte chunk type field contains the hexadecimal values

73 50 4C 54

ThesPLT chunk contains:

Table25 sPLT chunk contents
Name	Size
Palette name	1-79 bytes (character string)
Null separator	1 byte (null character)
Sample depth	1 byte
Red	1 or 2 bytes
Green	1 or 2 bytes
Blue	1 or 2 bytes
Alpha	1 or 2 bytes
Frequency	2 bytes
...etc...

Each palette entry is six bytes or ten bytes containing five unsigned integers (red, blue, green, alpha, and frequency).

There may be any number of entries. A PNG decoder determines the number of entries from the length of the chunk remaining after the sample depth byte. This shall be divisible by 6 if thesPLT sample depth is 8, or by 10 if thesPLT sample depth is 16. Entries shall appear in decreasing order of frequency. There is no requirement that the entries all be used by the image, nor that they all be different.

The palette name can be any convenient name for referring to the palette (for example "256 color including Macintosh default", "256 color including Windows-3.1 default", "Optimal 512"). The palette name may aid the choice of the appropriate suggested palette when more than one appears in a PNG datastream.

The palette name is case-sensitive, and subject to the same restrictions as the keyword parameter for thetEXt chunk. Palette names shall contain only printable Latin-1 characters and spaces (only code points 0x20-7E and 0xA1-FF are allowed). Leading, trailing, and consecutive spaces are not permitted.

ThesPLT sample depth shall be 8 or 16.

The red, green, blue, and alpha samples are either one or two bytes each, depending on thesPLT sample depth, regardless of the image bit depth. The color samples are not premultiplied by alpha, nor are they precomposited against any background. An alpha value of 0 means fully transparent. An alpha value of 255 (when thesPLT sample depth is 8) or 65535 (when thesPLT sample depth is 16) means fully opaque. ThesPLT chunk may appear for anycolor type. Entries insPLT use the samegamma value andchromaticity values as the PNG image, but may fall outside the range of values used in the color space of the PNG image; for example, in agreyscale PNG image, eachsPLT entry would typically have equal red, green, and blue values, but this is not required. Similarly,sPLT entries can have non-opaque alpha values even when the PNG image does not use transparency.

Each frequency value is proportional to the fraction of the pixels in the image for which that palette entry is the closest match in RGBA space, before the image has beencomposited against any background. The exact scale factor is chosen by the PNG encoder; it is recommended that the resulting range of individual values reasonably fills the range 0 to 65535. A PNG encoder may artificially inflate the frequencies for colors considered to be "important", for example the colors used in a logo or the facial features of a portrait. Zero is a valid frequency meaning that the color is "least important" or that it is rarely, if ever, used. When all the frequencies are zero, they are meaningless, that is to say, nothing may be inferred about the actual frequencies with which the colors appear in the PNG image.

MultiplesPLT chunks are permitted, but each shall have a different palette name.

11.3.4.5eXIf Exchangeable Image File (Exif) Profile

The four-byte chunk type field contains the hexadecimal values

65 58 49 66

The data segment of theeXIf chunk contains an Exif profile in the format specified in "4.7.2 Interoperability Structure of APP1 in Compressed Data" of [CIPA-DC-008] except that the JPEG APP1 marker, length, and the "Exif ID code" described in 4.7.2(C), i.e., "Exif", NULL, and padding byte, are not included.

TheeXIf chunk size is constrained only by the maximum of 2³¹-1 bytes imposed by the PNG specification. Only oneeXIf chunk is allowed in a PNG datastream.

TheeXIf chunk contains metadata concerning the originalimage data. If the image has been edited subsequent to creation of the Exif profile, this data might no longer apply to the PNGimage data. It is recommended that unless a decoder has independent knowledge of the validity of the Exif data, the data should be considered to be of historical value only. It is beyond the scope of this specification to resolve potential conflicts between data in the eXIf chunk and in other PNG chunks.

11.3.4.5.1eXIf General Recommendations

While the PNG specification allows the chunk size to be as large as 2³¹-1 bytes, application authors should be aware that, if the Exif profile is going to be written to a JPEG [JPEG] datastream, the total length of theeXIf chunk data may need to be adjusted to not exceed 2¹⁶-9 bytes, so it can fit into a JPEG APP1 marker (Exif) segment.

11.3.4.5.2eXIf Recommendations for Decoders

The first two bytes of data are either "II" for little-endian (Intel) or "MM" for big-endian (Motorola) byte order. Decoders should check the first four bytes to ensure that they have the following hexadecimal values:

4949 2A00 (ASCII"II",16-bit little-endian integer42)

4D 4D00 2A (ASCII"MM",16-bit big-endian integer42)

All other values are reserved for possible future definition.

11.3.4.5.3eXIf Recommendations for Encoders

Image editing applications should consider Paragraph E.3 of the Exif Specification [CIPA-DC-008], which discusses requirements for updating Exif data when the image is changed. Encoders should follow those requirements, but decoders should not assume that it has been accomplished.

While encoders may choose to update them, there is no expectation that any thumbnails present in the Exif profile have (or have not) been updated if the main image was changed.

11.3.5Time stamp information

11.3.5.1tIME Image last-modification time

The four-byte chunk type field contains the hexadecimal values

74 49 4D 45

ThetIME chunk gives the time of the last image modification (not the time of initial image creation). It contains:

Table26 tIME chunk contents
Name	Size
Year	2 bytes (complete; for example, 1995,not 95)
Month	1 byte (1-12)
Day	1 byte (1-31)
Hour	1 byte (0-23)
Minute	1 byte (0-59)
Second	1 byte (0-60) (to allow for leap seconds)

Universal Time (UTC) should be specified rather than local time.

ThetIME chunk is intended for use as an automatically-applied time stamp that is updated whenever theimage data are changed.

11.3.6Animation information

11.3.6.1acTL Animation Control Chunk

The four-byte chunk type field contains the hexadecimal values

61 63 54 4C

TheacTL chunk declares that this is an animated PNG image, gives the number of frames, and the number of times to loop. It contains:

`num_frames`	4 bytes
`num_plays`	4 bytes

Each value is encoded as aPNG four-byte unsigned integer.

num_frames indicates the total number of frames in the animation. This must equal the number offcTL chunks. 0 is not a valid value. 1 is a valid value, for a single-frame PNG. If this value does not equal the actual number of frames it should be treated as an error.

num_plays indicates the number of times that this animation should play; if it is 0, the animation should play indefinitely. If nonzero, the animation should come to rest on the final frame at the end of the last play.

TheacTL chunk must appear before the firstIDAT chunk within a valid PNG stream.

Note

For Web compatibility, due to the long time between the development and deployment of this chunk and it's incorporation into the PNG specification, this chunk name is exceptionally defined as if it were a private chunk.

11.3.6.2fcTL Frame Control Chunk

The four-byte chunk type field contains the hexadecimal values

66 63 54 4C

ThefcTL chunk defines the dimensions, position, delay and disposal of an individual frame. Exactly onefcTL chunk chunk is required for each frame. It contains:

Table27 fcTL chunk contents
Name	Size
`sequence_number`	4 bytes
`width`	4 bytes
`height`	4 bytes
`x_offset`	4 bytes
`y_offset`	4 bytes
`delay_num`	2 bytes
`delay_den`	2 bytes
`dispose_op`	1 byte
`blend_op`	1 byte

sequence_number defines thesequence number of the animation chunk, starting from 0. It is encoded as aPNG four-byte unsigned integer.

width andheight define the width and height of the following frame. They are encoded asPNG four-byte unsigned integers. They must be greater than zero.

x_offset andy_offset define the x and y position of the following frame. They are encoded asPNG four-byte unsigned integers. Zero is a valid value.

The frame must be rendered within the region defined byx_offset,y_offset,width, andheight. This region may not fall outside of the default image; thusx_offset pluswidth must not be greater than theIHDR width; similarlyy_offset plusheight must not be greater than theIHDR height.

delay_num anddelay_den define the numerator and denominator of the delay fraction; indicating the time to display the current frame, in seconds. If the denominator is 0, it is to be treated as if it were 100 (that is,delay_num then specifies 1/100ths of a second). If the the value of the numerator is 0 the decoder should render the next frame as quickly as possible, though viewers may impose a reasonable lower bound. They are encoded as two-byte unsigned integers.

Frame timings should be independent of the time required for decoding and display of each frame, so that animations will run at the same rate regardless of the performance of the decoder implementation.

dispose_op defines the type of frame area disposal to be done after rendering this frame; in other words, it specifies how the output buffer should be changed at the end of the delay (before rendering the next frame). It is encoded as a one-byte unsigned integer.

Valid values fordispose_op are:

0	`APNG_DISPOSE_OP_NONE`
1	`APNG_DISPOSE_OP_BACKGROUND`
2	`APNG_DISPOSE_OP_PREVIOUS`

APNG_DISPOSE_OP_NONE: no disposal is done on this frame before rendering the next; the contents of the output buffer are left as is.
APNG_DISPOSE_OP_BACKGROUND: the frame's region of the output buffer is to be cleared to fully transparent black before rendering the next frame.
APNG_DISPOSE_OP_PREVIOUS: the frame's region of the output buffer is to be reverted to the previous contents before rendering the next frame.

If the firstfcTL chunk uses adispose_op ofAPNG_DISPOSE_OP_PREVIOUS it should be treated asAPNG_DISPOSE_OP_BACKGROUND.

blend_op specifies whether the frame is to be alpha blended into the current output buffer content, or whether it should completely replace its region in the output buffer. It is encoded as a one-byte unsigned integer.

Valid values forblend_op are:

0	`APNG_BLEND_OP_SOURCE`
1	`APNG_BLEND_OP_OVER`

Ifblend_op isAPNG_BLEND_OP_SOURCE all color components of the frame, including alpha, overwrite the current contents of the frame's output buffer region. Ifblend_op isAPNG_BLEND_OP_OVER the frame should becomposited onto the output buffer based on its alpha, using a simple OVER operation as described inAlpha Channel Processing. Note that the second variation of the sample code is applicable.

Note that for the first frame, the two blend modes are functionally equivalent due to the clearing of the output buffer at the beginning of each play.

ThefcTL chunk corresponding to the default image, if it exists, has these restrictions:

Thex_offset andy_offset fields must be 0.
Thewidth andheight fields must equal the corresponding fields from theIHDR chunk.

As noted earlier, the output buffer must be completely initialized to fully transparent black at the beginning of each play. This is to ensure that each play of the animation will be identical. Decoders are free to avoid an explicit clear step as long as the result is guaranteed to be identical. For example, if the default image is included in the animation, and uses ablend_op ofAPNG_BLEND_OP_SOURCE, clearing is not necessary because the entire output buffer will be overwritten.

Note

11.3.6.3fdAT Frame Data Chunk

The four-byte chunk type field contains the hexadecimal values

66 64 41 54

ThefdAT chunk serves the same purpose for animations as theIDAT chunks do for static images; the set offdAT chunks contains theimage data for all frames (or, for animations which include thestatic image as first frame, for all frames after the first one). It contains:

Table28 fdAT chunk contents
Name	Size
`sequence_number`	4 bytes
`frame_data`	n bytes

At least onefdAT chunk is required for each frame, except for the first frame, if that frame is represented by anIDAT chunk.

The compressed datastream for each frame is then the concatenation, in ascendingsequence number order, of the contents of theframe_data fields of all thefdAT chunks within a frame.

Because of the sequence number,fdAT chunksmay not be of zero length); however theframe_data fields may be of zero length. When decompressed, the datastream is the complete pixel data of a PNG image, including the filter byte at the beginning of each scanline, similar to the uncompressed data of all theIDAT chunks. It utilizes the same bit depth,color type, compression method,filter method, interlace method, and palette (if any) as thestatic image.

Each frame inherits every property specified by any critical or ancillary chunksbefore the firstIDAT chunk in the file, except the width and height, which come from thefcTL chunk.

If the PNGpHYs chunk is present, theAPNG images and theirx_offset andy_offset values must be scaled in the same way as the main image. Conceptually, such scaling occurs while mapping the output buffer onto the canvas.

Note

12.PNG Encoders

Introduction

This clause gives requirements and recommendations for encoder behavior. A PNG encoder shall produce a PNG datastream from a PNG image that conforms to the format specified in the preceding clauses. Best results will usually be achieved by following the additional recommendations given here.

12.1Encoder gamma handling

SeeC.Gamma and chromaticity for a brief introduction togamma issues.

PNG encoders capable of full color management will perform more sophisticated calculations than those described here and may choose to use theiCCP chunk. If it is known that the image samples conform to the sRGB specification [SRGB], encoders are strongly encouraged to write thesRGB chunk without performing additionalgamma handling. In both cases it is recommended that an appropriategAMA chunk be generated for use by PNG decoders that do not recognize theiCCP orsRGB chunks.

A PNG encoder has to determine:

what value to write in thegAMA chunk;
how to transform the provided image samples into the values to be written in the PNG datastream.

The value to write in thegAMA chunk is that value which causes a PNG decoder to behave in the desired way. See13.13Decoder gamma handling.

The transform to be applied depends on the nature of the image samples and their precision. If the samples represent light intensity in floating-point or high precision integer form (perhaps from a computer graphics renderer), the encoder may performgamma encoding (applying a power function with exponent less than 1) before quantizing the data to integer values for inclusion in the PNG datastream. This results in fewer banding artifacts at a given sample depth, or allows smaller samples while retaining the same visual quality. An intensity level expressed as a floating-point value in the range 0 to 1 can be converted to a datastream image sample by:

integer_sample = floor((2^sampledepth-1) * intensity^{encoding_exponent} + 0.5)

If the intensity in the equation is the desired output intensity, the encoding exponent is thegamma value to be used in thegAMA chunk.

If the intensity available to the PNG encoder is the original scene intensity, another transformation may be needed. There is sometimes a requirement for the displayed image to have higher contrast than the original source image. This corresponds to an end-to-endtransfer function from original scene to display output with an exponent greater than 1. In this case:

gamma= encoding_exponent/end_to_end_exponent

If it is not known whether the conditions under which the original image was captured or calculated warrant such a contrast change, it may be assumed that the display intensities are proportional to original scene intensities, i.e. the end-to-end exponent is 1 and hence:

gamma= encoding_exponent

If the image is being written to a datastream only, the encoder is free to choose the encoding exponent. Choosing a value that causes thegamma value in thegAMA chunk to be 1/2.2 is often a reasonable choice because it minimizes the work for a PNG decoder displaying on a typical video monitor.

Some image renderers may simultaneously write the image to a PNG datastream and display it on-screen. The displayed pixels should begamma corrected for the display system and viewing conditions in use, so that the user sees a proper representation of the intended scene.

If the renderer wants to write the displayed sample values to the PNG datastream, avoiding a separategamma encoding step for the datastream, the renderer should approximate thetransfer function of the display system by a power function, and write the reciprocal of the exponent into thegAMA chunk. This will allow a PNG decoder to reproduce what was displayed on screen for the originator during rendering.

However, it is equally reasonable for a renderer to compute displayed pixels appropriate for the display device, and to perform separategamma encoding for data storage and transmission, arranging to have a value in thegAMA chunk more appropriate to the future use of the image.

Computer graphics renderers often do not performgamma encoding, instead making sample values directly proportional to scene light intensity. If the PNG encoder receives sample values that have already been quantized into integer values, there is no point in doinggamma encoding on them; that would just result in further loss of information. The encoder should just write the sample values to the PNG datastream. This does not imply that thegAMA chunk should contain agamma value of 1.0 because the desired end-to-endtransfer function from scene intensity to display output intensity is not necessarily linear. However, the desiredgamma value is probably not far from 1.0. It may depend on whether the scene being rendered is a daylight scene or an indoor scene, etc.

When the sample values come directly from a piece of hardware, the correctgAMA value can, in principle, be inferred from thetransfer function of the hardware and lighting conditions of the scene. In the case of video digitizers ("frame grabbers"), the samples are probably in the sRGB color space, because the sRGB specification was designed to be compatible with modern video standards. Image scanners are less predictable. Their output samples may be proportional to the input light intensity since CCD sensors themselves are linear, or the scanner hardware may have already applied a power function designed to compensate for dot gain in subsequent printing (an exponent of about 0.57), or the scanner may have corrected the samples for display on a monitor. It may be necessary to refer to the scanner's manual or to scan a calibrated target in order to determine the characteristics of a particular scanner. It should be remembered thatgamma relates samples to desired display output, not to scanner input.

Datastream format converters generally should not attempt to convert supplied images to a differentgamma. The data should be stored in the PNG datastream without conversion, and thegamma value should be deduced from information in the source datastream if possible.Gamma alteration at datastream conversion time causes re-quantization of the set of intensity levels that are represented, introducing further roundoff error with little benefit. It is almost always better to just copy the sample values intact from the input to the output file.

If the source datastream describes thegamma characteristics of the image, a datastream converter is strongly encouraged to write agAMA chunk. Some datastream formats specify the display exponent (the exponent of the function which maps image samples to display output rather than the other direction). If the source file'sgamma value is greater than 1.0, it is probably a display exponent, and the reciprocal of this value should be used for the PNGgamma value. If the source file format records the relationship between image samples and a quantity other than display output, it will be more complex than this to deduce the PNGgamma value.

If a PNG encoder or datastream converter knows that the image has been displayed satisfactorily using a display system whosetransfer function can be approximated by a power function with exponentdisplay_exponent, the image can be marked as having thegamma value:

gamma=1/display_exponent

It is better to write agAMA chunk with a value that is approximately correct than to omit the chunk and force PNG decoders to guess an approximategamma value. If a PNG encoder is unable to infer thegamma value, it is preferable to omit thegAMA chunk. If a guess has to be made this should be left to the PNG decoder.

gamma does not apply to alpha samples; alpha is always represented linearly.

12.2Encoder color handling

SeeC.Gamma and chromaticity for references to color issues.

If it is possible for the encoder to determine the chromaticities of the source display primaries, or to make a strong guess based on the origin of the image, or the hardware running it, the encoder is strongly encouraged to output thecHRM chunk. If this is done, thegAMA chunk should also be written; decoders can do little with acHRM chunk if thegAMA chunk is missing.

There are a number of recommendations and standards for primaries andwhite points, some of which are linked to particular technologies, for example the CCIR 709 standard [ITU-R-BT.709] and the SMPTE-C standard [SMPTE-170M].

There are three cases that need to be considered:

the encoder is part of the generation system;
the source image is captured by a camera or scanner;
the PNG datastream was generated by translation from some other format.

In the case of hand-drawn or digitally edited images, it is necessary to determine what monitor they were viewed on when being produced. Many image editing programs allow the type of monitor being used to be specified. This is often because they are working in some device-independent space internally. Such programs have enough information to write validcHRM andgAMA chunks, and are strongly encouraged to do so automatically.

If the encoder is compiled as a portion of a computer image renderer that performs full-spectral rendering, the monitor values that were used to convert from the internal device-independent color space to RGB should be written into thecHRM chunk. Any colors that are outside the gamut of the chosen RGB device should be mapped to be within the gamut; PNG does not store out-of-gamut colors.

If the computer image renderer performs calculations directly in device-dependent RGB space, acHRM chunk should not be written unless the scene description and rendering parameters have been adjusted for a particular monitor. In that case, the data for that monitor should be used to construct acHRM chunk.

A few image formats store calibration information, which can be used to fill in thecHRM chunk. For example, TIFF 6.0 files [TIFF-6.0] can optionally store calibration information, which if present should be used to construct thecHRM chunk.

Video created with recent video equipment probably uses the CCIR 709 primaries and D65white point [ITU-R-BT.709], which are given inTable29.

Table29 CCIR 709 primaries and D65 whitepoint
	R	G	B	White
x	0.640	0.300	0.150	0.3127
y	0.330	0.600	0.060	0.3290

An older but still very popular video standard is SMPTE-C [SMPTE-170M] given inTable30.

Table30 SMPTE-C video standard
	R	G	B	White
x	0.630	0.310	0.155	0.3127
y	0.340	0.595	0.070	0.3290

It isnot recommended that datastream format converters attempt to convert supplied images to a different RGB color space. The data should be stored in the PNG datastream without conversion, and the source primary chromaticities should be recorded if they are known. Color space transformation at datastream conversion time is a bad idea because of gamut mismatches and rounding errors. As withgamma conversions, it is better to store the data losslessly and incur at most one conversion when the image is finally displayed.

See13.14Decoder color handling.

12.3Alpha channel creation

The alpha channel can be regarded either as a mask that temporarily hides transparent parts of the image, or as a means for constructing a non-rectangular image. In the first case, the color values of fully transparent pixels should be preserved for future use. In the second case, the transparent pixels carry no useful data and are simply there to fill out the rectangular image area required by PNG. In this case, fully transparent pixels should all be assigned the same color value for best compression.

Image authors should keep in mind the possibility that a decoder will not support transparency control in full (see13.16Alpha channel processing). Hence, the colors assigned to transparent pixels should be reasonable background colors whenever feasible.

For applications that do not require a full alpha channel, or cannot afford the price in compression efficiency, thetRNS transparency chunk is also available.

If the image has a known background color, this color should be written in thebKGD chunk. Even decoders that ignore transparency may use thebKGD color to fill unused screen area.

If the original image has premultiplied (also called "associated") alpha data, it can be converted to PNG's non-premultiplied format by dividing each sample value by the corresponding alpha value, then multiplying by the maximum value for the image bit depth, and rounding to the nearest integer. In valid premultiplied data, the sample values never exceed their corresponding alpha values, so the result of the division should always be in the range 0 to 1. If the alpha value is zero, output black (zeroes).

12.4Sample depth scaling

When encoding input samples that have a sample depth that cannot be directly represented in PNG, the encoder shall scale the samples up to a sample depth that is allowed by PNG. The most accurate scaling method is the linear equation:

output = floor((input * MAXOUTSAMPLE / MAXINSAMPLE) + 0.5)

where the input samples range from 0 toMAXINSAMPLE and the outputs range from 0 toMAXOUTSAMPLE (which is 2^sampledepth-1).

A close approximation to the linear scaling method is achieved by "left bit replication", which is shifting the valid bits to begin in the most significant bit and repeating the most significant bits into the open bits. This method is often faster to compute than linear scaling.

Assume that 5-bit samples are being scaled up to 8 bits. If the source sample value is 27 (in the range from 0-31), then the original bits are:

4 3 2 1 0---------1 1 0 1 1

Left bit replication gives a value of 222:

7 6 5 4 3  2 1 0----------------1 1 0 1 1  1 1 0|=======|  |===|    |      Leftmost Bits Repeated to Fill Open Bits    |Original Bits

which matches the value computed by the linear equation. Left bit replication usually gives the same value as linear scaling, and is never off by more than one.

A distinctly less accurate approximation is obtained by simply left-shifting the input value and filling the low order bits with zeroes. This scheme cannot reproduce white exactly, since it does not generate an all-ones maximum value; the net effect is to darken the image slightly. This method is not recommended in general, but it does have the effect of improving compression, particularly when dealing with greater-than-8-bit sample depths. Since the relative error introduced by zero-fill scaling is small at high sample depths, some encoders may choose to use it. Zero-fill shallnot be used for alpha channel data, however, since many decoders will treat alpha values of all zeroes and all ones as special cases. It is important to represent both those values exactly in the scaled data.

When the encoder writes ansBIT chunk, it is required to do the scaling in such a way that the high-order bits of the stored samples match the original data. That is, if thesBIT chunk specifies a sample depth of S, the high-order S bits of the stored data shall agree with the original S-bit data values. This allows decoders to recover the original data by shifting right. The added low-order bits are not constrained. All the above scaling methods meet this restriction.

When scaling up sourceimage data, it is recommended that the low-order bits be filled consistently for all samples; that is, the same source value should generate the same sample value at any pixel position. This improves compression by reducing the number of distinct sample values. This is not a mandatory requirement, and some encoders may choose not to follow it. For example, an encoder might instead dither the low-order bits, improving displayed image quality at the price of increasing file size.

In some applications the original source data may have a range that is not a power of 2. The linear scaling equation still works for this case, although the shifting methods do not. It is recommended that ansBIT chunk not be written for such images, sincesBIT suggests that the original data range was exactly 0..2^S-1.

12.5Suggested palettes

Suggested palettes may appear assPLT chunks in any PNG datastream, or as aPLTE chunk intruecolor PNG datastreams. In either case, the suggested palette is not an essential part of theimage data, but it may be used to present the image on indexed-color display hardware. Suggested palettes are of no interest to viewers running ontruecolor hardware.

When ansPLT chunk is used to provide a suggested palette, it is recommended that the encoder use the frequency fields to indicate the relative importance of the palette entries, rather than leave them all zero (meaning undefined). The frequency values are most easily computed as "nearest neighbor" counts, that is, the approximate usage of each RGBA palette entry if no dithering is applied. (These counts will often be available "for free" as a consequence of developing the suggested palette.) Because the suggested palette includes transparency information, it should be computed for the un-composited image.

Even for indexed-color images,sPLT can be used to define alternative reduced palettes for viewers that are unable to display all the colors present in thePLTE chunk. If thePLTE chunk appears without thebKGD chunk in an image ofcolor type 6, the circumstances under which the palette was computed are unspecified.

An older method for including a suggested palette in atruecolor PNG datastream uses thePLTE chunk. If this method is used, the histogram (frequencies) should appear in a separatehIST chunk. ThePLTE chunk does not include transparency information. Hence for images ofcolor type 6 (truecolor with alpha), it is recommended that abKGD chunk appear and that the palette and histogram be computed with reference to the image as it would appear after compositing against the specified background color. This definition is necessary to ensure that useful palette entries are generated for pixels having fractional alpha values. The resulting palette will probably be useful only to viewers that present the image against the same background color. It is recommended thatPNG editors delete or recompute the palette if they alter or remove thebKGD chunk in an image ofcolor type 6.

For images ofcolor type 2 (truecolor), it is recommended that thePLTE andhIST chunks be computed with reference to the RGB data only, ignoring any transparent-color specification. If the datastream uses transparency (has atRNS chunk), viewers can easily adapt the resulting palette for use with their intended background color (see13.17Histogram and suggested palette usage).

For providing suggested palettes, thesPLT chunk is more flexible than thePLTE chunk in the following ways:

WithsPLT multiple suggested palettes may be provided. A PNG decoder may choose an appropriate palette based on name or number of entries.
In a PNG datastream ofcolor type 6 (truecolor with alpha channel), thePLTE chunk represents a palette alreadycomposited against thebKGD color, so it is useful only for display against that background color. ThesPLT chunk provides an un-composited palette, which is useful for display against backgrounds chosen by the PNG decoder.
Since thesPLT chunk is an ancillary chunk, aPNG editor may add or modify suggested palettes without being forced to discard unknown unsafe-to-copy chunks.
Whereas thesPLT chunk is allowed in PNG datastreams forcolor types 0, 3, and 4 (greyscale andindexed-color), thePLTE chunk cannot be used to provide reduced palettes in these cases.
More than 256 entries may appear in thesPLT chunk.

A PNG encoder that uses thesPLT chunk may choose to write a suggested palette represented byPLTE andhIST chunks as well, for compatibility with decoders that do not recognize thesPLT chunk.

12.6Interlacing

This specification defines two interlace methods, one of which is no interlacing. Interlacing provides a convenient basis from which decoders can progressively display an image, as described in13.10Interlacing and progressive display.

12.7Filter selection

For images ofcolor type 3 (indexed-color), filter type 0 (None) is usually the most effective. Color images with 256 or fewer colors should almost always be stored inindexed-color format;truecolor format is likely to be much larger.

Filter type 0 is also recommended for images of bit depths less than 8. For low-bit-depth greyscale images, in rare cases, better compression may be obtained by first expanding the image to 8-bit representation and then applying filtering.

Fortruecolor andgreyscale images, any of the five filters may prove the most effective. If an encoder uses a fixed filter, the Paeth filter type is most likely to be the best.

For best compression oftruecolor andgreyscale images, and if compression efficiency is valued over speed of compression, the recommended approach is adaptive filtering in which a filter type is chosen for each scanline. Each unique image will have a different set of filters which perform best for it. An encoder could try every combination of filters to find what compresses best for a given image. However, when an exhaustive search is unacceptable, here are some general heuristics which may perform well enough: compute the output scanline using all five filters, and select the filter that gives the smallest sum of absolute values of outputs. (Consider the output bytes as signed differences for this test.) This method usually outperforms any single fixed filter type choice.

Filtering according to these recommendations is effective in conjunction with either of the two interlace methods defined in this specification.

12.8Compression

The encoder may divide the compressed datastream intoIDAT chunks however it wishes. (MultipleIDAT chunks are allowed so that encoders may work in a fixed amount of memory; typically the chunk size will correspond to the encoder's buffer size.) A PNG datastream in which eachIDAT chunk contains only one data byte is valid, though remarkably wasteful of space. (Zero-length IDAT chunks are also valid, though even more wasteful.)

12.9Text chunk processing

A nonempty keyword shall be provided for each text chunk. The generic keyword "Comment" can be used if no better description of the text is available. If a user-supplied keyword is used, encoders should check that it meets the restrictions on keywords.

TheiTXt chunk uses the UTF-8 encoding of Unicode and thus can store text in any language. ThetEXt andzTXt chunks use the Latin-1 (ISO 8859-1) character encoding, which limits the range of characters that can be used in these chunks. Encoders should preferiTXt totEXt andzTXt chunks, in order to allow a wide range of characters without data loss. Encoders must convert characters that use locallegacy character encodings to the appropriate encoding when storing text.

When creatingiTXt chunks, encoders should followUTF-8 encode inEncoding Standard.

Encoders should discourage the creation of single lines of text longer than 79 Unicodecode points, in order to facilitate easy reading. It is recommended that text items less than 1024 bytes in size should be output using uncompressed text chunks. It is recommended that the basic title and author keywords be output using uncompressed text chunks. Placing large text chunks after theimage data (after theIDAT chunks) can speed up image display in some situations, as the decoder will decode theimage data first. It is recommended that small text chunks, such as the image title, appear before theIDAT chunks.

12.10Chunking

12.10.1Use of private chunks

EncodersMAY use private chunks to carry information that need not be understood by other applications.

12.10.2Use of non-reserved field values

EncodersMAY use non-reserved field values for experimental or private use.

12.10.3Ancillary chunks

All ancillary chunks are optional, encoders need not write them. However, encoders are encouraged to write the standard ancillary chunks when the information is available.

13.PNG decoders and viewers

Introduction

This clause gives some requirements and recommendations for PNG decoder behavior and viewer behavior. A viewer presents the decoded PNG image to the user. Since viewer and decoder behavior are closely connected, decoders and viewers are treated together here. The only absolute requirement on a PNG decoder is that it successfully reads any datastream conforming to the format specified in the preceding chapters. However, best results will usually be achieved by following these additional recommendations.

PNG decoders shall support all valid combinations of bit depth,color type, compression method,filter method, and interlace method that are explicitly defined in this International Standard.

13.1Error handling

Errors in a PNG datastream will fall into two general classes, transmission errors and syntax errors (see4.10Error handling).

Examples of transmission errors are transmission in "text" or "ascii" mode, in which byte codes 13 and/or 10 may be added, removed, or converted throughout the datastream; unexpected termination, in which the datastream is truncated; or a physical error on a storage device, in which one or more blocks (typically 512 bytes each) will have garbled or random values. Some examples of syntax errors are an invalid value for a row filter, an invalid compression method, an invalid chunk length, the absence of aPLTE chunk before the firstIDAT chunk in an indexed image, or the presence of multiplegAMA chunks. A PNG decoder should handle errors as follows:

Detect errors as early as possible using the PNG signature bytes and CRCs on each chunk. Decoders should verify that all eight bytes of the PNG signature are correct. A decoder can have additional confidence in the datastream's integrity if the next eight bytes begin anIHDR chunk with the correct chunk length. ACRC should be checked before processing the chunk data. Sometimes this is impractical, for example when a streaming PNG decoder is processing a large IDAT chunk. In this case theCRC should be checked when the end of the chunk is reached.
Recover from an error, if possible; otherwise fail gracefully. Errors that have little or no effect on the processing of the image may be ignored, while those that affect critical data shall be dealt with in a manner appropriate to the application.
Provide helpful messages describing errors, including recoverable errors.

Three classes of PNG chunks are relevant to this philosophy. For the purposes of this classification, an "unknown chunk" is either one whose type was genuinely unknown to the decoder's author, or one that the author chose to treat as unknown, because default handling of that chunk type would be sufficient for the program's purposes. Other chunks are called "known chunks". Given this definition, the three classes are as follows:

known chunks, which necessarily includes all of the critical chunks defined in this specification (IHDR,PLTE,IDAT,IEND)
unknown critical chunks (bit 5 of the first byte of the chunk type is 0)
unknown ancillary chunks (bit 5 of the first byte of the chunk type is 1)

See5.4Chunk naming conventions for a description of chunk naming conventions.

PNG chunk types are marked "critical" or "ancillary" according to whether the chunks are critical for the purpose of extracting a viewable image (as withIHDR,PLTE, andIDAT) or critical to understanding the datastream structure (as withIEND). This is a specific kind of criticality and one that is not necessarily relevant to every conceivable decoder. For example, a program whose sole purpose is to extract text annotations (for example, copyright information) does not require a viewable image but shoulddecode UTF-8 correctly. Another decoder might consider thetRNS andgAMA chunks essential to its proper execution.

Syntax errors always involve known chunks because syntax errors in unknown chunks cannot be detected. The PNG decoder has to determine whether a syntax error is fatal (unrecoverable) or not, depending on its requirements and the situation. For example, most decoders can ignore an invalidIEND chunk; a text-extraction program can ignore the absence ofIDAT; an image viewer cannot recover from an emptyPLTE chunk in an indexed image but it can ignore an invalidPLTE chunk in atruecolor image; and a program that extracts the alpha channel can ignore an invalidgAMA chunk, but may consider the presence of twotRNS chunks to be a fatal error. Anomalous situations other than syntax errors shall be treated as follows:

Encountering an unknown ancillary chunk is never an error. The chunk can simply be ignored.
Encountering an unknown critical chunk is a fatal condition for any decoder trying to extract the image from the datastream. A decoder that ignored a critical chunk could not know whether the image it extracted was the one intended by the encoder.
A PNG signature mismatch, aCRC mismatch, or an unexpected end-of-stream indicates a corrupted datastream, and may be regarded as a fatal error. A decoder could try to salvage something from the datastream, but the extent of the damage will not be known.

When a fatal condition occurs, the decoder should fail immediately, signal an error to the user if appropriate, and optionally continue displaying any image data already visible to the user (i.e. "fail gracefully"). The application as a whole need not terminate.

When a non-fatal error occurs, the decoder should signal a warning to the user if appropriate, recover from the error, and continue processing normally.

When decoding an indexed-color PNG, if out-of-range indexes are encountered, decoders have historically varied in their handling of this error.Displaying the pixel as opaque black is one common error recovery tactic, and is now required by this specification. Older implementations will vary, and so the behavior must not be relied on by encoders.

Decoders that do not compute CRCs should interpret apparent syntax errors as indications of corruption (see also13.2Error checking).

Errors in compressed chunks (IDAT,zTXt,iTXt,iCCP) could lead to buffer overruns. Implementors ofdeflate decompressors should guard against this possibility.

APNG is designed to allow incremental display of frames before the entire datastream has been read. This implies that some errors may not be detected until partway through the animation. It is strongly recommended that when any error is encountered decoders should discard all subsequent frames, stop the animation, and revert to displaying the static image. A decoder which detects an error before the animation has started should display the static image. An error message may be displayed to the user if appropriate.

Decoders shall treat out-of-orderAPNG chunks as an error.APNG-aware PNG editors should restore them to correct order, using the sequence numbers.

13.2Error checking

The PNG error handling philosophy is described in13.1Error handling.

An unknown chunk type isnot to be treated as an error unless it is a critical chunk.

The chunk type can be checked for plausibility by seeing whether all four bytes are in the range codes 41-5A and 61-7A (hexadecimal); note that this need be done only for unrecognized chunk types. If the total datastream size is known (from file system information, HTTP protocol, etc), the chunk length can be checked for plausibility as well. If CRCs are not checked, dropped/added data bytes or an erroneous chunk length can cause the decoder to get out of step and misinterpret subsequent data as a chunk header.

For known-length chunks, such asIHDR, decoders should treat an unexpected chunk length as an error. Future extensions to this specification will not add new fields to existing chunks; instead, new chunk types will be added to carry new information.

Unexpected values in fields of known chunks (for example, an unexpected compression method in theIHDR chunk) shall be checked for and treated as errors. However, it is recommended that unexpected field values be treated as fatal errors only incritical chunks. An unexpected value in an ancillary chunk can be handled by ignoring the whole chunk as though it were an unknown chunk type. (This recommendation assumes that the chunk'sCRC has been verified. In decoders that do not check CRCs, it is safer to treat any unexpected value as indicating a corrupted datastream.)

Standard PNG images shall be compressed with compression method 0. The compression method field of the IHDR chunk is provided for possible future standardization or proprietary variants. Decoders shall check this byte and report an error if it holds an unrecognized code. See10.Compression for details.

13.3Security considerations

A PNG datastream is composed of a collection of explicitly typed chunks. Chunks whose contents are defined by the specification could actually contain anything, including malicious code. Similarly there could be data after theIEND chunk which could contain anything, including malicious code. There is no known risk that such malicious code could be executed on the recipient's computeras a result of decoding the PNG image. However, a malicious application might hide such code inside an innocent-looking image file and then execute it.

The possible security risks associated with future chunk types cannot be specified at this time. Security issues will be considered when defining future public chunks. There is no additional security risk associated with unknown or unimplemented chunk types, because such chunks will be ignored, or at most be copied into another PNG datastream.

TheiTXt,tEXt, andzTXt chunks contain keywords and data that are meant to be displayed as plain text. TheiCCP andsPLT chunks contain keywords that are meant to be displayed as plain text. It is possible that if the decoder displays such text without filtering out control characters, especially the ESC (escape) character, certain systems or terminals could behave in undesirable and insecure ways. It is recommended that decoders filter out control characters to avoid this risk; see13.7Text chunk processing.

For theeXIf chunk, the Exif Specification [CIPA-DC-008] does not contain an express requirement that tag "value offset" pointers must actually point to a valid address within the file. This requirement is merely implied. (See Paragraph 4.6.2, which describes the Exif IFD structure.) Regardless, decoders should be prepared to encounter invalid pointers and to handle them appropriately.

Every chunk begins with a length field, which makes it easier to write decoders that are invulnerable to fraudulent chunks that attempt to overflow buffers. TheCRC at the end of every chunk provides a robust defence against accidentally corrupted data. The PNG signature bytes provide early detection of common file transmission errors.

A decoder that fails to check CRCs could be subject to data corruption. The only likely consequence of such corruption is incorrectly displayed pixels within the image. Worse things might happen if the CRC of the IHDR chunk is not checked and the width or height fields are corrupted. See13.2Error checking.

A poorly written decoder might be subject to buffer overflow, because chunks can be extremely large, up to 2³¹-1 bytes long. But properly written decoders will handle large chunks without difficulty.

13.4Privacy considerations

Some image editing tools have historically performed redaction by merely setting the alpha channel of the redacted area to zero, without also removing the actual image data. Users who rely solely on the visual appearance of such images run a privacy risk because the actual image data can be easily recovered.

Similarly, some image editing tools have historically performed clipping by rewriting the width and height inIHDR without re-encoding the image data, which thus extends beyond the new width and height and may be recovered.

Images witheXIf chunks may contain automatically-included data, such as photographic GPS coordinates, which could be a privacy risk if the user is unaware that the PNG image contains this data. (Other image formats that contain EXIF, such as JPEG/JFIF, have the same privacy risk).

13.5Chunking

Decoders shall recognize chunk types by a simple four-byte literal comparison; it is incorrect to perform case conversion on chunk types. A decoder encountering an unknown chunk in which the ancillary bit is 1 may safely ignore the chunk and proceed to display the image. A decoder trying to extract the image, upon encountering an unknown chunk in which the ancillary bit is 0, indicating a critical chunk, shall indicate to the user that the image contains information it cannot safely interpret.

Decoders should test the properties of an unknown chunk type by numerically testing the specified bits. Testing whether a character is uppercase or lowercase is inefficient, and even incorrect if a locale-specific case definition is used.

Decoders should not flag an error if the reserved bit is set to 1, however, as some future version of the PNG specification could define a meaning for this bit. It is sufficient to treat a chunk with this bit set in the same way as any other unknown chunk type.

Decoders do not need to test the chunk type private bit, since it has no functional significance and is used to avoid conflicts between chunks defined byW3C and those defined privately.

All ancillary chunks are optional; decoders may ignore them. However, decoders are encouraged to interpret these chunks when appropriate and feasible.

13.6Pixel dimensions

Non-square pixels can be represented (see11.3.4.3pHYs Physical pixel dimensions), but viewers are not required to account for them; a viewer can present any PNG datastream as though its pixels are square.

Where the pixel aspect ratio of the display differs from the aspect ratio of the physical pixel dimensions defined in the PNG datastream, viewers are strongly encouraged to rescale images for proper display.

When thepHYs chunk has a unit specifier of 0 (unit is unknown), the behavior of a decoder may depend on the ratio of the two pixels-per-unit values, but should not depend on their magnitudes. For example, apHYs chunk containing(ppuX, ppuY, unit) = (2, 1, 0) is equivalent to one containing(1000, 500, 0); both are equally valid indications that the image pixels are twice as tall as they are wide.

One reasonable way for viewers to handle a difference between the pixel aspect ratios of the image and the display is to expand the image either horizontally or vertically, but not both. The scale factors could be obtained using the following floating-point calculations:

image_ratio = pHYs_ppuY / pHYs_ppuXdisplay_ratio = display_ppuY / display_ppuXscale_factor_X = max(1.0, image_ratio/display_ratio)scale_factor_Y = max(1.0, display_ratio/image_ratio)

Because other methods such as maintaining the image area are also reasonable, and because ignoring thepHYs chunk is permissible, authors should not assume that all viewing applications will use this scaling method.

As well as making corrections for pixel aspect ratio, a viewer may have reasons to perform additional scaling both horizontally and vertically. For example, a viewer might want to shrink an image that is too large to fit on the display, or to expand images sent to a high-resolution printer so that they appear the same size as they did on the display.

13.7Text chunk processing

If practical, PNG decoders should have a way to display to the user all theiTXt,tEXt, andzTXt chunks found in the datastream. Even if the decoder does not recognize a particular text keyword, the user might be able to understand it.

When processingtEXt andzTXt chunks, decoders could encounter characters other than those permitted. Some can be safely displayed (e.g., TAB, FF, and CR, hexadecimal 09, 0C, and 0D, respectively), but others, especially the ESC character (hexadecimal 1B), could pose a security hazard (because unexpected actions may be taken by display hardware or software). Decoders should not attempt to directly display any non-Latin-1 characters (except for newline and perhaps TAB, FF, CR) encountered in atEXt orzTXt chunk. Instead, they should be ignored or displayed in a visible notation such as "\nnn". See13.3Security considerations.

When processingiTXt chunks, decoders should followUTF-8 decode inEncoding Standard.

Even though encoders are recommended to represent newlines as linefeed (hexadecimal 0A), it is recommended that decoders not rely on this; it is best to recognize all the common newline combinations (CR, LF, and CR-LF) and display each as a single newline. TAB can be expanded to the proper number of spaces needed to arrive at a column multiple of 8.

Decoders running on systems with a non-Latin-1legacy character encoding should remap character codes so that Latin-1 characters are displayed correctly. Unsupported characters should be replaced with a system-appropriate replacement character (such as U+FFFD REPLACEMENT CHARACTER, U+003F QUESTION MARK, or U+001A SUB) or mapped to a visible notation such as "\nnn". Characters should be only displayed if they are printable characters on the decoding system. Some byte values may be interpreted by the decoding system as control characters; for security, decoders running on such systems should not display these control characters.

Decoders should be prepared to display text chunks that contain any number of printing characters between newline characters, even though it is recommended that encoders avoid creating lines in excess of 79 characters.

13.8Decompression

The compression technique used in this specification does not require the entire compressed datastream to be available before decompression can start. Display can therefore commence before the entire decompressed datastream is available. It is extremely unlikely that any general purpose compression methods in future versions of this specification will not have this property.

It is important to emphasize thatIDAT chunk boundaries have no semantic significance and can occur at any point in the compressed datastream. There is no required correlation between the structure of theimage data (for example, scanline boundaries) anddeflate block boundaries orIDAT chunk boundaries. The completeimage data is represented by a singlezlib datastream that is stored in some number ofIDAT chunks; a decoder that assumes any more than this is incorrect. Some encoder implementations may emit datastreams in which some of these structures are indeed related, but decoders cannot rely on this.

13.9Filtering

To reverse the effect of a filter, the decoder may need to use the decoded values of the prior pixel on the same line, the pixel immediately above the current pixel on the prior line, and the pixel just to the left of the pixel above. This implies that at least one scanline's worth ofimage data needs to be stored by the decoder at all times. Even though some filter types do not refer to the prior scanline, the decoder will always need to store each scanline as it is decoded, since the next scanline might use a filter type that refers to it. See7.3Filtering.

13.10Interlacing and progressive display

Decoders are required to be able to read interlaced images. If the reference image contains fewer than five columns or fewer than five rows, some passes will be empty. Encoders and decoders shall handle this case correctly. In particular, filter type bytes are associated only with nonempty scanlines; no filter type bytes are present in an empty reduced image.

When receiving images over slow transmission links, viewers can improve perceived performance by displaying interlaced images progressively. This means that as each reduced image is received, an approximation to the complete image is displayed based on the data received so far. One simple yet pleasing effect can be obtained by expanding each received pixel to fill a rectangle covering the yet-to-be-transmitted pixel positions below and to the right of the received pixel. This process can be described by the following ISO C code [ISO_9899]:

/*    variables declared and initialized elsewhere in the code:        height, width    functions or macros defined elsewhere in the code:        visit(), min() */int starting_row[7]  = {0,0,4,0,2,0,1 };int starting_col[7]  = {0,4,0,2,0,1,0 };int row_increment[7] = {8,8,8,4,4,2,2 };int col_increment[7] = {8,8,4,4,2,2,1 };int block_height[7]  = {8,8,4,4,2,2,1 };int block_width[7]   = {8,4,4,2,2,1,1 };int pass;long row, col;pass =0;while (pass <7){    row = starting_row[pass];while (row < height)    {        col = starting_col[pass];while (col < width)        {visit(row, col,min(block_height[pass], height - row),min(block_width[pass], width - col));            col = col + col_increment[pass];        }        row = row + row_increment[pass];    }    pass = pass +1;}

The functionvisit(row,column,height,width) obtains the next transmitted pixel and paints a rectangle of the specified height and width, whose upper-left corner is at the specified row and column, using the color indicated by the pixel. Note that row and column are measured from 0,0 at the upper left corner.

If the viewer is merging the received image with a background image, it may be more convenient just to paint the received pixel positions (thevisit() function sets only the pixel at the specified row and column, not the whole rectangle). This produces a "fade-in" effect as the new image gradually replaces the old. An advantage of this approach is that proper alpha or transparency processing can be done as each pixel is replaced. Painting a rectangle as described above will overwrite background-image pixels that may be needed later, if the pixels eventually received for those positions turn out to be wholly or partially transparent. This is a problem only if the background image is not stored anywhere offscreen.

13.11Truecolor image handling

To achieve PNG's goal of universal interchangeability, decoders shall accept all types of PNG image:indexed-color,truecolor, andgreyscale. Viewers running on indexed-color display hardware need to be able to reducetruecolor images to indexed-color for viewing. This process is called "color quantization".

A simple, fast method for color quantization is to reduce the image to a fixed palette. Palettes with uniform color spacing ("color cubes") are usually used to minimize the per-pixel computation. For photograph-like images, dithering is recommended to avoid ugly contours in what should be smooth gradients; however, dithering introduces graininess that can be objectionable.

The quality of rendering can be improved substantially by using a palette chosen specifically for the image, since a color cube usually has numerous entries that are unused in any particular image. This approach requires more work, first in choosing the palette, and second in mapping individual pixels to the closest available color. PNG allows the encoder to supply suggested palettes, but not all encoders will do so, and the suggested palettes may be unsuitable in any case (they may have too many or too few colors). Therefore, high-quality viewers will need to have a palette selection routine at hand. A large lookup table is usually the most feasible way of mapping individual pixels to palette entries with adequate speed.

Numerous implementations of color quantization are available. The PNG sample implementation, libpng (http://www.libpng.org/pub/png/libpng.html), includes code for the purpose.

13.12Sample depth rescaling

Decoders may wish to scale PNG data to a lesser sample depth (data precision) for display. For example, 16-bit data will need to be reduced to 8-bit depth for use on most present-day display hardware. Reduction of 8-bit data to 5-bit depth is also common.

The most accurate scaling is achieved by the linear equation

output = floor((input * MAXOUTSAMPLE / MAXINSAMPLE) + 0.5)

where

MAXINSAMPLE = (2^sampledepth)-1
MAXOUTSAMPLE = (2^{desired_sampledepth})-1

A slightly less accurate conversion is achieved by simply shifting right by(sampledepth - desired_sampledepth) places. For example, to reduce 16-bit samples to 8-bit, the low-order byte can be discarded. In many situations the shift method is sufficiently accurate for display purposes, and it is certainly much faster. (But ifgamma correction is being done, sample rescaling can be merged into thegamma correction lookup table, as is illustrated in13.13Decoder gamma handling.)

If the decoder needs to scale samples up (for example, if theframe buffer has a greater sample depth than the PNG image), it should use linear scaling or left-bit-replication as described in12.4Sample depth scaling.

When ansBIT chunk is present, the referenceimage data can be recovered by shifting right to the sample depth specified bysBIT. Note that linear scaling will not necessarily reproduce the original data, because the encoder is not required to have used linear scaling to scale the data up. However, the encoder is required to have used a method that preserves the high-order bits, so shifting always works. This is the only case in which shifting might be said to be more accurate than linear scaling. A decoder need not pay attention to thesBIT chunk; the stored image is a valid PNG datastream of the sample depth indicated by theIHDR chunk; however, usingsBIT to recover the original samples before scaling them to suit the display often yields a more accurate display than ignoringsBIT.

When comparing pixel values totRNS chunk values to detect transparent pixels, the comparison shall be done exactly. Therefore, transparent pixel detection shall be done before reducing sample precision.

13.13Decoder gamma handling

SeeC.Gamma and chromaticity for a brief introduction togamma issues.

Viewers capable of full color management will perform more sophisticated calculations than those described here.

For an image display program to produce correct tone reproduction, it is necessary to take into account the relationship between samples and display output, and thetransfer function of the display system. This can be done by calculating:

sample = integer_sample / (2^sampledepth - 1.0) display_output = sample^1.0/gamma display_input = inverse_display_transfer(display_output) framebuf_sample = floor((display_input * MAX_FRAMEBUF_SAMPLE)+0.5)

whereinteger_sample is the sample value from the datastream,framebuf_sample is the value to write into theframe buffer, andMAX_FRAMEBUF_SAMPLE is the maximum value of aframe buffer sample (255 for 8-bit, 31 for 5-bit, etc). The first line converts an integer sample into a normalized floating point value (in the range 0.0 to 1.0), the second converts to a value proportional to the desired display output intensity, the third accounts for the display system'stransfer function, and the fourth converts to an integerframe buffer sample. Zero raised to any positive power is zero.

A step could be inserted between the second and third to adjustdisplay_output to account for the difference between the actual viewing conditions and the reference viewing conditions. However, this adjustment requires accounting for veiling glare, black mapping, and color appearance models, none of which can be well approximated by power functions. Such calculations are not described here. If viewing conditions are ignored, the error will usually be small.

The displaytransfer function can typically be approximated by a power function with exponentdisplay_exponent, in which case the second and third lines can be merged into:

display_input = sample^{1.0/(gamma * display_exponent)} = sample^{decoding_exponent}

so as to perform only one power calculation. For color images, the entire calculation is performed separately for R, G, and B values.

Thegamma value can be taken directly from thegAMA chunk. Alternatively, an application may wish to allow the user to adjust the appearance of the displayed image by influencing thegamma value. For example, the user could manually set a parameteruser_exponent which defaults to 1.0, and the application could set:

gamma= gamma_from_file / user_exponentdecoding_exponent=1.0 / (gamma * display_exponent)= user_exponent / (gamma_from_file * display_exponent)

The user would setuser_exponent greater than 1 to darken the mid-level tones, or less than 1 to lighten them.

AgAMA chunk containing zero is meaningless but could appear by mistake. Decoders should ignore it, and editors may discard it and issue a warning to the user.

It isnot necessary to perform a transcendental mathematical computation for every pixel. Instead, a lookup table can be computed that gives the correct output value for every possible sample value. This requires only 256 calculations per image (for 8-bit accuracy), not one or three calculations per pixel. For an indexed-color image, a one-time correction of the palette is sufficient, unless the image uses transparency and is being displayed against a nonuniform background.

If floating-point calculations are not possible,gamma correction tables can be computed using integer arithmetic and a precomputed table of logarithms. Example code appears in [PNG-EXTENSIONS].

When the incoming image has unknowngamma value (gAMA,sRGB, andiCCP all absent), standalone image viewers should choose a likely defaultgamma value, but allow the user to select a new one if the result proves too dark or too light. The defaultgamma value may depend on other knowledge about the image, for example whether it came from the Internet or from the local system. For consistency, viewers for document formats such as HTML, or vector graphics such as SVG, should treat embedded or linked PNG images with unknowngamma value in the same way that they treat other untagged images.

In practice, it is often difficult to determine what value of display exponent should be used. In systems with no built-ingamma correction, the display exponent is determined entirely by theCRT. A display exponent of 2.2 should be used unless detailed calibration measurements are available for the particular CRT used.

Many modern frame buffers have lookup tables that are used to performgamma correction, and on these systems the display exponent value should be the exponent of the lookup table andCRT combined. It may not be possible to find out what the lookup table contains from within the viewer application, in which case it may be necessary to ask the user to supply the display system's exponent value. Unfortunately, different manufacturers use different ways of specifying what should go into the lookup table, so interpretation of the system gamma value is system-dependent.

The response of real displays is actually more complex than can be described by a single number (the display exponent). If actual measurements of the monitor's light output as a function of voltage input are available, the third and fourth lines of the computation above can be replaced by a lookup in these measurements, to find the actualframe buffer value that most nearly gives the desired brightness.

13.14Decoder color handling

SeeC.Gamma and chromaticity for references to color issues.

In many cases, theimage data in PNG datastreams will be treated as device-dependent RGB values and displayed without modification (except for appropriategamma correction). This provides the fastest display of PNG images. But unless the viewer uses exactly the same display hardware as that used by the author of the original image, the colors will not be exactly the same as those seen by the original author, particularly for darker or near-neutral colors. ThecHRM chunk provides information that allows closer color matching than that provided bygamma correction alone.

ThecHRM data can be used to transform theimage data from RGB to XYZ and thence into a perceptually linear color space such as CIE LAB. The colors can be partitioned to generate an optimal palette, because the geometric distance between two colors in CIE LAB is strongly related to how different those colors appear (unlike, for example, RGB or XYZ spaces). The resulting palette of colors, once transformed back into RGB color space, could be used for display or written into aPLTE chunk.

Decoders that are part of image processing applications might also transformimage data into CIE LAB space for analysis.

In applications where color fidelity is critical, such as product design, scientific visualization, medicine, architecture, or advertising, PNG decoders can transform theimage data from source RGB to the display RGB space of the monitor used to view the image. This involves calculating the matrix to go from source RGB to XYZ and the matrix to go from XYZ to display RGB, then combining them to produce the overall transformation. The PNG decoder is responsible for implementing gamut mapping.

Decoders running on platforms that have a Color Management System (CMS) can pass theimage data,gAMA, andcHRM values to the CMS for display or further processing.

PNG decoders that provide color printing facilities can use the facilities in Level 2 PostScript to specifyimage data in calibrated RGB space or in a device-independent color space such as XYZ. This will provide better color fidelity than a simple RGB to CMYK conversion. The PostScript Language Reference manual [PostScript] gives examples. Such decoders are responsible for implementing gamut mapping between source RGB (specified in thecHRM chunk) and the target printer. The PostScript interpreter is then responsible for producing the required colors.

PNG decoders can use thecHRM data to calculate an accurate greyscale representation of a color image. Conversion from RGB to grey is simply a case of calculating the Y (luminance) component of XYZ, which is a weighted sum of R, G, and B values. The weights depend upon the monitor type, i.e. the values in thecHRM chunk. PNG decoders may wish to do this for PNG datastreams with nocHRM chunk. In this case, a reasonable default would be the CCIR 709 primaries [ITU-R-BT.709]. The original NTSC primaries shouldnot be used unless the PNG image really was color-balanced for such a monitor.

13.15Background color

The background color given by thebKGD chunk will typically be used to fill unused screen space around the image, as well as any transparent pixels within the image. (Thus,bKGD is valid and useful even when the image does not use transparency.) If nobKGD chunk is present, the viewer will need to decide upon a suitable background color. When no other information is available, a medium grey such as 153 in the 8-bit sRGB color space would be a reasonable choice. Transparent black or white text and dark drop shadows, which are common, would all be legible against this background.

Viewers that have a specific background against which to present the image (such as web browsers) should ignore thebKGD chunk, in effect overridingbKGD with their preferred background color or background image.

The background color given by thebKGD chunk is not to be considered transparent, even if it happens to match the color given by thetRNS chunk (or, in the case of anindexed-color image, refers to a palette index that is marked as transparent by thetRNS chunk). Otherwise one would have to imagine something "behind the background" tocomposite against. The background color is either used as background or ignored; it is not an intermediate layer between the PNG image and some other background.

Indeed, it will be common that thebKGD andtRNS chunks specify the same color, since then a decoder that does not implement transparency processing will give the intended display, at least when no partially-transparent pixels are present.

13.16Alpha channel processing

The alpha channel can be used tocomposite a foreground image against a background image. The PNG datastream defines the foreground image and the transparency mask, but not the background image. PNG decoders arenot required to support this most general case. It is expected that most will be able to support compositing against a single background color.

The equation for computing acomposited sample value is:

output = alpha * foreground + (1-alpha) * background

where alpha and the input and output sample values are expressed as fractions in the range 0 to 1. This computation should be performed with intensity samples (notgamma-encoded samples). For color images, the computation is done separately for R, G, and B samples.

The following code illustrates the general case of compositing a foreground image against a background image. It assumes that the original pixel data are available for the background image, and that output is to aframe buffer for display. Other variants are possible; see the comments below the code. The code allows the sample depths andgamma values of foreground image and background image all to be different and not necessarily suited to the display system. In practice no assumptions about equality should be made without first checking.

This code is ISO C [ISO_9899], with line numbers added for reference in the comments below.

01  int foreground[4];/* image pixel: R, G, B, A */02  int background[3];/* background pixel: R, G, B */03  int fbpix[3];/* frame buffer pixel */04  int fg_maxsample;/* foreground max sample */05  int bg_maxsample;/* background max sample */06  int fb_maxsample;/* frame buffer max sample */07  int ialpha;08  float alpha, compalpha;09  float gamfg, linfg, gambg, linbg, comppix, gcvideo;/* Get max sample values in data and frame buffer */10  fg_maxsample = (1 << fg_sample_depth) -1;11  bg_maxsample = (1 << bg_sample_depth) -1;12  fb_maxsample = (1 << frame_buffer_sample_depth) -1;/*     * Get integer version of alpha.     * Check for opaque and transparent special cases;     * no compositing needed if so.     *     * We show the whole gamma decode/correct process in     * floating point, but it would more likely be done     * with lookup tables.     */13  ialpha = foreground[3];14if (ialpha ==0) {/*         * Foreground image is transparent here.         * If the background image is already in the frame         * buffer, there is nothing to do.         */15      ;16  }elseif (ialpha == fg_maxsample) {/*         * Copy foreground pixel to frame buffer.         */17for (i =0; i <3; i++) {18          gamfg = (float) foreground[i] / fg_maxsample;19          linfg =pow(gamfg,1.0 / fg_gamma);20          comppix = linfg;21          gcvideo =pow(comppix,1.0 / display_exponent);22          fbpix[i] = (int) (gcvideo * fb_maxsample +0.5);23      }24  }else {/*         * Compositing is necessary.         * Get floating-point alpha and its complement.         * Note: alpha is always linear; gamma does not         * affect it.         */25      alpha = (float) ialpha / fg_maxsample;26      compalpha =1.0 - alpha;27for (i =0; i <3; i++) {/*             * Convert foreground and background to floating             * point, then undo gamma encoding.             */28          gamfg = (float) foreground[i] / fg_maxsample;29          linfg =pow(gamfg,1.0 / fg_gamma);30          gambg = (float) background[i] / bg_maxsample;

31          linbg =pow(gambg,1.0 / bg_gamma);/*             * Composite.             */32          comppix = linfg * alpha + linbg * compalpha;/*             * Gamma correct for display.             * Convert to integer frame buffer pixel.             */33          gcvideo =pow(comppix,1.0 / display_exponent);34          fbpix[i] = (int) (gcvideo * fb_maxsample +0.5);35      }36  }

Variations:

If output is to another PNG datastream instead of aframe buffer, lines 21, 22, 33, and 34 should be changed along the following lines
```
/* * Gamma encode for storage in output datastream. * Convert to integer sample value. */gamout =pow(comppix, outfile_gamma);outpix[i] = (int) (gamout * out_maxsample +0.5);
```
Also, it becomes necessary to process background pixels when alpha is zero, rather than just skipping pixels. Thus, line 15will need to be replaced by copies of lines 17-23, but processing background instead of foreground pixel values.
If the sample depths of the output file, foreground file, and background file are all the same, and the threegamma values also match, then the no-compositing code in lines 14-23 reduces to copying pixel values from the input file to the output file if alpha is one, or copying pixel values from background to output file if alpha is zero. Since alpha is typically either zero or one for the vast majority of pixels in an image, this is a significant saving. Nogamma computations are needed for most pixels.
When the sample depths andgamma values all match, it may appear attractive to skip thegamma decoding and encoding (lines 28-31, 33-34) and just perform line 32 usinggamma-encoded sample values. Although this does not have too bad an effect on image quality, the time savings are small if alpha values of zero and one are treated as special cases as recommended here.
If the original pixel values of the background image are no longer available, only processedframe buffer pixels left by display of the background image, then lines 30 and 31 need to extract intensity from theframe buffer pixel values using code such as
```
/* * Convert frame buffer value into intensity sample. */gcvideo = (float) fbpix[i] / fb_maxsample;linbg =pow(gcvideo, display_exponent);
```
However, some roundoff error can result, so it is better to have the original background pixels available if at all possible.
Note that lines 18-22 are performing exactly the samegamma computation that is done when no alpha channel is present. If the no-alpha case is handled with a lookup table, the same lookup table can be used here. Lines 28-31 and 33-34 can also be done with (different) lookup tables.
Integer arithmetic can be used instead of floating point, providing care is taken to maintain sufficient precision throughout.

Note

NOTE In floating point, no overflow or underflow checks are needed, because the input sample values are guaranteed to be between 0 and 1, and compositing always yields a result that is in between the input values (inclusive). With integer arithmetic, some roundoff-error analysis might be needed to guarantee no overflow or underflow.

When displaying a PNG image with full alpha channel, it is important to be able tocomposite the image against some background, even if it is only black. Ignoring the alpha channel will cause PNG images that have been converted from an associated-alpha representation to look wrong. (Of course, if the alpha channel is a separate transparency mask, then ignoring alpha is a useful option: it allows the hidden parts of the image to be recovered.)

Even if the decoder does not implement true compositing logic, it is simple to deal with images that contain only zero and one alpha values. (This is implicitly true forgreyscale andtruecolor PNG datastreams that use atRNS chunk; forindexed-color PNG datastreams it is easy to check whether thetRNS chunk contains any values other than 0 and 255.) In this simple case, transparent pixels are replaced by the background color, while others are unchanged.

If a decoder contains only this much transparency capability, it should deal with a full alpha channel by treating all nonzero alpha values as fully opaque or by dithering. Neither approach will yield very good results for images converted from associated-alpha formats, but this is preferable to doing nothing. Dithering full alpha to binary alpha is very much like dithering greyscale to black-and-white, except that all fully transparent and fully opaque pixels should be left unchanged by the dither.

13.17Histogram and suggested palette usage

For viewers running on indexed-color hardware attempting to display atruecolor image, or an indexed-color image whose palette is too large for theframe buffer, the encoder may have provided one or more suggested palettes insPLT chunks. If one of these is found to be suitable, based on size and perhaps name, the PNG decoder can use that palette. Suggested palettes with a sample depth different from what the decoder needs can be converted using sample depth rescaling (see13.12Sample depth rescaling).

When the background is a solid color, the viewer shouldcomposite the image and the suggested palette against that color, then quantize the resulting image to the resulting RGB palette. When the image uses transparency and the background is not a solid color, no suggested palette is likely to be useful.

Fortruecolor images, a suggested palette might also be provided in aPLTE chunk. If the image has atRNS chunk and the background is a solid color, the viewer will need to adapt the suggested palette for use with its desired background color. To do this, the palette entry closest to thetRNS color should be replaced with the desired background color; or alternatively a palette entry for the background color can be added, if the viewer can handle more colors than there arePLTE entries.

For images ofcolor type 6 (truecolor with alpha), anyPLTE chunk should have been designed for display of the image against a uniform background of the color specified by thebKGD chunk. Viewers should probably ignore the palette if they intend to use a different background, or if thebKGD chunk is missing. Viewers can use a suggested palette for display against a different background than it was intended for, but the results may not be very good.

If the viewer presents a transparenttruecolor image against a background that is more complex than a uniform color, it is unlikely that the suggested palette will be optimal for thecomposite image. In this case it is best to perform atruecolor compositing step on thetruecolor PNG image and background image, then color-quantize the resulting image.

Intruecolor PNG datastreams, if bothPLTE andsPLT chunks appear, the PNG decoder may choose from among the palettes suggested by both, bearing in mind the different transparency semantics described above.

The frequencies in thesPLT andhIST chunks are useful when the viewer cannot provide as many colors as are used in the palette in the PNG datastream. If the viewer has a shortfall of only a few colors, it is usually adequate to drop the least-used colors from the palette. To reduce the number of colors substantially, it is best to choose entirely new representative colors, rather than trying to use a subset of the existing palette. This amounts to performing a new color quantization step; however, the existing palette and histogram can be used as the input data, thus avoiding a scan of theimage data in theIDAT chunks.

If no suggested palette is provided, a decoder can develop its own, at the cost of an extra pass over theimage data in theIDAT chunks. Alternatively, a default palette (probably a color cube) can be used.

14.Editors

14.1Additional chunk types

Authors are encouraged to look existing chunk types in both this specification and [PNG-EXTENSIONS] before considering introducing a new chunk types. The chunk types at [PNG-EXTENSIONS] are expected to be less widely supported than those defined in this specification.

14.2Behavior of PNG editors

Two examples ofPNG editors are a program that adds or modifies text chunks, and a program that adds a suggested palette to atruecolor PNG datastream. Ordinary image editors are notPNG editors because they usually discard all unrecognized information while reading in an image.

To allow new chunk types to be added to PNG, it is necessary to establish rules about the ordering requirements for all chunk types. Otherwise aPNG editor does not know what to do when it encounters an unknown chunk.

EXAMPLE Consider a hypothetical new ancillary chunk type that is safe-to-copy and is required to appear afterPLTE ifPLTE is present. If a program attempts to add aPLTE chunk and does not recognize the new chunk, it may insert thePLTE chunk in the wrong place, namely after the new chunk. Such problems could be prevented by requiringPNG editors to discard all unknown chunks, but that is a very unattractive solution. Instead, PNG requires ancillary chunks not to have ordering restrictions like this.

To prevent this type of problem while allowing for future extension, constraints are placed on both the behavior ofPNG editors and the allowed ordering requirements for chunks. The safe-to-copy bit defines the proper handling of unrecognized chunks in a datastream that is being modified.

If a chunk's safe-to-copy bit is 1, the chunk may be copied to a modified PNG datastream whether or not thePNG editor recognizes the chunk type, and regardless of the extent of the datastream modifications.
If a chunk's safe-to-copy bit is 0, it indicates that the chunk depends on theimage data. If the program has madeany changes tocritical chunks, including addition, modification, deletion, or reordering of critical chunks, then unrecognized unsafe chunks shallnot be copied to the output PNG datastream. (Of course, if the programdoes recognize the chunk, it can choose to output an appropriately modified version.)
APNG editor is always allowed to copy all unrecognized ancillary chunks if it has only added, deleted, modified, or reorderedancillary chunks. This implies that it is not permissible for ancillary chunks to depend on other ancillary chunks.
PNG editors shall terminate on encountering an unrecognized critical chunk type, because there is no way to be certain that a valid datastream will result from modifying a datastream containing such a chunk. (Simply discarding the chunk is not good enough, because it might have unknown implications for the interpretation of other chunks.) The safe/unsafe mechanism is intended for use with ancillary chunks. The safe-to-copy bit will always be 0 for critical chunks.

The rules governing ordering of chunks are as follows.

When copying an unknownunsafe-to-copy ancillary chunk, aPNG editor shall not move the chunk relative to any critical chunk. It may relocate the chunk freely relative to other ancillary chunks that occur between the same pair of critical chunks. (This is well defined since the editor shall not add, delete, modify, or reorder critical chunks if it is preserving unknown unsafe-to-copy chunks.)
When copying an unknownsafe-to-copy ancillary chunk, aPNG editor shall not move the chunk from beforeIDAT to afterIDAT or vice versa. (This is well defined becauseIDAT is always present.) Any other reordering is permitted.
When copying aknown ancillary chunk type, an editor need only honour the specific chunk ordering rules that exist for that chunk type. However, it may always choose to apply the above general rules instead.

These rules are expressed in terms of copying chunks from an input datastream to an output datastream, but they apply in the obvious way if a PNG datastream is modified in place.

14.3Ordering of chunks

14.3.1Ordering of critical chunks

Critical chunks may have arbitrary ordering requirements, becausePNG editors are required to terminate if they encounter unknown critical chunks. For exampleIHDR has the specific ordering rule that it shall always appear first. A PNG editor, or indeed any PNG-writing program, shall know and follow the ordering rules for any critical chunk type that it can generate.

14.3.2Ordering of ancillary chunks

The strictest ordering rules for an ancillary chunk type are:

Unsafe-to-copy chunks may have ordering requirements relative to critical chunks.
Safe-to-copy chunks may have ordering requirements relative toIDAT.

The actual ordering rules for any particular ancillary chunk type may be weaker. See for example the ordering rules for the standard ancillary chunk types in5.6Chunk ordering.

Decoders shall not assume more about the positioning of any ancillary chunk than is specified by the chunk ordering rules. In particular, it is never valid to assume that a specific ancillary chunk type occurs with any particular positioning relative to other ancillary chunks.

EXAMPLE It is unsafe to assume that a particular private ancillary chunk occurs immediately beforeIEND. Even if it is always written in that position by a particular application, aPNG editor might have inserted some other ancillary chunk after it. But it is safe to assume that the chunk will remain somewhere betweenIDAT andIEND.

15.Conformance

As well as sections marked as non-normative, all authoring guidelines, diagrams, examples, and notes in this specification are non-normative. Everything else in this specification is normative.

The key wordsMAY,MUST,SHALL,SHOULD, andSHOULD NOT in this document are to be interpreted as described inBCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

15.1Conformance

15.2Introduction

15.2.1Objectives

This clause addresses conformance of PNG datastreams, PNG encoders, PNG decoders, andPNG editors.

The primary objectives of the specifications in this clause are:

to promote interoperability by eliminating arbitrary subsets of, or extensions to, this specification;
to promote uniformity in the development of conformance tests;
to promote consistent results across PNG encoders, decoders, and editors;
to facilitate automated test generation.

15.2.2Scope

Conformance is defined for PNG datastreams and for PNG encoders, decoders, and editors.

This clause addresses the PNG datastream and implementation requirements including the range of allowable differences for PNG encoders, PNG decoders, andPNG editors. This clause does not directly address the environmental, performance, or resource requirements of the encoder, decoder, or editor.

The scope of this clause is limited to rules for the open interchange of PNG datastreams.

15.3Conformance conditions

15.3.1Conformance of PNG datastreams

A PNG datastream conforms to this specification if the following conditions are met.

The PNG datastream contains a PNG signature as the first content (see5.2PNG signature).
With respect to the chunk types defined in this International Standard:
- the PNG datastream contains as its first chunk, anIHDR chunk, immediately following the PNG signature;
- the PNG datastream contains as its last chunk, anIEND chunk.
No chunks or other content follow theIEND chunk.
All chunks contained therein match the specification of the corresponding chunk types of this specification. The PNG datastream shall obey the relationships among chunk types defined in this specification.
The sequence of chunks in the PNG datastream obeys the ordering relationship specified in this International Standard.
All field values in the PNG datastream obey the relationships specified in this specification producing the structure specified in this specification.
No chunks appear in the PNG datastream other than those specified in this specification or those defined according to the rules for creating new chunk types as defined in this specification.
The PNG datastream is encoded according to the rules of this International Standard.

15.3.2Conformance of PNG encoders

A PNG encoder conforms to this specification if it satisfies the following conditions.

All PNG datastreams that are generated by the PNG encoder are conforming PNG datastreams.
When encoding input samples that have a sample depth that cannot be directly represented in PNG, the encoder scales the samples up to the next higher sample depth that is allowed by PNG. The data are scaled in such a way that the high-order bits match the original data.
Private field values are used when encoding experimental or private definitions of values for any of the method or type fields.

15.3.3Conformance of PNG decoders

A PNG decoder conforms to this specification if it satisfies the following conditions.

It is able to read any PNG datastream that conforms to this International Standard, including both public and private chunks whose types may not be recognized.
It supports all the standardized critical chunks, and all the standardized compression, filter, and interlace methods and types in any PNG datastream that conforms to this International Standard.
Unknown chunk types are handled as described in5.4Chunk naming conventions. An unknown chunk type isnot treated as an error unless it is a critical chunk.
Unexpected values in fields of known chunks (for example, an unexpected compression method in theIHDR chunk) are treated as errors.
All types of PNG images (indexed-color,truecolor,greyscale,truecolor with alpha, andgreyscale with alpha) are processed. For example, decoders which are part of viewers running on indexed-color display hardware shall reducetruecolor images to indexed format for viewing.
Encountering an unknown chunk in which the ancillary bit is 0 generates an error if the decoder is attempting to extract the image.
A chunk type in which the reserved bit is set is treated as an unknown chunk type.
All valid combinations of bit depth andcolor type as defined in11.2.1IHDR Image header are supported.
An error is reported if an unrecognized value is encountered in the bit depth,color type, compression method,filter method, or interlace method bytes of theIHDR chunk.
When processing 16-bitgreyscale ortruecolor data in thetRNS chunk, both bytes of the sample values are evaluated to determine whether a pixel is transparent.
When processing an image compressed by compression method 0, the decoder assumes no more than that the completeimage data is represented by a single compressed datastream that is stored in some number ofIDAT chunks.
No assumptions are made concerning the positioning of any ancillary chunk other than those that are specified by the chunk ordering rules.

15.3.4Conformance of PNG editors

APNG editor conforms to this specification if it satisfies the following conditions.

It conforms to the requirements for PNG encoders.
It conforms to the requirements for PNG decoders.
It is able to encode all chunks that it decodes.
It preserves the ordering of the chunks presented within the rules in5.6Chunk ordering.
It properly processes the safe-to-copy bit information and preserves unknown chunks when the safe-to-copy rules permit it.
Unless the user specifically permits lossy operations or the editor issues a warning, it preserves all information required to reconstruct the reference image exactly, except that the sample depth of the alpha channel need not be preserved if it contains only zero and maximum values. Operations such as changing thecolor type or rearranging the palette in anindexed-color datastream are permitted provided that the new datastream losslessly represents the same reference image.

A.Internet Media Types

A.1image/png

This updates the existingimage/png Internet Media type, under theimage top level type. This appendix is in conformance withBCP 13 andW3CRegMedia.

Media type name:

image

Media subtype name:

png

Required parameters:

None

Optional parameters:

None

Encoding considerations:

binary

Security considerations:

A PNG document is composed of a collection of explicitly typed "chunks". For each of the chunk types defined in the PNG specification (except forgIFx), the only effect associated with those chunks is to cause an image to be rendered on the recipient's display or printer.

ThegIFx chunk type is used to encapsulate Application Extension data, and some use of that data might present security risks, though no risks are known. Likewise, the security risks associated with future chunk types cannot be evaluated, particularly unregistered chunks. However, it is the intention of the PNG Working Group to disallow chunks containing "executable" data to become registered chunks.

The text chunks,tEXt,iTXT andzTXt, contain data that can be displayed in the form of comments, etc. Some operating systems or terminals might allow the display of textual data with embedded control characters to perform operations such as re-mapping of keys, creation of files, etc. For this reason, the specification recommends that the text chunks be filtered for control characters before direct display.

The PNG format is specifically designed to facilitate early detection of file transmission errors, and makes use of cyclical redundancy checks to ensure the integrity of the data contained in its chunks.

Interoperability considerations:

Network byte order used throughout.

Published specification:

Portable Network Graphics (PNG) Specification,https://www.w3.org/TR/PNG/

Applications which use this media:

PNG is widely implemented in all Web browsers, image viewers, and image creation tools

Fragment identifier considerations:

N/A

Restrictions on usage:

N/A

Provisional registration? (standards tree only):

Additional information:

Deprecated alias names for this type:: N/A
Magic number(s):: 89 50 4E 47 0D 0A 1A 0A
File extension(s):: .png
Macintosh file type code:: PNGf
Object Identifiers:: N/A

General Comments:

This registration updates the earlier one:

The old one points to an expired Internet Draft. This updated registration points to aW3C Recommendation.
The old contact person is sadly deceased. The new contact email is a publicly archived W3C mailing list for the PNG Working Group.
Change controller is W3C

Person to contact for further information:

Name:: PNG Working Group
Email:: public-png@w3.org

Intended usage:

Common

Author/Change controller:

W3C

A.2image/apng

This appendix is in conformance withBCP 13 andW3CRegMedia.

Media type name:

image

Media subtype name:

apng

Required parameters:

N/A

Optional parameters:

N/A

Encoding considerations:

binary

Security considerations:

AnAPNG document is composed of a collection of explicitly typed "chunks". For each of the chunk types defined in the PNG specification (except forgIFx), the only effect associated with those chunks is to cause an animated image to be rendered on the recipient's display.

The text chunks,tEXt,iTXt andzTXt, contain data that can be displayed in the form of comments, etc. Some operating systems or terminals might allow the display of textual data with embedded control characters to perform operations such as re-mapping of keys, creation of files, etc. For this reason, the specification recommends that the text chunks be filtered for control characters before direct display.

If one creates anAPNG file with unrelated static image and animated image chunks, somebody using a tool not supporting the APNG format would only see the static image and be unaware of the additional content. This could be used e.g. to bypass moderation.

Interoperability considerations:

None

Published specification:

Portable Network Graphics (PNG) Specification,https://www.w3.org/TR/png/

Applications which use this media:

Animated PNG (APNG) is widely implemented in all Web browsers, and is increasingly available in image viewers, and animation and image creation tools

Fragment identifier considerations:

N/A

Restrictions on usage:

N/A

Provisional registration? (standards tree only):

Additional information:

Deprecated alias names for this type:: image/vnd.mozilla.apng
Magic number(s):: 89 50 4E 47 0D 0A 1A 0A
File extension(s):: .apng
Object Identifiers:: N/A

General Comments:

image/apng has been in widespread, unregistered use since 2015. Animated PNG was not part of the official PNG specification until 2022. This registration, plus the PNG specification (3rd Edition) brings official documentation into alignment with already widely-deployed reality.

Person to contact for further information:

Name:: PNG Working Group
Email:: public-png@w3.org

Intended usage:

Common

Author/Change controller:

W3C

B.Guidelines for private chunk types

This section is non-normative.

The following specifies guidelines for the definition of private chunks:

Do not define new chunks that redefine the meaning of existing chunks or change the interpretation of an existing standardized chunk, e.g., do not add a new chunk to say that RGB and alpha values actually mean CMYK.
Minimize the use of private chunks to aid portability.
Avoid defining chunks that depend on total datastream contents. If such chunks have to be defined, make them critical chunks.
For textual information that is representable in Latin-1 avoid defining a new chunk type. Use atEXt orzTXt chunk with a suitable keyword to identify the type of information. For textual information that is not representable in Latin-1 but which can be represented in UTF-8, use aniTXt chunk with a suitable keyword.
Group mutually dependent ancillary information into a single chunk. This avoids the need to introduce chunk ordering relationships.
Avoid defining private critical chunks.

C.Gamma and chromaticity

This section is non-normative.

Agamma value is a numerical parameter used to describe approximations to certain non-lineartransfer functions encountered in image capture and reproduction. Thegamma value is the exponent in a power law function. For example the function:

intensity = (voltage + constant)^exponent

which is used to model the non-linearity ofCRT displays. It is often assumed, as in this International Standard, that the constant is zero.

For the purposes of this specification, it is convenient to consider five places in a general image pipeline at which non-linear transfer functions may occur and which may be modelled by power laws. The characteristic exponent associated with each is given a specific name.

`input_exponent`	the exponent of the image sensor.
`encoding_exponent`	the exponent of anytransfer function performed by the process or device writing the datastream.
`decoding_exponent`	the exponent of anytransfer function performed by the software reading theimage datastream.
`LUT_exponent`	the exponent of thetransfer function applied between theframe buffer and the display device (typically this is applied by a Look Up Table).
`output_exponent`	the exponent of the display device. For aCRT, this is typically a value close to 2.2.

It is convenient to define some additional entities that describe some compositetransfer functions, or combinations of stages.

`display_exponent`	exponent of thetransfer function applied between theframe buffer and the display surface of the display device. `display_exponent = LUT_exponent * output_exponent`
`gamma`	exponent of the function mapping display output intensity to samples in the PNG datastream. `gamma = 1.0 / (decoding_exponent * display_exponent)`
`end_to_end_exponent`	the exponent of the function mapping image sensor input intensity to display output intensity. This is generally a value in the range 1.0 to 1.5.

The PNGgAMA chunk is used to record thegamma value. This information may be used by decoders together with additional information about the display environment in order to achieve, or approximate, the desired display output.

Additional information about this subject may be found [GAMMA-FAQ].

Additional information on the impact of color space on image encoding may be found in [Kasson] and [Hill].

Background information aboutchromaticity and color spaces may be found in [COLOR-FAQ].

D.SampleCRC implementation

The following sample code — which is informative — represents a practical implementation of theCRC (Cyclic Redundancy Check) employed in PNG chunks. (See also ISO 3309 [ISO-3309] or ITU-T V.42 [ITU-T-V.42] for a formal specification.)

The sample code is in the ISO C [ISO_9899] programming language. The hints inTable31 may help non-C users to read the code more easily.

Table31 Hints for reading ISO C code
Operator	Description
`&`	Bitwise AND operator.
`^`	Bitwise exclusive-OR operator.
`>>`	Bitwise right shift operator. When applied to an unsigned quantity, as here, right shift inserts zeroes at the left.
`!`	Logical NOT operator.
`++`	"`n++`" increments the variable`n`. In "for" loops, it is applied after the variable is tested.
`0xNNN`	`0x` introduces a hexadecimal (base 16) constant. Suffix`L` indicates a long value (at least 32 bits).

/* Table of CRCs of all 8-bit messages. */unsigned long crc_table[256];/* Flag: has the table been computed? Initially false. */int crc_table_computed =0;/* Make the table for a fast CRC. */voidmake_crc_table(void){  unsigned long c;  int n, k;for (n =0; n <256; n++) {    c = (unsigned long) n;for (k =0; k <8; k++) {if (c &1)        c = 0xedb88320L ^ (c >>1);else        c = c >>1;    }    crc_table[n] = c;  }  crc_table_computed =1;}

/* Update a running CRC with the bytes buf[0..len-1]--the CRC   should be initialized to all 1's, and the transmitted value   is the 1's complement of the final running CRC (see the   crc() routine below). */unsigned longupdate_crc(unsigned long crc, unsigned char *buf,                         int len){  unsigned long c = crc;  int n;if (!crc_table_computed)make_crc_table();for (n =0; n < len; n++) {    c = crc_table[(c ^ buf[n]) &0xff] ^ (c >>8);  }return c;}/* Return the CRC of the bytes buf[0..len-1]. */unsigned longcrc(unsigned char *buf, int len){returnupdate_crc(0xffffffffL, buf, len) ^ 0xffffffffL;}

E.Online resources

This section is non-normative.

Introduction

This annex gives the locations of some Internet resources for PNG software developers. By the nature of the Internet, the list is incomplete and subject to change.

E.1ICC profile specifications

ICC profile specifications are available at:https://www.color.org/

E.2PNG web site

There is a World Wide Web site for PNG athttp://www.libpng.org/pub/png/. This page is a central location for current information about PNG and PNG-related tools.

Additional documentation and portable C code fordeflate, and an optimized implementation of theCRC algorithm are available from the zlib web site,https://www.zlib.net/.

E.3Sample implementation and test images

A sample implementation in portable C,libpng, is available athttp://www.libpng.org/pub/png/libpng.html. Sample viewer and encoder applications of libpng are available athttp://www.libpng.org/pub/png/book/sources.html and are described in detail inPNG: The Definitive Guide [ROELOFS]. Test images can also be accessed from the PNG web site.

F.Changes

This section is non-normative.

F.1Changes since theProposed Recommendation of 15 May 2025

none

F.2Changes since theCandidate Recommendation Snapshot of 13 March 2025

none

F.3Changes since theCandidate Recommendation Draft of 21 January 2025 (Third Edition)

Credited original authors of theeXIf chunk
Removed pointless constraint that unsigned integers cannot be negative (!)
Moved "Collection" keyword from extensions document to main specification
Copied original security note regarding EXIF, from Extensions document
Updated reference to latest version of ITU H.273

F.4Changes since theCandidate Recommendation Draft of 18 July 2024 (Third Edition)

RenamedmDCv tomDCV
RenamedcLLi tocLLI
Consistently require thatacTL only comes beforeIDAT, notPLTE

F.5Changes since theCandidate Recommendation Snapshot of 21 September 2023 (Third Edition)

Clarified that bit depth and color type fields can take private values
Listed both decimal and hexadecimal values in MaxCLL and MaxFALL examples
Corrected terminology in mDCV section
Corrected "PNG image" in cHRM section
Consolidated color chunk precedence information
Added order of precedence for color space chunks
Clarified color chunk priorities description
Mark SMPTE RP 2077 as informative, since it is not freely available
Added color chunk priority table
Clarified alpha channels are always full-range
AddedEOTF and OETF definitions
Added recommendation for handling negative values resulting from narrow-range images using an extended range transfer function
Clarified "video" from "video full range flag) is used to match H.273 wording but still applies to still images
Added link to IANA subtag registry
Fixed links to alhpa compaction and alpha separation sections
Fixed broken tRNS link
Added Display P3 cICP example
Changed "perceived" wording to "measured", as luminance and chromaticity don't define a perceived color
Clarified which metadata forms part of HDR10
Updated link to latest version of ITU-R BT.2390
Noted that currently there is no published standard to adapt an SDR image from default viewing conditions (display luminance and ambient illumination) to those given inmDCV
Noted that the adaptation of BT.2100 HLG images to differing viewing conditions, given in BT.2390, may also be used with SDR images.
Noted that tone mapping, to adjust the luminance levels given incLLI to those of a target display to prevent clipping, is particularly important for formats such as BT.2100 PQ which use absolute luminance.
Noted that use of full-range BT.709 values, while common, is not part of the BT.709 standard
Added mention of protected code values for Serial Digital Interface (baseband video) in description of narrow-range and full-range video
Added reference to ITU-R-BT.2390
Changed description ofmDCV to focus on BT.2100 PQ, as use with other formats is currently rare
Clarified matching requirements for 16-bittRNS matching
Used more precise "primary chromaticities" rather than just "primaries"
Added requirement thatmDCV requires an accompanyingcICP to fit with expectations of existing HDR10 workflows
Better explanation ofVideo Full Range Flag
Clarified ordering and encoding of sequence numbers, clarifiedfdAT concatenation is in order of sequence number
Added missing mentions ofcICP in descriptions of related chunks
Fully specified the ordering and byte-order of fields inmDCV
Added missing definition of PNG two-byte unsigned integer
Used Display P3, rather than sRGB, in SDRmDCV examples
Added Chris Needham as co-author
Updated examples formDCV, added reference to SMPTE-ST-2086.
Added reference to Display P3
Explicitly mentioned that the concatenated data fromIDAT andfdAT may include chunks with zero-length data
Assorted non-substantive improvements to markup, styling, internal cross-linking, correction of typos, punctuation, grammar, consistent use of US English, etc

F.6Changes since the Working Draft of 20 July 2023 (Third Edition)

Clarified descriptions ofmDCV andcLLi
Added note to Security Considerations about potentially malicious data afterIEND.
Clarified thatcLLI is for HDR content
Added an informative reference to Smith & Zink "On the Calculation and Usage of HDR Static Content Metadata"
Added an informative reference to CTA-861.3-A for static HDR metadata
Fixed broken reference to ITU-R BT.2100
Updated reference to SMPTE-ST-2067-21
Added guidance on calculating MaxCLL and MaxFALL values
Added example (live streaming) wherecLLI could not be pre-calculated
Added definitions for stop, SDR, HDR, HLG and PQ
Clarified definition of narrow-range
Updated ITU-T H Suppl. 19 reference to latest version
Added Simon Thompson as an author
Mandated current browser handling of out-of-range palette indices
Updated "Additional Information" table to addmDCV andcLLI.

F.7Changes since the First Public Working Draft of 25 October 2022 (Third Edition)

Explained preferable handling of trailing bytes in the finalIDAT chunk for encoders and decoders.
Linked to open issue on tone-mappingHDR [ITU-R-BT.2100] images in the presence ofmDCV.
Follow the Encoding Standard on UTF-8 encode and decode.
Added definition of a frame.
Required the Matrix Coefficients incICP to be zero (RGB data).
Added known Privacy issue with recoverable data that only appears to have been redacted.
Improved advice on choosing filters.
Added links to color image type definitions.
Clarified that MaxFALL uses the values of the frame with highest mean luminance.
Clarified luminance units.
Prefer RFC 3339 format for Creation Time.
Improved the definition ofmDCV, with better descriptions, default values, and reference to SMPTE standards.
Refactored the terms and definitions, for clarity.
Improved definitions of source, reference, and PNG images.
Moved concepts from the terms and definitions section to the main prose.
Corrected error ineXIf chunk, which conflicted with thechunk ordering section.
SimplifiedScope section to remove redundant detail described elsewhere.
Redrew chunk-ordering lattice diagrams to be clearer and more consistent.
Added a new chunk,cLLI, to describe the Maximum Single-Pixel and Frame-Average Luminance Levels for both static and animatedHDR [ITU-R-BT.2100] PNG images.
Updated external links to latest versions, preferring https over http.
Specified interoperable handling of extra sample bits, beyond the specified bit depth, intRNS andbKGD chunks.
Added a new chunk,mDCV to describe the color volume of the mastering display used to gradeHDR [ITU-R-BT.2100] content.
Used correct Unicode character names.
Changed chunk type codes to use hexadecimal, rather than decimal.
Described textual chunk processing more clearly.
RecommendediTXt for new content.
Clarifications on the language tag field of theiTXt chunk, corrected examples to conform to BCP47.
Updated image/apng registration appendix.APNG MIME type registered with IANA.
Converted ACII-art figures to more accessible diagrams.

F.8Changes since the W3C Recommendation of 10 November 2003 (PNG Second Edition)

The three previously defined, but unofficial, chunks for Animated PNG (APNG) have been added:
- acTL Animation Control Chunk
- fcTL Frame Control Chunk
- fdAT Frame Data Chunk
This brings the PNG specification into alignment with widely deployed industry practice.
Added thecICP chunk, Coding-independent code points for video signal type identification, to contain image format metadata defined in [ITU-T-H.273] which enables PNG to contain [ITU-R-BT.2100] High Dynamic Range (HDR) and Wide Color Gamut (WCG) images.
For chunks which define the image color space, the order of precedence is clearly defined, if more than one is present.
The previously definedeXIf chunk has been moved from the PNG-Extensions document [PNG-EXTENSIONS] into the main body of this specification, to reflect its increasing use.
To help with tonemapping HDR content, added themDCV chunk, which contains metadata about the display used in mastering, andcLLI, which contains metadata about peak and average light levels. This enabled more accurate color matching on heterogeneous platforms
Clarified that theiCCP chunk, which contains an ICC profile, can contain profiles conforming to any version of the ICC.1 specification. PNG Second Edition only referenced the then-current v2 of ICC.1, although it has since become industry practice to also used higher versions.
Clarified handling of out-of-range indexes, for indexed-color PNG
Clarified error recovery for unknown and invalid ancillary chunks
Incorporation of allPNG Second Edition Errata. Notably, clarified that PNG images with unknown gamma value, when embedded in formats such as HTML or SVG, must be treated asuntagged images
Various editorial clarifications in response to community feedback
References updated to latest versions
Markup corrections and link fixes
Document source reformatted to use ReSpec

F.9Changes between First and Second Editions

For the list of changes betweenW3C Recommendation PNG Specification Version 1.0 andPNG Second Edition, seePNG Second Edition changelist

Movatterモバイル変換

Portable Network Graphics (PNG) Specification (Third Edition)

Abstract

Status of This Document

1.Introduction

2.Scope

3.Terms, definitions, and abbreviated terms

4.Concepts

4.1Static and Animated images

4.2Images

4.3Color spaces

4.4Reference image to PNG image transformation

Introduction

4.4.1Alpha separation

4.4.2Indexing

4.4.3RGB merging

4.4.4Alpha compaction

4.4.5Sample depth scaling

4.5PNG image

4.6Encoding the PNG image

Introduction

4.6.1Pass extraction

4.6.2Scanline serialization

4.6.3Filtering

4.6.4Compression

4.6.5Chunking

4.7Additional information

4.8PNG datastream

4.8.1Chunks

4.8.2Chunk types

4.9APNG: frame-based animation

Introduction

4.9.1Structure

4.9.2Sequence numbers

4.9.3Output buffer

4.9.4Canvas

4.10Error handling

4.11Extensions

5.Datastream structure

5.1PNG datastream

5.2PNG signature

5.3Chunk layout

5.4Chunk naming conventions

5.5CRC algorithm

5.6Chunk ordering

5.7Defining chunks

5.7.1General

5.7.2Defining public chunks

5.7.3Defining private chunks

5.8Private field values

6.Reference image to PNG image transformation

6.1Color types and values

6.2Alpha representation

7.Encoding the PNG image as a PNG datastream

7.1Integers and byte order

7.2Scanlines

7.3Filtering

8.Interlacing and pass extraction

Introduction

8.1Interlace methods

9.Filtering

9.1Filter methods and filter types

9.2Filter types for filter method 0

9.3Filter type 3: Average

9.4Filter type 4: Paeth

10.Compression

10.1Compression method 0

10.2Compression of the sequence of filtered scanlines

10.3Other uses of compression

11.Chunk specifications

11.1General

11.2Critical chunks

Introduction

11.2.1IHDR Image header

11.2.2PLTE Palette

11.2.3IDAT Image data

11.2.4IEND Image trailer

11.3Ancillary chunks

Introduction

11.3.1Transparency information