This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed. Find sources: "Grayscale" – news ·newspapers ·books ·scholar ·JSTOR(March 2023) (Learn how and when to remove this message) |

| Color depth |
|---|
| Related |
Indigital photography,computer-generated imagery, andcolorimetry, agrayscale (American English) orgreyscale (Commonwealth English)image is one in which the value of eachpixel holds no color information and only expresses a shade of gray. Pixel values are typically stored in the range 0 to 255 (black to white).[1]
Grayscale images, areblack-and-white or graymonochrome, and composed exclusively of shades of gray. Thecontrast ranges fromblack at the weakest intensity towhite at the strongest.[2] Grayscale images are distinct from one-bit bi-tonal black-and-white images, which, in the context of computer imaging, are images with only twocolors: black and white (also calledbilevel orbinary images). Grayscale images have many shades of gray in between.
Grayscale images can be the result of measuring the intensity of light at each pixel according to a particular weighted combination of frequencies (or wavelengths), and in such cases they aremonochromatic proper when only a singlefrequency (in practice, a narrow band of frequencies) is captured. The frequencies can in principle be from anywhere in theelectromagnetic spectrum (e.g.infrared,visible light,ultraviolet, etc.).
Acolorimetric (or more specificallyphotometric) grayscale image is an image that has a defined grayscalecolorspace, which maps the stored numeric sample values to the achromatic channel of a standard colorspace, which itself is based on measured properties ofhuman vision.
If the original color image has no defined colorspace, or if the grayscale image is not intended to have the same human-perceived achromatic intensity as the color image, then there is no uniquemapping from such a color image to a grayscale image.
The intensity of a pixel is expressed within a given range between a minimum and a maximum, inclusive. This range is represented in an abstract way as a range from 0 (or 0%) (total absence, black) and 1 (or 100%) (total presence, white), with any fractional values in between. This notation is used in academic papers, but this does not define what "black" or "white" is in terms ofcolorimetry. Sometimes the scale is reversed, as inprinting where the numeric intensity denotes how much ink is employed inhalftoning, with 0% representing the paper white (no ink) and 100% being a solid black (full ink).
In computing, although the grayscale can be computed throughrational numbers, image pixels are usuallyquantized to store them as unsigned integers, to reduce the required storage and computation. Some early grayscale monitors can only display up to sixteen different shades, which would be stored inbinary form using 4bits.[citation needed] But today grayscale images intended for visual display are commonly stored with 8 bits per sampled pixel. This pixeldepth allows 256 different intensities (i.e., shades of gray) to be recorded, and also simplifies computation as each pixel sample can be accessed individually as one fullbyte. However, if these intensities were spaced equally in proportion to the amount of physical light they represent at that pixel (called a linear encoding or scale), the differences between adjacent dark shades could be quite noticeable as bandingartifacts, while many of the lighter shades would be "wasted" by encoding a lot of perceptually-indistinguishable increments. Therefore, the shades are instead typically spread out evenly on agamma-compressed nonlinear scale, which better approximates uniform perceptual increments for both dark and light shades, usually making these 256 shades enough to avoid noticeable increments.[3]
Technical uses (e.g. inmedical imaging orremote sensing applications) often require more levels, to make full use of thesensor accuracy (typically 10 or 12 bits per sample) and to reduce rounding errors in computations. Sixteen bits per sample (65,536 levels) is often a convenient choice for such uses, as computers manage 16-bitwords efficiently. TheTIFF andPNG (among other)image file formats support 16-bit grayscale natively, although browsers and many imaging programs tend to ignore the low order 8 bits of each pixel. Internally for computation and working storage, image processing software typically uses integer orfloating-point numbers of size 16 or 32 bits.

Conversion of an arbitrary color image to grayscale is not unique in general; different weighting of the color channels effectively represent the effect of shooting black-and-white film with different-coloredphotographic filters on the cameras.
A common strategy is to use the principles ofphotometry or, more broadly,colorimetry to calculate the grayscale values (in the target grayscale colorspace) so as to have the same luminance (technically relative luminance) as the original color image (according to its colorspace).[4][5] In addition to the same (relative) luminance, this method also ensures that both images will have the sameabsolute luminance when displayed, as can be measured by instruments in itsSI units ofcandelas per square meter, in any given area of the image, given equalwhitepoints. Luminance itself is defined using a standard model of human vision, so preserving the luminance in the grayscale image also preserves other perceptuallightness measures, such asL* (as in the 1976 CIELab color space) which is determined by the linear luminanceY itself (as in theCIE 1931XYZ color space) which we will refer to here asYlinear to avoid any ambiguity.
To convert a color from a colorspace based on a typicalgamma-compressed (nonlinear)RGB color model to a grayscale representation of its luminance, the gamma compression function must first be removed via gamma expansion (linearization) to transform the image to a linear RGB colorspace, so that the appropriateweighted sum can be applied to the linear color components () to calculate the linear luminanceYlinear, which can then be gamma-compressed back again if the grayscale result is also to be encoded and stored in a typical nonlinear colorspace.[6]
For the commonsRGB color space, gamma expansion is defined as
whereCsrgb represents any of the three gamma-compressed sRGB primaries (Rsrgb,Gsrgb, andBsrgb, each in range [0,1]) andClinear is the corresponding linear-intensity value (Rlinear,Glinear, andBlinear, also in range [0,1]). Then, linear luminance is calculated as a weighted sum of the three linear-intensity values. ThesRGB color space is defined in terms of theCIE 1931 linear luminanceYlinear, which is given by[7]
These three particular coefficients represent the intensity (luminance) perception of typicaltrichromat humans to light of the preciseRec. 709 additive primary colors (chromaticities) that are used in the definition of sRGB. Human vision is most sensitive to green, so this has the greatest coefficient value (0.7152), and least sensitive to blue, so this has the smallest coefficient (0.0722). To encode grayscale intensity in linear RGB, each of the three color components can be set to equal the calculated linear luminance (replacing by the values to get this linear grayscale), which then typically needs to begamma compressed to get back to a conventional non-linear representation.[8] For sRGB, each of its three primaries is then set to the same gamma-compressedYsrgb given by the inverse of the gamma expansion above as
Because the three sRGB components are then equal, indicating that it is actually a gray image (not color), it is only necessary to store these values once, and we call this the resulting grayscale image. This is how it will normally be stored in sRGB-compatible image formats that support a single-channel grayscale representation, such as JPEG or PNG. Web browsers and other software that recognizes sRGB images should produce the same rendering for such a grayscale image as it would for a "color" sRGB image having the same values in all three color channels.
For images in color spaces such asY'UV and its relatives, which are used in standard color TV and video systems such asPAL,SECAM, andNTSC, a nonlinearluma component(Y′) is calculated directly from gamma-compressed primary intensities as a weighted sum, which, although not a perfect representation of the colorimetric luminance, can be calculated more quickly without the gamma expansion and compression used in photometric/colorimetric calculations. In theY'UV andY'IQ models used by PAL and NTSC, therec601 luma(Y′) component is computed aswhere we use the prime to distinguish these nonlinear values from the sRGB nonlinear values (discussed above) which use a somewhat different gamma compression formula, and from the linear RGB components. TheITU-R BT.709 standard used forHDTV developed by theATSC uses different color coefficients, computing the luma component asAlthough these are numerically the same coefficients used in sRGB above, the effect is different because here they are being applied directly to gamma-compressed values rather than to the linearized values. TheITU-R BT.2100 standard forHDR television uses yet different coefficients, computing the luma component as
Normally these colorspaces are transformed back to nonlinear R'G'B' before rendering for viewing. To the extent that enough precision remains, they can then be rendered accurately.
But if the luma component Y' itself is instead used directly as a grayscale representation of the color image, luminance is not preserved: two colors can have the same lumaY′ but different CIE linear luminanceY (and thus different nonlinearYsrgb as defined above) and therefore appear darker or lighter to a typical human than the original color. Similarly, two colors having the same luminanceY (and thus the sameYsrgb) will in general have different luma by either of theY′ luma definitions above.[9]
This sectiondoes notcite anysources. Please helpimprove this section byadding citations to reliable sources. Unsourced material may be challenged andremoved.(March 2023) (Learn how and when to remove this message) |
Color images are often built of several stackedcolor channels, each of them representing value levels of the given channel. For example,RGB images are composed of three independent channels for red, green and blueprimary color components;CMYK images have four channels for cyan, magenta, yellow and blackink plates, etc.
Here is an example of color channel splitting of a full RGB color image. The column at left shows the isolated color channels in natural colors, while at right there are their grayscale equivalences:

The reverse is also possible: to build a full-color image from their separate grayscale channels. By mangling channels, using offsets, rotating and other manipulations, artistic effects can be achieved instead of accurately reproducing the original image.[10]