Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Video quality

From Wikipedia, the free encyclopedia
Perceived video degradation

Video quality is a characteristic of avideo passed through a video transmission or processing system that describes perceived video degradation (typically compared to the original video). Video processing systems may introduce some amount of distortion or artifacts in the video signal that negatively impact the user's perception of the system. For many stakeholders invideo production and distribution, ensuring video quality is an important task.

Video quality evaluation is performed to describe the quality of a set of video sequences under study. Video quality can be evaluated objectively (bymathematical models) or subjectively (by asking users for their rating). Also, the quality of a system can be determined offline (i.e., in a laboratory setting for developing new codecs or services) orin-service (to monitor and ensure a certain level of quality).

From analog to digital video

[edit]

Since the world's first video sequence was recorded and transmitted, many video processing systems have been designed. Such systemsencode video streams and transmit them over various kinds of networks or channels. In the age ofanalog video systems, it was possible to evaluate the quality aspects of a video processing system by calculating the system'sfrequency response using test signals (for example, a collection of color bars and circles).

Digital video systems have almost fully replaced analog ones, and quality evaluation methods have changed. The performance of a digital video processing and transmission system can vary significantly and depends on many factors, including the characteristics of the input video signal (e.g., amount of motion or spatial details), the settings used for encoding and transmission, and the channel fidelity ornetwork performance.

Objective video quality

[edit]

Objective video quality models aremathematical models that approximate results fromsubjective quality assessment, in which human observers are asked to rate the quality of a video.[1] In this context, the termmodel may refer to a simple statistical model in which several independent variables (e.g., thepacket loss rate on a network and the video coding parameters) are fit against results obtained in a subjective quality evaluation test usingregression techniques. A model may also be a more complicated algorithm implemented insoftware orhardware.

Terminology

[edit]

The termsmodel andmetric are often used interchangeably in the field to mean adescriptive statistic that provides anindicator of quality. The term “objective” refers to the fact that, in general, quality models are based on criteria that can bemeasured objectively, that is, free from human interpretation. They can be automatically evaluated by acomputer program. Unlike a panel of human observers, an objective model should always deterministically output the same quality score for a given set of input parameters.

Objective quality models are sometimes also referred to asinstrumental (quality) models,[2][3] in order to emphasize their application as measurement instruments. Some authors suggest that the term “objective” is misleading, as it “implies that instrumental measurements bear objectivity, which they only do in cases where they can be generalized.”[4]

Classification of objective video quality models

[edit]
Classification of objective video quality models into Full-Reference, Reduced-Reference and No-Reference.
No-reference image and video quality assessment methods.

Objective models can be classified by the amount of information available about the original signal, the received signal, or whether there is a signal present at all:[5]

  • Full Reference Methods (FR): FR models compute the quality difference by comparing the original video signal against the receivedvideo signal. Typically, every pixel from the source is compared against the corresponding pixel in the received video, with no knowledge about the encoding or transmission process in between. More elaborate algorithms may choose to combine the pixel-based estimation with other approaches, such as those described below. FR models are usually the most accurate, at the expense of higher computational effort. As they require the availability of the original video before transmission or coding, they cannot be used in all situations (e.g., where the quality is measured from a client device).
  • Reduced Reference Methods (RR): RR models extract some features of both videos and compare them to give a quality score. They are used when all the original video is not available or when it would be practically impossible to do so, e.g., in a transmission with a limited bandwidth. This makes them more efficient than FR models at the expense of lower accuracy.
  • No-Reference Methods (NR): NR models try to assess the quality of a distorted video without any reference to the original signal. Due to the absence of an original signal, they may be less accurate than FR or RR approaches but are more efficient to compute. The Video Quality Experts Group has a dedicated working group on developing no-reference metrics (calledNORM).
    • Pixel-Based Methods (NR-P):Pixel-based models use a decoded representation of the signal and analyze the quality based on the pixel information. Some of these evaluate specific degradation types only, such as blurring or othercoding artifacts.
    • Parametric/Bitstream Methods (NR-B):These models make use of features extracted from the transmission container and/or video bitstream, e.g.,MPEG-TS packet headers,motion vectors, and quantization parameters. They do not have access to the original signal and require no decoding of the video, which makes them more efficient. In contrast to NR-P models, they have no access to the final decoded signal. In some cases, the prediction accuracy of bitstream-based metrics can reach one full reference without requiring a reference.[6]
    • Hybrid Methods (Hybrid NR-P-B):Hybrid models combine parameters extracted from the bitstream with a decoded video signal.[7] They are therefore a mix between the NR-P and NR-B models.

Use of picture quality models for video quality estimation

[edit]

Some models that are used for video quality assessment (such asPSNR orSSIM) are simplyimage quality models, whose output is calculated for every frame of a video sequence. An overview of recent no-referenceimage quality models has also been given in a journal paper by Shahid et al.[5]

The quality measure of every frame in a video (as determined by an image quality model) can then be recorded and pooled over time to assess the quality of an entire video sequence. While this method is easy to implement, it does not factor in certain kinds of degradations that develop over time, such as the moving artifacts caused bypacket loss and itsconcealment. A video quality model that considers the temporal aspects of quality degradations, likeVQM or theMOVIE Index, may be able to produce more accurate predictions of human-perceived quality.

Video quality artifacts

[edit]

The estimation ofvisual artifacts is a well known technique for estimating overall video quality. The majority of these artifacts arecompression artifacts caused by lossy compression. Some of the attributes typically estimated by pixel-based metrics include:

Spatial

  • Blurring — a result of loss of high spatial frequency image detail, usually at sharp edges.
  • Blocking — is caused by multiple algorithms because of the internal representation of an image with blocks size 8, 16, or 32. With specific parameters, they can average pixels inside a block making blocks distinct
  • Ringing, echoing or ghosting - takes the form of a “halo,” band, or “ghost” near sharp edges.
  • Color bleeding — occurs when the edges of one colour in the image unintentionally bleeds or overlaps into another colour
  • Staircase noise — is a special case of blocking along a diagonal or curved edge. Rather than rendering as smooth, it takes on the appearance of stair steps

Temporal

  • Flickering — is usually frequent brightness or colour changes along the time dimension. It is often broken out as fine-grain flickering and coarse-grain flickering.
  • Mosquito noise — a variant of flickering, it’s typified as haziness and/or shimmering around high-frequency content (sharp transitions between foreground entities and the background or hard edges).
  • Floating — refers to illusory motion in certain regions while the surrounding areas remain static. Visually, these regions appear as if they were floating on top of the surrounding background
  • Jerkiness or judder — is the perceived uneven or wobbly motion due to frame sampling. It’s often caused by the conversion of 24 fps movies to a 30 or 60 fps video format.

Examples of video quality metrics

[edit]

This section lists examples of video quality metrics.

This table isincomplete; you can help byexpanding it.
MetricUsageDescription
Full-ReferencePSNR (Peak Signal-to-Noise Ratio)ImageIt is calculated between every frame of the original and the degraded video signal. PSNR is the most widely used objective image quality metric. However, PSNR values do not correlate well with perceived picture quality due to the complex, highly non-linear behaviour of the human visual system.
SSIM[8] (Structural SIMilarity)ImageSSIM is a perception-based model that considers image degradation as perceived change in structural information, while also incorporating important perceptual phenomena, including both luminance masking and contrast masking terms.
MOVIE Index[9] (MOtion-based Video Integrity Evaluation)VideoThe MOVIE index is a neuroscience-based model for predicting the perceptual quality of a (possibly compressed or otherwise distorted) motion picture or video against a pristine reference video.
VMAF[10] (Video Multimethod Assessment Fusion)VideoVMAF uses different features to predict video quality, which are fused using aSVM-based regression to provide a single output score. These scores are then temporally pooled over the entire video sequence using thearithmetic mean to provide an overallmean opinion score (MOS).
VQM[11]VideoThis model has been standardized in ITU-T Rec. J.144 in 2001.
Reduced-ReferenceSRR[12] (SSIM Reduced-Reference)VideoSRR value is calculated as the ratio of received (target) video signal SSIM with reference video pattern SSIM values.
ST-RRED[13]VideoCompute wavelet coefficients of frame differences between the adjacent frames in a video sequence (modeled by a Gaussian Scale Mixture). It is used to evaluate RR entropic differences leading to temporal RRED.It in conjunction with spatial RRED indices evaluated by applying the RRED index on every frame of the video, yield the spatio-temporal RRED
ITU-T Rec.P.1204.4VideoThis reduced-reference model compares features extracted from a reference video with a distorted (compressed video).[14]
No-ReferenceNIQE[15] Naturalness Image Quality EvaluatorImageThis IQA model is founded on perceptually relevant spatial domain natural scene statistic (NSS) features extracted from local image patches that effectively capture the essential low-order statistics of natural images.
BRISQUE[16] Blind/Referenceless Image Spatial Quality EvaluatorImageThe method extracts the pointwise statistics of local normalized luminance signals and measures image naturalness (or lack thereof) based on measured deviations from a natural image model. It also models the distribution of pairwise statistics of adjacent normalized luminance signals which provides distortion orientation information.
Video-BLIINDS[17]VideoComputes statistical models on DCT coefficients of frame differences and calculates motion characterization. Pedicts score based on those features usingSVM.
ITU-T Rec.P.1203.1VideoThis is a metric that is part of theP.1203 family of standards, which can use either metadata only (codec, resolution, bitrate, framerate), frame information (frame types and sizes), or the entire bitstream to analyze the quality of a compressed video. It is primarily intended to be used in the context of HTTP adaptive streaming.
ITU-T Rec.P.1204.3VideoThis model uses the video bitstream to analyze compression/coding quality based on features like quantization parameters and motion vectors.[14]
ITU-T Rec.P.1204.5VideoThis is a hybrid model that uses the decoded pixels and information about the video codec to determine final video quality.[14]

Training and performance evaluation

[edit]

Since objective video quality models are expected to predict results given by human observers, they are developed with the aid ofsubjective test results. During the development of an objective model, its parameters should be trained so as to achieve the best correlation between the objectively predicted values and the subjective scores, often available asmean opinion scores (MOS).

The most widely used subjective test materials are in the public domain and include still pictures, motion pictures, streaming video, high definition, 3-D (stereoscopic), and special-purpose picture quality-related datasets.[18] These so-called databases are created by various research laboratories around the world. Some of them have become de facto standards, including several public-domain subjective picture quality databases created and maintained by theLaboratory for Image and Video Engineering (LIVE) as well theTampere Image Database 2008. A collection of databases can be found in theQUALINET Databases repository. TheConsumer Digital Video Library (CDVL) hosts freely available video test sequences for model development.

Some databases also provide pre-computed metric scores to allow others to benchmark new metrics against existing ones. Examples can be seen in the table below

Examples of Video Model Benchmark Databases
BenchmarkNumber of videosNumber of metricsType of metrics
LIVE-VQC58511No-reference
KoNViD-1k1,20011No-reference
YouTube-UGC1,5009No-reference
MSU No-Reference VQA2,50015No-reference
MSU Full-Reference VQA2,50044Full-reference
LIVE-FB Large-Scale Social Video Quality39,0006No-reference
LIVE-ETRI4375No-reference
LIVE Livestream3153No-reference

In theory, a model can be trained on a set of data in such a way that it produces perfectly matching scores on that dataset. However, such a model will beover-trained and will therefore not perform well on new datasets. It is therefore advised tovalidate models against new data and use the resulting performance as a real indicator of the model's prediction accuracy.

To measure the performance of a model, some frequently used metrics are thelinear correlation coefficient,Spearman's rank correlation coefficient, and theroot mean square error (RMSE). Other metrics are thekappa coefficient and theoutliers ratio. ITU-T Rec.P.1401 gives an overview of statistical procedures to evaluate and compare objective models.

Uses and application of objective models

[edit]

Objective video quality models can be used in various application areas. Invideo codec development, the performance of a codec is often evaluated in terms of PSNR or SSIM. For service providers, objective models can be used for monitoring a system. For example, anIPTV provider may choose to monitor their service quality by means of objective models, rather than asking users for their opinion, or waiting for customer complaints about bad video quality. Few of these standards have found commercial applications, includingPEVQ andVQuad-HD.SSIM is also part of a commercially available video quality toolset (SSIMWAVE).VMAF is used byNetflix to tune their encoding and streaming algorithms, and to quality-control all streamed content.[19][20] It is also being used by other technology companies likeBitmovin[21] and has been integrated into software such asFFmpeg.

An objective model should only be used in the context that it was developed for. For example, a model that was developed using a particular video codec is not guaranteed to be accurate for another video codec. Similarly, a model trained on tests performed on a large TV screen should not be used for evaluating the quality of a video watched on a mobile phone.

Other approaches

[edit]

When estimating quality of a video codec, all the mentioned objective methods may require repeating post-encoding tests in order to determine the encoding parameters that satisfy a required level of visual quality, making them time consuming, complex and impractical for implementation in real commercial applications. There is ongoing research into developing novel objective evaluation methods which enable prediction of the perceived quality level of the encoded video before the actual encoding is performed.[22]

Subjective video quality

[edit]
Main article:Subjective video quality

The main goal of many-objective video quality metrics is to automatically estimate the average user's (viewer's) opinion on the quality of a video processed by a system. Procedures forsubjective video quality measurements are described inITU-R recommendationBT.500 and ITU-T recommendationP.910. In such tests, video sequences are shown to a group of viewers. The viewers' opinion is recorded and averaged into themean opinion score to evaluate the quality of each video sequence. However, the testing procedure may vary depending on what kind of system is tested.

Tools for video quality assessment

[edit]
This table isincomplete; you can help byexpanding it.
ToolDescriptionАvailabilityLicenseIncluded metrics
FFmpegFree and open-source multimedia tool that incorporates some video quality metricsFreeOpen sourcePSNR, SSIM,VMAF
MSU VQMTA software suite for objective video quality assessment (full reference and no reference)Free for basic metrics

Paid for HDR metrics

ProprietaryPSNR, SSIM, MS-SSIM, 3SSIM,VMAF, NIQE, VQM, Delta, MSAD, MSE

MSU developed metrics: Blurring Metric, Blocking Metric, Brightness Flicking Metric, Drop Frame Metric, Noise Estimation Metric

EPFL VQMTVarious metrics implemented in OpenCV (C++) based on existing MATLAB implementationsFreeOpen sourcePSNR, PSNR-HVS, PSNR-HVS-M, SSIM, MS-SSIM, VIFp
OpenVQA toolkit implementing various metrics including the authors' OPVQFreeOpen sourcePSNR, SSIM, OPVQ
ElecardA commercial video quality estimation programDemo version availableProprietaryPSNR, APSNR, MSAD, MSE, SSIM, Delta, VQM, NQI,VMAF, VIF
AviSynthA video processing tool that can be used as a plugin or via scriptionFreeOpen sourceSSIM
VQ ProbeA software to calculate video quality metricsFreeProprietaryPSNR, SSIM,VMAF
vmaf.devAn online video quality calculation software implementing VMAFFreeOpen sourceVMAF

See also

[edit]

References

[edit]
  1. ^"Objective video quality assessment methods for Video assistant refereeing (VAR) System"(PDF).
  2. ^Raake, Alexander (2006).Speech quality of VoIP : assessment and prediction. Wiley InterScience (Online service). Chichester, England: Wiley.ISBN 9780470030608.OCLC 85785040.
  3. ^Möller, Sebastian (2000).Assessment and Prediction of Speech Quality in Telecommunications. Boston, MA: Springer US.ISBN 9781475731170.OCLC 851800613.
  4. ^Raake, Alexander; Egger, Sebastian (2014).Quality of Experience. T-Labs Series in Telecommunication Services. Springer, Cham. pp. 11–33.doi:10.1007/978-3-319-02681-7_2.ISBN 9783319026800.
  5. ^abShahid, Muhammad; Rossholm, Andreas; Lövström, Benny; Zepernick, Hans-Jürgen (2014-08-14)."No-reference image and video quality assessment: a classification and review of recent approaches".EURASIP Journal on Image and Video Processing.2014: 40.doi:10.1186/1687-5281-2014-40.ISSN 1687-5281.
  6. ^Barman, Nabajeet; Reznik, Yuriy; Martini, Maria G. (2023). "A Subjective Dataset for Multi-Screen Video Streaming Applications".arXiv:2305.03138 [cs.MM].
  7. ^Lee, Seon-Oh; Jung, Kwang-Su; Sim, Dong-Gyu (2010)."Real-time Objective Quality Assessment based on Coding Parameters Extracted from H.264/AVC Bitstream".IEEE Transactions on Consumer Electronics.56 (2):1071–1078.doi:10.1109/TCE.2010.5506041.S2CID 23190244.
  8. ^Wang, Zhou; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. (2004-04-01). "Image quality assessment: from error visibility to structural similarity".IEEE Transactions on Image Processing.13 (4):600–612.Bibcode:2004ITIP...13..600W.CiteSeerX 10.1.1.2.5689.doi:10.1109/TIP.2003.819861.ISSN 1057-7149.PMID 15376593.S2CID 207761262.
  9. ^Seshadrinathan, K.; Bovik, A.C. (2010-02-01). "Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos".IEEE Transactions on Image Processing.19 (2):335–350.Bibcode:2010ITIP...19..335S.CiteSeerX 10.1.1.153.9018.doi:10.1109/TIP.2009.2034992.ISSN 1057-7149.PMID 19846374.S2CID 15356687.
  10. ^vmaf: Perceptual video quality assessment based on multi-method fusion, Netflix, Inc., 2017-07-14, retrieved2017-07-15
  11. ^"Description of Video Quality Metric (VQM) Software - ITS".its.ntia.gov. Retrieved2023-07-12.
  12. ^Kourtis, M.-A.; Koumaras, H.; Liberal, F. (July–August 2016)."Reduced-reference video quality assessment using a static video pattern".Journal of Electronic Imaging.25 (4): 043011.Bibcode:2016JEI....25d3011K.doi:10.1117/1.jei.25.4.043011.
  13. ^Soundararajan, R.; Bovik, A.C. (2013-04-04). "Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing".IEEE Transactions on Circuits and Systems for Video Technology.23 (4):684–694.doi:10.1109/tcsvt.2012.2214933.S2CID 206661510.
  14. ^abcRaake, Alexander; Borer, Silvio; Satti, Shahid M.; Gustafsson, Jorgen; Rao, Rakesh Rao Ramachandra; Medagli, Stefano; List, Peter; Goring, Steve; Lindero, David; Robitza, Werner; Heikkila, Gunnar; Broom, Simon; Schmidmer, Christian; Feiten, Bernhard; Wustenhagen, Ulf (2020)."Multi-Model Standard for Bitstream-, Pixel-Based and Hybrid Video Quality Assessment of UHD/4K: ITU-T P.1204".IEEE Access.8:193020–193049.doi:10.1109/ACCESS.2020.3032080.ISSN 2169-3536.S2CID 226293635.
  15. ^Mittal, A.; Soundararajan, R.; Bovik, A.C. (March 2013). "Making a "Completely Blind" Image Quality Analyzer".IEEE Signal Processing Letters.20 (3):209–212.Bibcode:2013ISPL...20..209M.doi:10.1109/lsp.2012.2227726.S2CID 16892725.
  16. ^Mittal, A.; Moorthy, A.K.; Bovik, A.C. (2011-11-09). "Blind/Referenceless Image Spatial Quality Evaluator".2011 Conference Record of the Forty Fifth Asilomar Conference on Signals, Systems and Computers (ASILOMAR). pp. 723–727.doi:10.1109/acssc.2011.6190099.ISBN 978-1-4673-0323-1.S2CID 16388844.
  17. ^Saad, M. A.; Bovik, A. C.; Charrier, C. (March 2014). "Blind Prediction of Natural Video Quality".IEEE Transactions on Image Processing.23 (3):1352–1365.Bibcode:2014ITIP...23.1352S.CiteSeerX 10.1.1.646.9045.doi:10.1109/tip.2014.2299154.ISSN 1057-7149.PMID 24723532.S2CID 14314450.
  18. ^Liu, Tsung-Jung; Lin, Yu-Chieh; Lin, Weisi; Kuo, C.-C. Jay (2013)."Visual quality assessment: recent developments, coding applications and future trends".APSIPA Transactions on Signal and Information Processing.2.doi:10.1017/atsip.2013.5.hdl:10356/106287.ISSN 2048-7703.
  19. ^Blog, Netflix Technology (2016-06-06)."Toward A Practical Perceptual Video Quality Metric".Netflix TechBlog. Retrieved2017-10-08.
  20. ^Blog, Netflix Technology (2018-10-26)."VMAF: The Journey Continues".Medium. Retrieved2019-10-23.
  21. ^"Per-Scene Adaptation: Going Beyond Bitrate".Bitmovin. 2018-01-05. Retrieved2019-10-23.
  22. ^Koumaras, H.; Kourtis, A.; Martakos, D.; Lauterjung, J. (2007-09-01). "Quantified PQoS assessment based on fast estimation of the spatial and temporal activity level".Multimedia Tools and Applications.34 (3):355–374.doi:10.1007/s11042-007-0111-1.ISSN 1380-7501.S2CID 14136479.

Further reading

[edit]
Retrieved from "https://en.wikipedia.org/w/index.php?title=Video_quality&oldid=1259107991"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp