Generating high-resolution video frames from given low-resolution ones
This article is about the video frame restoration technique. For the video upscaling tool by Nvidia, see Video Super Resolution.
Comparison of the outputs of VSR and SISR methods. VSR restores more details by using temporal information.
Video super-resolution (VSR) is the process of generating high-resolution video frames from given low-resolution video frames. Unlike single-image super-resolution (SISR), the goal is not only to restore fine details while preserving coarse ones, but also to preserve motion consistency.
Many approaches have been proposed for this task, but the problem remains popular and challenging.
Most research considers the degradation process of frames as

$y_t = (x_t \circledast k)\downarrow_s + n_t$

where:
$x_t$ — original high-resolution frame sequence,
$k$ — blur kernel,
$\circledast$ — convolution operation,
$\downarrow_s$ — downscaling operation,
$n_t$ — additive noise,
$y_t$ — low-resolution frame sequence.
Super-resolution is the inverse operation, so the problem is to estimate a frame sequence $\hat{x}_t$ from the frame sequence $y_t$ so that $\hat{x}_t$ is close to the original. The blur kernel, the downscaling operation and the additive noise should be estimated for a given input to achieve better results.
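As an illustration, the degradation model above can be simulated in a few lines. The following is a minimal sketch, assuming a Gaussian blur kernel, stride-based downscaling and additive Gaussian noise; the `degrade` function name is purely illustrative.

```python
# Sketch of the degradation model y_t = (x_t * k) downscaled by s, plus noise,
# assuming Gaussian blur, stride-based downscaling and Gaussian noise.
import numpy as np
from scipy.ndimage import gaussian_filter

def degrade(hr_frames, blur_sigma=1.5, scale=4, noise_std=0.01):
    """hr_frames: array of shape (T, H, W) with values in [0, 1]."""
    lr_frames = []
    for x in hr_frames:
        blurred = gaussian_filter(x, sigma=blur_sigma)              # x * k (blur kernel)
        down = blurred[::scale, ::scale]                            # downscaling by s
        noisy = down + np.random.normal(0, noise_std, down.shape)   # additive noise
        lr_frames.append(np.clip(noisy, 0.0, 1.0))
    return np.stack(lr_frames)

# Example: degrade a random 10-frame 256x256 "video" to 64x64.
y = degrade(np.random.rand(10, 256, 256))
```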
Video super-resolution approaches tend to have more components than their image counterparts, as they need to exploit the additional temporal dimension. Complex designs are not uncommon. The most essential components of VSR are guided by four basic functionalities: Propagation, Alignment, Aggregation, and Upsampling.[1] A minimal pipeline sketch follows the list below.
Propagation refers to the way in which features are propagated temporally
Alignment concerns the spatial transformation applied to misaligned images/features
Aggregation defines the steps to combine aligned features
Upsampling describes the method to transform the aggregated features to the final output image
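The four functionalities can be pictured as a tiny pipeline skeleton. The sketch below uses deliberately trivial stand-ins (identity alignment, temporal averaging, interpolation-based upsampling) and hypothetical function names; a real VSR network would replace each stage with learned components.

```python
# Illustrative skeleton of the four functionalities: propagation, alignment,
# aggregation, upsampling. All stages are trivial placeholders.
import numpy as np
from scipy.ndimage import zoom

def align(frame, reference):
    # Stand-in: a real model would warp `frame` toward `reference`
    # using estimated motion or deformable convolution.
    return frame

def aggregate(aligned_frames):
    # Stand-in: simple temporal averaging instead of learned fusion.
    return np.mean(aligned_frames, axis=0)

def upsample(features, scale=4):
    # Stand-in: cubic interpolation instead of a learned decoder.
    return zoom(features, scale, order=3)

def vsr_pipeline(lr_frames, scale=4):
    outputs = []
    state = lr_frames[0]                       # propagated temporal information
    for frame in lr_frames:
        aligned = [align(f, frame) for f in (state, frame)]
        fused = aggregate(np.stack(aligned))   # aggregation of aligned features
        outputs.append(upsample(fused, scale))
        state = fused                          # propagation to the next step
    return outputs

hr_estimate = vsr_pipeline(np.random.rand(5, 64, 64))
```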
When working with video, temporal information can be used to improve upscaling quality. Single-image super-resolution methods can be used as well, generating high-resolution frames independently from their neighbours, but this is less effective and introduces temporal instability. There are a few traditional methods, which consider the video super-resolution task as an optimization problem. In recent years, deep learning based methods for video upscaling have outperformed traditional ones.
There are several traditional methods for video upscaling. These methods exploit natural image priors and estimate motion between frames. The high-resolution frame is reconstructed based on both the priors and the estimated motion.
First, the low-resolution frame is transformed to the frequency domain. The high-resolution frame is estimated in this domain. Finally, the resulting frame is transformed back to the spatial domain. Some methods use the Fourier transform, which helps to extend the spectrum of the captured signal and thus increase resolution. There are different approaches for these methods: using weighted least squares theory,[2] the total least squares (TLS) algorithm,[3] space-varying[4] or spatio-temporal[5] varying filtering. Other methods use the wavelet transform, which helps to find similarities in neighboring local areas.[6] Later the second-generation wavelet transform was used for video super-resolution.[7]
Iterative back-projection methods assume some function between low-resolution and high-resolution frames and try to improve this guessed function in each step of an iterative process.[8] Projections onto convex sets (POCS), which defines a specific cost function, can also be used for iterative methods.[9]
Iterative adaptive filtering algorithms use the Kalman filter to estimate the transformation from the low-resolution frame to the high-resolution one.[10] To improve the final result, these methods consider temporal correlation among low-resolution sequences. Some approaches also consider temporal correlation among the high-resolution sequence.[11] A common way to approximate the Kalman filter is to use least mean squares (LMS).[12] One can also use steepest descent,[13] least squares (LS),[14] or recursive least squares (RLS).[14]
Direct methods estimate motion between frames, upscale a reference frame, and warp neighboring frames to the high-resolution reference. To construct the result, these upscaled frames are fused together by a median filter,[15] weighted median filter,[16] adaptive normalized averaging, AdaBoost classifier[17] or SVD-based filters.[18]
Non-parametric algorithms join motion estimation and frame fusion into one step, performed by considering patch similarities. Weights for fusion can be calculated by nonlocal-means filters.[19] To strengthen the search for similar patches, one can use a rotation-invariant similarity measure[20] or an adaptive patch size.[21] Calculating intra-frame similarity helps to preserve small details and edges.[22] Parameters for fusion can also be calculated by kernel regression.[23]
In approaches with alignment, neighboring frames are first aligned with the target one. Frames can be aligned by performing motion estimation and motion compensation (MEMC) or by using deformable convolution (DC). Motion estimation gives information about the motion of pixels between frames. Motion compensation is a warping operation, which aligns one frame to another based on motion information (a minimal warping sketch is given after the list below). Examples of such methods:
Deep-DE[30] (deep draft-ensemble learning) generates a series of SR feature maps and then processes them together to estimate the final frame
VSRnet[31] is based on SRCNN (a model for single image super-resolution), but takes multiple frames as input. Input frames are first aligned by the Druleas algorithm
VESPCN[32] uses a spatial motion compensation transformer module (MCT), which estimates and compensates motion. Then a series of convolutions is performed to extract features and fuse them
DRVSR[33] (detail-revealing deep video super-resolution) consists of three main steps: motion estimation, motion compensation and fusion. The motion compensation transformer (MCT) is used for motion estimation. The sub-pixel motion compensation layer (SPMC) compensates motion. The fusion step uses an encoder-decoder architecture with a ConvLSTM module to unite information from both spatial and temporal dimensions
RVSR[34] (robust video super-resolution) has two branches: one for spatial alignment and another for temporal adaptation. The final frame is a weighted sum of the branches' outputs
FRVSR[35] (frame-recurrent video super-resolution) estimates low-resolution optical flow, upsamples it to high resolution and warps the previous output frame using this high-resolution optical flow
STTN[36] (the spatio-temporal transformer network) estimates optical flow with a U-style network based on U-Net and compensates motion by a trilinear interpolation method
SOF-VSR[37] (super-resolution optical flow for video super-resolution) calculates high-resolution optical flow in a coarse-to-fine manner. Then the low-resolution optical flow is estimated by a space-to-depth transformation. The final super-resolution result is obtained from the aligned low-resolution frames
TecoGAN[38] (the temporally coherent GAN) consists of a generator and a discriminator. The generator estimates LR optical flow between consecutive frames, approximates HR optical flow from it and yields the output frame. The discriminator assesses the quality of the generator
TOFlow[39] (task-oriented flow) is a combination of an optical flow network and a reconstruction network. The estimated optical flow is tailored to a particular task, such as video super-resolution
MMCNN[40] (the multi-memory convolutional neural network) aligns frames with the target one and then generates the final HR result through the feature extraction, detail fusion and feature reconstruction modules
RBPN[41] (the recurrent back-projection network). The input of each recurrent projection module consists of features from the previous frame, features from the sequence of frames, and the optical flow between neighboring frames
MEMC-Net[42] (the motion estimation and motion compensation network) uses both a motion estimation network and a kernel estimation network to warp frames adaptively
RTVSR[43] (real-time video super-resolution) aligns frames with an estimated convolutional kernel
MultiBoot VSR[44] (the multi-stage multi-reference bootstrapping method) aligns frames and then applies two stages of SR reconstruction to improve quality
BasicVSR[45] aligns frames with optical flow and then fuses their features in a recurrent bidirectional scheme
IconVSR[45] is a refined version of BasicVSR with a recurrent coupled propagation scheme
UVSR[46] (unrolled network for video super-resolution) adapts unrolled optimization algorithms to solve the VSR problem
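A building block shared by many of the MEMC methods above is backward warping of a neighboring frame with a dense optical flow field. The sketch below shows one possible implementation with PyTorch's `grid_sample`; the `warp` helper and the flow source are assumptions, not code from any particular paper.

```python
# Minimal sketch of motion compensation: backward-warp a neighboring frame to the
# target frame with a dense optical flow field. The flow is assumed to come from
# any motion-estimation network.
import torch
import torch.nn.functional as F

def warp(frame, flow):
    """frame: (N, C, H, W); flow: (N, 2, H, W) in pixels (dx, dy)."""
    n, _, h, w = frame.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    grid_x = xs.float()[None] + flow[:, 0]          # shifted x coordinates
    grid_y = ys.float()[None] + flow[:, 1]          # shifted y coordinates
    # Normalize coordinates to [-1, 1] as required by grid_sample.
    grid = torch.stack((2 * grid_x / (w - 1) - 1,
                        2 * grid_y / (h - 1) - 1), dim=-1)
    return F.grid_sample(frame, grid, align_corners=True)

# Zero flow leaves the frame unchanged.
aligned = warp(torch.rand(1, 3, 64, 64), torch.zeros(1, 2, 64, 64))
```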
Another way to align neighboring frames with the target one is deformable convolution. While the usual convolution has a fixed kernel, deformable convolution first estimates shifts for the kernel sampling positions and then performs the convolution (a minimal sketch is given after the list below). Examples of such methods:
EDVR[47] (the enhanced deformable video restoration) can be divided into two main modules: the pyramid, cascading and deformable (PCD) module for alignment and the temporal-spatial attention (TSA) module for fusion
DNLN[48] (the deformable non-local network) has an alignment module (based on deformable convolution with a hierarchical feature fusion module (HFFB) for better quality) and a non-local attention module
TDAN[49] (the temporally deformable alignment network) consists of an alignment module and a reconstruction module. Alignment is performed by deformable convolution based on feature extraction and alignment
Multi-Stage Feature Fusion Network[50] for Video Super-Resolution uses the multi-scale dilated deformable convolution for frame alignment and the Modulative Feature Fusion Branch to integrate aligned frames
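For illustration, the sketch below shows a single deformable convolution step using `torchvision.ops.deform_conv2d`, with offsets predicted by an ordinary convolution as in DCN-style alignment; the layer sizes are arbitrary and not taken from any of the methods above.

```python
# Minimal sketch of deformable convolution: offsets are predicted by an ordinary
# convolution, then the main kernel samples the input at the shifted positions.
import torch
import torch.nn as nn
from torchvision.ops import deform_conv2d

in_ch, out_ch, k = 16, 16, 3
offset_pred = nn.Conv2d(in_ch, 2 * k * k, kernel_size=3, padding=1)  # 2 offsets per kernel tap
weight = torch.randn(out_ch, in_ch, k, k)

x = torch.rand(1, in_ch, 32, 32)
offset = offset_pred(x)                        # (1, 2*k*k, 32, 32): learned shifts
y = deform_conv2d(x, offset, weight, padding=1)
print(y.shape)                                 # torch.Size([1, 16, 32, 32])
```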
Some methods align frames using a homography calculated between frames; a minimal homography-alignment sketch is given after the example below.
TGA[51] (temporal group attention) divides input frames into N groups depending on time difference and extracts information from each group independently. A fast spatial alignment module based on homography is used to align frames
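A minimal homography-based alignment can be sketched with OpenCV as below; this is a generic ORB-plus-RANSAC pipeline and the `align_by_homography` helper is illustrative, not the exact fast spatial alignment used in TGA.

```python
# Minimal sketch of homography-based frame alignment: match ORB keypoints,
# estimate a homography with RANSAC, and warp the neighbor onto the reference.
import cv2
import numpy as np

def align_by_homography(neighbor, reference):
    orb = cv2.ORB_create()
    kp1, des1 = orb.detectAndCompute(neighbor, None)
    kp2, des2 = orb.detectAndCompute(reference, None)
    matches = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True).match(des1, des2)
    src = np.float32([kp1[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst = np.float32([kp2[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    H, _ = cv2.findHomography(src, dst, cv2.RANSAC, 5.0)   # robust homography estimate
    h, w = reference.shape[:2]
    return cv2.warpPerspective(neighbor, H, (w, h))         # warp neighbor onto reference
```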
Methods without alignment do not perform alignment as a first step and just process input frames.
VSRResNet[52], like a GAN, consists of a generator and a discriminator. The generator upsamples input frames, extracts features and fuses them. The discriminator assesses the quality of the resulting high-resolution frames
FFCVSR[53] (frame and feature-context video super-resolution) takes unaligned low-resolution frames together with previously output high-resolution frames to simultaneously restore high-frequency details and maintain temporal consistency
MRMNet[54] (the multi-resolution mixture network) consists of three modules: bottleneck, exchange, and residual. The bottleneck module extracts features that have the same resolution as the input frames. The exchange module exchanges features between neighboring frames and enlarges feature maps. The residual module extracts features after the exchange module
STMN[55] (the spatio-temporal matching network) uses the discrete wavelet transform to fuse temporal features. A non-local matching block integrates super-resolution and denoising. At the final step, the SR result is obtained in the global wavelet domain
MuCAN[56] (the multi-correspondence aggregation network) uses a temporal multi-correspondence strategy to fuse temporal features and cross-scale nonlocal correspondence to extract self-similarities in frames
DUF[57] (the dynamic upsampling filters) uses deformable 3D convolution for motion compensation. The model estimates kernels for specific input frames
FSTRN[58] (the fast spatio-temporal residual network) includes a few modules: an LR video shallow feature extraction net (LFENet), an LR feature fusion and up-sampling module (LSRNet), and two residual modules: spatio-temporal and global
3DSRnet[59] (the 3D super-resolution network) uses 3D convolutions to extract spatio-temporal information. The model also has a special approach for frames where a scene change is detected
MP3D[60] (the multi-scale pyramid 3D convolutional network) uses 3D convolution to extract spatial and temporal features simultaneously, which are then passed through a reconstruction module with 3D sub-pixel convolution for upsampling
DMBN[61] (the dynamic multiple branch network) has three branches to exploit information from multiple resolutions. Finally, information from the branches is fused dynamically
Recurrent convolutional neural networks perform video super-resolution by storing temporal dependencies.
STCN[62] (the spatio-temporal convolutional network) extracts features in the spatial module and passes them through the recurrent temporal module and the final reconstruction module. Temporal consistency is maintained by a long short-term memory (LSTM) mechanism
BRCN[63] (the bidirectional recurrent convolutional network) has two subnetworks: one with forward fusion and one with backward fusion. The result of the network is a composition of the two branches' outputs
RISTN[64] (the residual invertible spatio-temporal network) consists of spatial, temporal and reconstruction modules. The spatial module is composed of residual invertible blocks (RIB), which extract spatial features effectively. The output of the spatial module is processed by the temporal module, which extracts spatio-temporal information and then fuses the important features. The final result is calculated in the reconstruction module by a deconvolution operation
RRCN[65] (the residual recurrent convolutional network) is a bidirectional recurrent network, which calculates a residual image. The final result is then obtained by adding a bicubically upsampled input frame
RRN[66] (the recurrent residual network) uses a recurrent sequence of residual blocks to extract spatial and temporal information
BTRPN[67] (the bidirectional temporal-recurrent propagation network) uses a bidirectional recurrent scheme. The final result is combined from the two branches with a channel attention mechanism
RLSP[68] (recurrent latent space propagation) is a fully convolutional network cell with highly efficient propagation of temporal information through a hidden state
RSDN[69] (the recurrent structure-detail network) divides the input frame into structure and detail components and processes them in two parallel streams
Non-local methods extract both spatial and temporal information. The key idea is to compute each output position as a weighted sum over all possible positions (a minimal sketch of this weighting is given after the list below). This strategy may be more effective than local approaches. The progressive fusion non-local method extracts spatio-temporal features by non-local residual blocks, then fuses them by a progressive fusion residual block (PFRB). The result of these blocks is a residual image. The final result is obtained by adding a bicubically upsampled input frame
NLVSR[70] (the novel video super-resolution network) aligns frames with the target one by a temporal-spatial non-local operation. To integrate information from the aligned frames, an attention-based mechanism is used
MSHPFNL[71] also incorporates a multi-scale structure and hybrid convolutions to extract wide-range dependencies. To avoid artifacts like flickering or ghosting, it uses generative adversarial training
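The non-local weighting idea itself is compact: each output position receives a softmax-weighted sum over all positions of the feature map. The sketch below is a bare-bones version without the learned projections used in the methods above; the `non_local` function name is illustrative.

```python
# Minimal sketch of a non-local operation: every output position is a weighted
# sum over all positions, with weights given by feature similarity.
import torch
import torch.nn.functional as F

def non_local(x):
    """x: (N, C, H, W) feature map."""
    n, c, h, w = x.shape
    flat = x.view(n, c, h * w)                       # (N, C, HW)
    sim = torch.bmm(flat.transpose(1, 2), flat)      # (N, HW, HW) pairwise similarities
    attn = F.softmax(sim, dim=-1)                    # weights over all positions
    out = torch.bmm(flat, attn.transpose(1, 2))      # weighted sum of all positions
    return out.view(n, c, h, w) + x                  # residual connection

y = non_local(torch.rand(1, 8, 16, 16))
```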
LPIPS (Learned Perceptual Image Patch Similarity) compares the perceptual similarity of frames based on high-order image structure
tOF measures pixel-wise motion similarity with the reference frame based on optical flow (a minimal sketch is given after this list)
tLP calculates how LPIPS changes from frame to frame in comparison with the reference sequence
FSIM (Feature Similarity Index for Image Quality) uses phase congruency as the primary feature to measure the similarity between two corresponding frames.
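As an example of a temporal-consistency measurement in the spirit of tOF, the sketch below compares the motion fields of the restored and reference videos using OpenCV's Farneback optical flow; the exact formulation of the published metric may differ, and the `tOF` helper is illustrative.

```python
# Sketch of a tOF-style metric: compare motion fields of restored and reference
# videos, frame pair by frame pair, using Farneback optical flow.
import cv2
import numpy as np

def flow(a, b):
    return cv2.calcOpticalFlowFarneback(a, b, None, 0.5, 3, 15, 3, 5, 1.2, 0)

def tOF(restored, reference):
    """restored, reference: lists of grayscale uint8 frames of equal length."""
    diffs = []
    for t in range(1, len(reference)):
        f_res = flow(restored[t - 1], restored[t])
        f_ref = flow(reference[t - 1], reference[t])
        diffs.append(np.mean(np.abs(f_res - f_ref)))  # pixel-wise motion difference
    return float(np.mean(diffs))
```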
Currently, there are few objective metrics to verify a video super-resolution method's ability to restore real details. Research is ongoing in this area.
Another way to assess the performance of a video super-resolution algorithm is to organize a subjective evaluation. People are asked to compare the corresponding frames, and the final mean opinion score (MOS) is calculated as the arithmetic mean over all ratings.
While deep learning approaches to video super-resolution outperform traditional ones, it's crucial to form a high-quality dataset for evaluation. It's important to verify models' ability to restore small details, text, and objects with complicated structure, and to cope with large motion and noise.
A few benchmarks in video super-resolution were organized by companies and conferences. The purposes of such challenges are to compare diverse algorithms and to find the state-of-the-art for the task.
The NTIRE 2019 Challenge was organized by CVPR and proposed two tracks for video super-resolution: clean (only bicubic degradation) and blur (blur added first). Each track had more than 100 participants and 14 final results were submitted. The REDS dataset was collected for this challenge. It consists of 30 videos of 100 frames each. The resolution of ground-truth frames is 1280×720. The tested scale factor is 4. PSNR and SSIM were used to evaluate models' performance. The best participants' results are presented in the table:
The Youku-VESR Challenge was organized to check models' ability to cope with the degradation and noise that are typical for the Youku online video-watching application. The proposed dataset consists of 1000 videos, each 4–6 seconds long. The resolution of ground-truth frames is 1920×1080. The tested scale factor is 4. PSNR and VMAF metrics were used for performance evaluation. Top methods are presented in the table:
The challenge was held by ECCV and had two tracks on video extreme super-resolution: the first track checks the fidelity with the reference frame (measured by PSNR and SSIM); the second track checks the perceptual quality of videos (MOS). The dataset consists of 328 video sequences of 120 frames each. The resolution of ground-truth frames is 1920×1080. The tested scale factor is 16. Top methods are presented in the table:
The MSU Video Super-Resolution Benchmark was organized by MSU and proposed three types of motion, two ways to lower resolution, and eight types of content in the dataset. The resolution of ground-truth frames is 1920×1280. The tested scale factor is 4. 14 models were tested. PSNR and SSIM with shift compensation were used to evaluate models' performance. A few new metrics were also proposed: ERQAv1.0, QRCRv1.0, and CRRMv1.0.[72] Top methods are presented in the table:
The MSU Super-Resolution for Video Compression Benchmark was organized by MSU. This benchmark tests models' ability to work with compressed videos. The dataset consists of 9 videos, compressed with different video codec standards and different bitrates. Models are ranked by BSQ-rate[73] over subjective score. The resolution of ground-truth frames is 1920×1080. The tested scale factor is 4. 17 models were tested. 5 video codecs were used to compress the ground-truth videos. Top combinations of super-resolution methods and video codecs are presented in the table:
In many areas of working with video, we deal with different types of video degradation, including downscaling. The resolution of video can be degraded because of imperfections of measuring devices, such as optical degradations and the limited size of camera sensors. Bad light and weather conditions add noise to video. Object and camera motion also decrease video quality. Super-resolution techniques help to restore the original video. They are useful in a wide range of applications, such as
video surveillance (to improve video captured from a camera and recognize car number plates and faces)
medical imaging (to better discern organs or tissues for clinical analysis and medical intervention)
forensic science (to help in the investigation during the criminal procedure)
astronomy (to improve the quality of video of stars and planets)
Simulating the natural hand movements by "jiggling" the camera
Video super-resolution finds its practical use in some modern smartphones and cameras, where it is used to reconstruct digital photographs.
Reconstructing details in digital photographs is a difficult task since these photographs are already incomplete: the camera sensor elements measure only the intensity of the light, not directly its color. A process called demosaicing is used to reconstruct the photos from partial color information. A single frame doesn't give us enough data to fill in the missing colors; however, we can recover some of the missing information from multiple images taken one after the other. This process is known as burst photography and can be used to restore a single image of good quality from multiple sequential frames.
When we capture a lot of sequential photos with a smartphone or handheld camera, there is always some movement present between the frames because of the hand motion. We can take advantage of this hand tremor by combining the information on those images. We choose a single image as the "base" or reference frame and align every other frame relative to it.
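A toy version of this align-and-merge idea, assuming only a global translation between frames, can be written with OpenCV's phase correlation; real burst pipelines use tile-wise alignment and robust merging, so this is only a sketch, and the `merge_burst` helper is illustrative.

```python
# Minimal sketch of burst merging: estimate the shift of every frame against the
# reference with phase correlation, undo it, and average.
import cv2
import numpy as np

def merge_burst(frames):
    """frames: list of float32 grayscale images; frames[0] is the reference."""
    ref = frames[0]
    accum, count = ref.copy(), 1
    for frame in frames[1:]:
        (dx, dy), _ = cv2.phaseCorrelate(ref, frame)        # shift of frame vs reference
        M = np.float32([[1, 0, -dx], [0, 1, -dy]])          # translation that undoes it
        aligned = cv2.warpAffine(frame, M, (ref.shape[1], ref.shape[0]))
        accum += aligned
        count += 1
    return accum / count
```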
There are situations where hand motion is simply not present because the device is stabilized (e.g. placed on a tripod). There is a way to simulate natural hand motion by intentionally moving the camera very slightly. The movements are extremely small, so they don't interfere with regular photos. You can observe these motions on the Google Pixel 3[74] phone by holding it perfectly still (e.g. pressing it against a window) and maximally pinch-zooming the viewfinder.
^Chan, Kelvin CK, et al. "BasicVSR: The search for essential components in video super-resolution and beyond."Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021.
^Kim, S. P.; Bose, N. K.; Valenzuela, H. M. (1989). "Reconstruction of high resolution image from noise undersampled frames".Lecture Notes in Control and Information Sciences. Vol. 129. Berlin/Heidelberg: Springer-Verlag. pp. 315–326.doi:10.1007/bfb0042742.ISBN3-540-51424-4.
^Bose, N.K.; Kim, H.C.; Zhou, B. (1994). "Performance analysis of the TLS algorithm for image reconstruction from a sequence of undersampled noisy and blurred frames".Proceedings of 1st International Conference on Image Processing. Vol. 3. IEEE Comput. Soc. Press. pp. 571–574.doi:10.1109/icip.1994.413741.ISBN0-8186-6952-7.
^Tekalp, A.M.; Ozkan, M.K.; Sezan, M.I. (1992). "High-resolution image reconstruction from lower-resolution image sequences and space-varying image restoration".[Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE. pp. 169–172 vol.3.doi:10.1109/icassp.1992.226249.ISBN0-7803-0532-9.
^Goldberg, N.; Feuer, A.; Goodwin, G.C. (2003). "Super-resolution reconstruction using spatio-temporal filtering".Journal of Visual Communication and Image Representation.14 (4). Elsevier BV:508–525.doi:10.1016/s1047-3203(03)00042-7.ISSN1047-3203.
^Bose, N.K.; Lertrattanapanich, S.; Chappalli, M.B. (2004). "Superresolution with second generation wavelets".Signal Processing: Image Communication.19 (5). Elsevier BV:387–391.doi:10.1016/j.image.2004.02.001.ISSN0923-5965.
^Cohen, B.; Avrin, V.; Dinstein, I. (2000). "Polyphase back-projection filtering for resolution enhancement of image sequences".2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100). Vol. 4. IEEE. pp. 2171–2174.doi:10.1109/icassp.2000.859267.ISBN0-7803-6293-4.
^Katsaggelos, A.K. (1997). "An iterative weighted regularized algorithm for improving the resolution of video sequences".Proceedings of International Conference on Image Processing. IEEE Comput. Soc. pp. 474–477.doi:10.1109/icip.1997.638811.ISBN0-8186-8183-7.
^Farsiu, Sina; Elad, Michael; Milanfar, Peyman (2006-01-15). "A practical approach to superresolution". In Apostolopoulos, John G.; Said, Amir (eds.).Visual Communications and Image Processing 2006. Vol. 6077. SPIE. p. 607703.doi:10.1117/12.644391.
^Jing Tian; Kai-Kuang Ma (2005). "A new state-space approach for super-resolution image sequence reconstruction".IEEE International Conference on Image Processing 2005. IEEE. pp. I-881.doi:10.1109/icip.2005.1529892.ISBN0-7803-9134-9.
^Costa, Guilherme Holsbach; Bermudez, Jos Carlos Moreira (2007). "Statistical Analysis of the LMS Algorithm Applied to Super-Resolution Image Reconstruction".IEEE Transactions on Signal Processing.55 (5). Institute of Electrical and Electronics Engineers (IEEE):2084–2095.Bibcode:2007ITSP...55.2084C.doi:10.1109/tsp.2007.892704.ISSN1053-587X.S2CID52857681.
^Elad, M.; Feuer, A. (1999). "Super-resolution reconstruction of continuous image sequences".Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348). Vol. 3. IEEE. pp. 459–463.doi:10.1109/icip.1999.817156.ISBN0-7803-5467-2.
^abElad, M.; Feuer, A. (1999). "Superresolution restoration of an image sequence: adaptive filtering approach".IEEE Transactions on Image Processing.8 (3). Institute of Electrical and Electronics Engineers (IEEE):387–395.Bibcode:1999ITIP....8..387E.doi:10.1109/83.748893.ISSN1057-7149.PMID18262881.
^Pickering, M.; Frater, M.; Arnold, J. (2005). "Arobust approach to super-resolution sprite generation".IEEE International Conference on Image Processing 2005. IEEE. pp. I-897.doi:10.1109/icip.2005.1529896.ISBN0-7803-9134-9.
^Nasonov, Andrey V.; Krylov, Andrey S. (2010). "Fast Super-Resolution Using Weighted Median Filtering".2010 20th International Conference on Pattern Recognition. IEEE. pp. 2230–2233.doi:10.1109/icpr.2010.546.ISBN978-1-4244-7542-1.
^Simonyan, K.; Grishin, S.; Vatolin, D.; Popov, D. (2008). "Fast video super-resolution via classification".2008 15th IEEE International Conference on Image Processing. IEEE. pp. 349–352.doi:10.1109/icip.2008.4711763.ISBN978-1-4244-1765-0.
^Nasir, Haidawati; Stankovic, Vladimir; Marshall, Stephen (2011). "Singular value decomposition based fusion for super-resolution image reconstruction".2011 IEEE International Conference on Signal and Image Processing Applications (ICSIPA). IEEE. pp. 393–398.doi:10.1109/icsipa.2011.6144138.ISBN978-1-4577-0242-6.
^Zhuo, Yue;Liu, Jiaying; Ren, Jie; Guo, Zongming (2012). "Nonlocal based Super Resolution with rotation invariance and search window relocation".2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE. pp. 853–856.doi:10.1109/icassp.2012.6288018.ISBN978-1-4673-0046-9.
^Huhle, Benjamin; Schairer, Timo; Jenke, Philipp; Straßer, Wolfgang (2010). "Fusion of range and color images for denoising and resolution enhancement with a non-local filter".Computer Vision and Image Understanding.114 (12). Elsevier BV:1336–1345.doi:10.1016/j.cviu.2009.11.004.ISSN1077-3142.
^Elad, M.; Feuer, A. (1997). "Restoration of a single superresolution image from several blurred, noisy, and undersampled measured images".IEEE Transactions on Image Processing.6 (12). Institute of Electrical and Electronics Engineers (IEEE):1646–1658.Bibcode:1997ITIP....6.1646E.doi:10.1109/83.650118.ISSN1057-7149.PMID18285235.
^Farsiu, Sina; Robinson, Dirk; Elad, Michael; Milanfar, Peyman (2003-11-20). "Robust shift and add approach to superresolution". In Tescher, Andrew G. (ed.).Applications of Digital Image Processing XXVI. Vol. 5203. SPIE. p. 121.doi:10.1117/12.507194.
^Rajan, D.; Chaudhuri, S. (2001). "Generation of super-resolution images from blurred observations using Markov random fields".2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221). Vol. 3. IEEE. pp. 1837–1840.doi:10.1109/icassp.2001.941300.ISBN0-7803-7041-4.
^Zibetti, Marcelo Victor Wust; Mayer, Joceli (2006). "Outlier Robust and Edge-Preserving Simultaneous Super-Resolution".2006 International Conference on Image Processing. IEEE. pp. 1741–1744.doi:10.1109/icip.2006.312718.ISBN1-4244-0480-0.
^Joshi, M.V.; Chaudhuri, S.; Panuganti, R. (2005). "A Learning-Based Method for Image Super-Resolution From Zoomed Observations".IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics.35 (3). Institute of Electrical and Electronics Engineers (IEEE):527–537.doi:10.1109/tsmcb.2005.846647.ISSN1083-4419.PMID15971920.S2CID3162908.
^Liao, Renjie; Tao, Xin; Li, Ruiyu; Ma, Ziyang; Jia, Jiaya (2015). "Video Super-Resolution via Deep Draft-Ensemble Learning".2015 IEEE International Conference on Computer Vision (ICCV). IEEE. pp. 531–539.doi:10.1109/iccv.2015.68.ISBN978-1-4673-8391-2.
^Kappeler, Armin; Yoo, Seunghwan; Dai, Qiqin; Katsaggelos, Aggelos K. (2016). "Video Super-Resolution With Convolutional Neural Networks".IEEE Transactions on Computational Imaging.2 (2). Institute of Electrical and Electronics Engineers (IEEE):109–122.doi:10.1109/tci.2016.2532323.ISSN2333-9403.S2CID9356783.
^Caballero, Jose; Ledig, Christian; Aitken, Andrew; Acosta, Alejandro; Totz, Johannes; Wang, Zehan; Shi, Wenzhe (2016-11-16). "Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation".arXiv:1611.05250v2 [cs.CV].
^Tao, Xin; Gao, Hongyun; Liao, Renjie; Wang, Jue; Jia, Jiaya (2017). "Detail-Revealing Deep Video Super-Resolution".2017 IEEE International Conference on Computer Vision (ICCV). IEEE. pp. 4482–4490.arXiv:1704.02738.doi:10.1109/iccv.2017.479.ISBN978-1-5386-1032-9.
^Liu, Ding; Wang, Zhaowen; Fan, Yuchen; Liu, Xianming; Wang, Zhangyang; Chang, Shiyu; Huang, Thomas (2017). "Robust Video Super-Resolution with Learned Temporal Dynamics".2017 IEEE International Conference on Computer Vision (ICCV). IEEE. pp. 2526–2534.doi:10.1109/iccv.2017.274.ISBN978-1-5386-1032-9.
^Sajjadi, Mehdi S. M.; Vemulapalli, Raviteja; Brown, Matthew (2018). "Frame-Recurrent Video Super-Resolution".2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. pp. 6626–6634.arXiv:1801.04590.doi:10.1109/cvpr.2018.00693.ISBN978-1-5386-6420-9.
^Kim, Tae Hyun; Sajjadi, Mehdi S. M.; Hirsch, Michael; Schölkopf, Bernhard (2018). "Spatio-Temporal Transformer Network for Video Restoration".Computer Vision – ECCV 2018. Lecture Notes in Computer Science. Vol. 11207. Cham: Springer International Publishing. pp. 111–127.doi:10.1007/978-3-030-01219-9_7.ISBN978-3-030-01218-2.ISSN0302-9743.
^Chu, Mengyu; Xie, You; Mayer, Jonas; Leal-Taixé, Laura; Thuerey, Nils (2020-07-08). "Learning temporal coherence via self-supervision for GAN-based video generation".ACM Transactions on Graphics.39 (4). Association for Computing Machinery (ACM).arXiv:1811.09393.doi:10.1145/3386569.3392457.ISSN0730-0301.S2CID209460786.
^Xue, Tianfan; Chen, Baian; Wu, Jiajun; Wei, Donglai; Freeman, William T. (2019-02-12). "Video Enhancement with Task-Oriented Flow".International Journal of Computer Vision.127 (8). Springer Science and Business Media LLC:1106–1125.arXiv:1711.09078.doi:10.1007/s11263-018-01144-2.ISSN0920-5691.S2CID40412298.
^Wang, Zhongyuan; Yi, Peng; Jiang, Kui; Jiang, Junjun; Han, Zhen; Lu, Tao; Ma, Jiayi (2019). "Multi-Memory Convolutional Neural Network for Video Super-Resolution".IEEE Transactions on Image Processing.28 (5). Institute of Electrical and Electronics Engineers (IEEE):2530–2544.Bibcode:2019ITIP...28.2530W.doi:10.1109/tip.2018.2887017.ISSN1057-7149.PMID30571634.S2CID58595890.
^Haris, Muhammad; Shakhnarovich, Gregory; Ukita, Norimichi (2019). "Recurrent Back-Projection Network for Video Super-Resolution".2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 3892–3901.arXiv:1903.10128.doi:10.1109/cvpr.2019.00402.ISBN978-1-7281-3293-8.
^Bao, Wenbo; Lai, Wei-Sheng; Zhang, Xiaoyun; Gao, Zhiyong; Yang, Ming-Hsuan (2021-03-01). "MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement".IEEE Transactions on Pattern Analysis and Machine Intelligence.43 (3). Institute of Electrical and Electronics Engineers (IEEE):933–948.arXiv:1810.08768.doi:10.1109/tpami.2019.2941941.ISSN0162-8828.PMID31722471.S2CID53046739.
^Kalarot, Ratheesh; Porikli, Fatih (2019). "MultiBoot Vsr: Multi-Stage Multi-Reference Bootstrapping for Video Super-Resolution".2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). IEEE. pp. 2060–2069.doi:10.1109/cvprw.2019.00258.ISBN978-1-7281-2506-0.
^abChan, Kelvin C. K.; Wang, Xintao; Yu, Ke; Dong, Chao; Loy, Chen Change (2020-12-03). "BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond".arXiv:2012.02181v1 [cs.CV].
^Naoto Chiche, Benjamin; Frontera-Pons, Joana; Woiselle, Arnaud; Starck, Jean-Luc (2020-11-09). "Deep Unrolled Network for Video Super-Resolution".2020 Tenth International Conference on Image Processing Theory, Tools and Applications (IPTA). IEEE. pp. 1–6.arXiv:2102.11720.doi:10.1109/ipta50016.2020.9286636.ISBN978-1-7281-8750-1.
^Wang, Xintao; Chan, Kelvin C. K.; Yu, Ke; Dong, Chao; Loy, Chen Change (2019-05-07). "EDVR: Video Restoration with Enhanced Deformable Convolutional Networks".arXiv:1905.02716v1 [cs.CV].
^Jo, Younghyun; Oh, Seoung Wug; Kang, Jaeyeon; Kim, Seon Joo (2018). "Deep Video Super-Resolution Network Using Dynamic Upsampling Filters Without Explicit Motion Compensation".2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE. pp. 3224–3232.doi:10.1109/cvpr.2018.00340.ISBN978-1-5386-6420-9.
^Li, Sheng; He, Fengxiang; Du, Bo; Zhang, Lefei; Xu, Yonghao; Tao, Dacheng (2019-04-05). "Fast Spatio-Temporal Residual Network for Video Super-Resolution".arXiv:1904.02870v1 [cs.CV].
^Kim, Soo Ye; Lim, Jeongyeon; Na, Taeyoung; Kim, Munchurl (2019). "Video Super-Resolution Based on 3D-CNNS with Consideration of Scene Change".2019 IEEE International Conference on Image Processing (ICIP). pp. 2831–2835.doi:10.1109/ICIP.2019.8803297.ISBN978-1-5386-6249-6.S2CID202763112.
^Luo, Jianping; Huang, Shaofei; Yuan, Yuan (2020). "Video Super-Resolution using Multi-scale Pyramid 3D Convolutional Networks".Proceedings of the 28th ACM International Conference on Multimedia. pp. 1882–1890.doi:10.1145/3394171.3413587.ISBN9781450379885.S2CID222278621.
^Zhang, Dongyang; Shao, Jie; Liang, Zhenwen; Liu, Xueliang; Shen, Heng Tao (2020). "Multi-branch Networks for Video Super-Resolution with Dynamic Reconstruction Strategy".IEEE Transactions on Circuits and Systems for Video Technology.31 (10):3954–3966.doi:10.1109/TCSVT.2020.3044451.ISSN1051-8215.S2CID235057646.
^Fuoli, Dario; Gu, Shuhang; Timofte, Radu (2019-09-17). "Efficient Video Super-Resolution through Recurrent Latent Space Propagation".arXiv:1909.08080 [eess.IV].
^Zvezdakova, A. V.; Kulikov, D. L.; Zvezdakov, S. V.; Vatolin, D. S. (2020). "BSQ-rate: a new approach for video-codec performance comparison and drawbacks of current solutions".Programming and Computer Software.46 (3):183–194.doi:10.1134/S0361768820030111.S2CID219157416.