The SSIMplus Index for Video Quality-of-Experience Assessment

 

 


The SSIMplus Index

SSIMplus is an objective full-reference perceptual video quality-of-experience (QoE) index ranges between 0 and 100. SSIMplus in many ways goes far beyond what SSIM can do. Its distinctive features include:

-        High accuracy

-        High speed

-        Straightforward and easy-to-use

-        Device-adaptive QoE assessment

-        Cross-resolution QoE assessment

-        Cross-content QoE measurement

-        Detailed quality map

SSIMplus was developed by the team of Prof. Zhou Wang in Waterloo, Ontario, Canada. Details can be found in the following papers:

1.     A. Rehman, K. Zeng and Z. Wang, "Display device-adapted video quality-of-experience assessment," IS&T-SPIE Electronic Imaging, Human Vision and Electronic Imaging XX, Feb. 2015.

More details and results will also be presented in future publications/talks. Please be advised from the publication page of Prof. Wang.


Applications of SSIMplus

SSIMplus may be employed in many application scenarios. In the field of video delivery over multimedia communication networks, SSIMplus may be applied in the following ways:

-        Live and file-based video QoE monitoring

-        Benchmarking video encoders and transcoders

-        Guiding adaptive bit-rate video coding

-        Enabling smart quality-driven adaptive bit-rate video streaming

 


SSIMplus Features

SSIMplus not only employs advanced computational models derived from vision science and state-of-the-art visual quality assessment research, but is also optimized for speed and designed for providing meaningful quality scores comparable across device, resolution and content. The SSIMplus features are briefly explained as follows.

 

High Accuracy

SSIMplus has been tested using many video quality databases and exhibits high prediction accuracy of perceived video quality by humans. Results of a recent test performed at University of Waterloo are given here:

 

A set of raw videos sequences, consisting of 1080p and 640p resolutions, was compressed at various distortion levels to produce H.264-compliant bitstreams. The decompressed test videos were rated by subjects under the following viewing conditions:

 

-        Display Device: iPhone 5S, viewing distance: 10 inches

-        Display Device: iPad Air, viewing distance: 16 inches

-        Display Device: Lenovo W530 laptop, viewing distance: 20 inches

-        Display Device: Sony 55" TV, viewing distance: 90 inches

-        Display Device: Sony 55" TV, viewing distance: 20 inches (referred to as TV-Expert)

 

This database is one of the first of its kind for cross-device video quality assessment. The mean opinion score (MOS) of human subjects for each video viewed at each condition is computed and used as the ground-truth to test the performance of SSIMplus and other objective video quality models. The scatter plots of MOS versus popular models (PSNR, SSIM, MS-SSIM, MOVIE and VQM) are given in Figure 1.

 

  

  

 

Figure 1. Scatter plots of MOS versus various video quality models, including PSNR, SSIM, MS-SSIM, VQM, MOVIE and SSIMplus.

 

The superior performance of SSIMplus is evident from Figure 1, and is also confirmed by numerical evaluations using Pearson linear correlation coefficient (PLCC), mean absolute prediction error (MAE), root mean squared prediction error (RMS), Spearman rank-order correlation coefficient (SRCC) and Kendall rank-order correlation coefficient (KRCC). A better model should have higher PLCC, SRCC and KRCC, and lower MAE and RMS values. The results are summarized in Table 1.

 

Table 1. Performance comparison of state-of-the-art video quality models

Model

PLCC

MAE

RMS

SRCC

KRCC

Computation Time

(normalized based on PSNR)

PSNR

0.9062

7.4351

9.8191

0.8804

0.6886

1

SSIM

0.9253

6.9203

8.8069

0.9014

0.7246

22.65

MS-SSIM

0.8945

8.1969

10.384

0.8619

0.6605

48.49

VQM

0.8981

8.0671

10.214

0.8703

0.6711

174.53

MOVIE

0.9096

7.4761

9.6493

0.8892

0.7001

3340.27

SSIMplus

0.9732

4.3192

5.3451

0.9349

0.7888

7.83

 

High Speed

Video QoE measurement is often a computationally demanding task but real-world applications such as live QoE monitoring often desire videos being evaluated in real-time. The SSIMplus method has been optimized for speed. The last column of Table 1 compares SSIMplus with state-of-the-art video quality assessment models, where it can be observed that the computational cost of SSIMplus, the most accurate model in predicting visual QoE, is only 7.83 times of PSNR, far less than the other popular models. The low computational cost allows SSIMplus to be computed faster than real-time, even when comparing ultra-HD 4K video.

 

Straightforward and Easy-To-Use

SSIMplus provides straightforward predictions on what an average human viewer would say about the quality of the video being viewed on a scale of 0-100, which is evenly divided to five quality ranges of bad (0-19), poor (20-39), fair (40-59), good (60-79), and excellent (80-100), respectively. An example is shown in Figure 2. SSIMplus can be easily embedded into real-world systems for various applications such as live or file-based QoE monitoring, benchmarking of video encoders/transcoders, guiding adaptive bit-rate video encoding, and smart adaptive bitrate streaming.

 

Figure 2. SSIMplus assessment of video QoE. Top-left: test video; Top-right: reference video; Bottom-left: QoE curves over time; Bottom-right: running bars of device-adaptive QoE assessment.

 

Device-Adaptive QoE Assessment

The same video stream delivered to different display devices (cellphones, tablets, laptops, desktop monitors, TVs, etc) at different viewing conditions could result in drastically different visual QoE. Traditional video quality assessment approaches such as PSNR, SSIM, VQM and MOVIE, are unable to adapt to devices. The advanced modeling underlying SSIMplus allows it to be easily adapted to different viewing devices and conditions, and provides meaningful quality scores comparable across devices. An example is given in Figure 2, where the same video viewed on different devices are predicted to have very different QoE scores. The superior cross-device performance is evidenced by Figure 1 and Table 1, where videos viewed on different devices are mixed together. This is a critical feature for video content providers and distribution service providers to predict what each user in the network is experiencing, and also allows for smart adaptive bit-rate video streaming.

 

Cross-Resolution QoE Assessment

The spatial resolution of a video signal is often altered before, during and after delivery to fit bandwidth, power, buffering and display constraints. In the case of adaptive bit-rate streaming, video is typically transcoded into many versions with different spatial resolutions. SSIMplus allows for assessing the quality of a test video when the reference video has a different spatial resolution, and provides meaningful quality scores comparable across resolutions. This is another distinctive feature missing from existing video quality assessment approaches. The feature is partially demonstrated through the cross-resolution results shown in Figure 1.

 

Cross-Content QoE Assessment

One of the most challenging task in the development of video quality models is to supply meaningful quality scores comparable across content. Traditional video quality models such as PSNR often exhibits satisfying monotonicity when tested on the same video content compressed by the same codec, but fail miserably when videos of different content and with distortion types are mixed together. SSIMplus demonstrates the strongest robustness to variations of video content and compression techniques, so that the same score has the same meaning, regardless of the type of video content (news, natural scenes, sports, and animations, etc), codecs (MPEG2, H.264, HEVC, VPx) and other distortion types (noise, blur, and unknown types).

 

Detailed Quality Map

For users who are interested in deep analysis of video quality and diagnosis of video defects, SSIMplus delivers localized quality predictions deep to per-pixel granularity by producing a quality map that predicts the quality at every pixel in every frame in the test video. Using the quality maps, users can inspect quality defects of a video at each spatial location of each video frame, allowing for precise and in-depth evaluation, comparison, design, and optimization of their video compression, processing, storage, reproduction, delivery, and display systems.

 


Requests for SSIMplus

Interested users can request the SSIMplus video quality-of-experience monitoring software from the link below:

 

SSIMplus Quality-of-Experience Monitor (SQM)

 


Created November 9, 2014

Last updated November 9, 2014