Multitask Learning Based Intra-Mode Decision Framework for Versatile Video Coding
Round 1
Reviewer 1 Report
The paper deals with reducing the computational complexity of the VVC compression standard. It establishes a large database for intra prediction and proposes a multi-task learning based intra-mode decision framework. Experimental results show that the proposal enables up to 30% complexity reduction while only slightly increasing the Bjontegaard Delta Bitrate (BD-BR).
The paper is logically divided into five chapters, including the introduction. It is written in a manner appropriate to the research domain and is sound from a technical point of view.
Considering my positive statements, I fully recommend accepting the paper in its present form.
Author Response
Thank you so much for your recommendation.
Reviewer 2 Report
The manuscript presents the use of a CNN to estimate the probabilities of intra-prediction modes. The results are applied to the VTM of VVC to limit its computational complexity. They show a complexity reduction above 21% with an average compression efficiency loss of 1.66%, which demonstrates the method's efficiency and places it among the best complexity reduction approaches. The manuscript is well written and clear.
Please explain the descriptions "no skill" and "logistic" in Fig. 9.
Author Response
Thank you for your comment.
Point 1: The article was corrected accordingly, and an additional proofreading pass was performed.
Point 2: "Logistic" was replaced by "MTL" to refer to the precision vs. recall curve of our model, and the "no skill" model was explained as follows:
A no-skill model is a model that outputs random guesses or predictions.
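To illustrate the baseline this refers to, here is a minimal NumPy sketch (synthetic labels and a random scorer, not the paper's model): because a no-skill model scores samples independently of their labels, its precision stays near the positive-class prevalence at every decision threshold, which is why the no-skill baseline appears as a flat line on a precision-recall plot.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic binary labels with ~30% positives (illustrative prevalence).
y_true = (rng.random(100_000) < 0.3).astype(int)

# A no-skill model assigns random scores, independent of the labels.
y_score = rng.random(y_true.size)

prevalence = y_true.mean()

# Precision of the no-skill model at several decision thresholds:
# it hovers around the positive-class prevalence regardless of the
# threshold, giving the flat "no skill" line on a PR curve.
precisions = []
for t in (0.2, 0.5, 0.8):
    selected = y_score >= t
    precisions.append(y_true[selected].mean())
```

A skilled model, by contrast, concentrates true positives at high scores, so its precision rises above the prevalence line as the threshold increases.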