Object-Cooperated Ternary Tree Partitioning Decision Method for Versatile Video Coding

Sujin Lee, Sang Hyo Park, Dongsan Jun

Research output: Contribution to journalArticlepeer-review

Abstract

In this paper, we propose an object-cooperated decision method for efficient ternary tree (TT) partitioning that reduces the encoding complexity of versatile video coding (VVC). In most previous studies, the VVC complexity was reduced using decision schemes based on the encoding context, which do not apply object detecion models. We assume that high-level objects are important for deciding whether complex TT partitioning is required because they can provide hints on the characteristics of a video. Herein, we apply an object detection model that discovers and extracts the high-level object features—the number and ratio of objects from frames in a video sequence. Using the extracted features, we propose machine learning (ML)-based classifiers for each TT-split direction to efficiently reduce the encoding complexity of VVC and decide whether the TT-split process can be skipped in the vertical or horizontal direction. The TT-split decision of classifiers is formulated as a binary classification problem. Experimental results show that the proposed method more effectively decreases the encoding complexity of VVC than a state-of-the-art model based on ML.

Original languageEnglish
Article number6328
JournalSensors
Volume22
Issue number17
DOIs
StatePublished - Sep 2022

Keywords

  • encoding complexity
  • machine learning
  • object detection
  • ternary tree
  • versatile video coding

Fingerprint

Dive into the research topics of 'Object-Cooperated Ternary Tree Partitioning Decision Method for Versatile Video Coding'. Together they form a unique fingerprint.

Cite this