Document Type : Research Articles

Authors

Faculty of Engineering, University of Sistan and Baluchestan, Zahedan, Iran.

Abstract

This paper proposes a novel Optimal Deep Rate Controller (ODRC) designed for intra-coding configuration of the High-Efficiency Video Coding standard. The ODRC incorporates a Convolutional Neural Network-based Rate-Quantization Model (CRQM) to effectively predict bit consumption across the entire Quantization Parameter (QP) range at the Coding Tree Unit (CTU) level. The proposed rate controller employs an optimization algorithm to minimize the buffering delay required for video communications. By establishing a specific search space through the CRQM, a greedy search algorithm is utilized to determine the optimal frame-level QP, thereby minimizing discrepancies between buffer occupancy and target occupancy. Unlike CTU-level rate controllers, which can introduce quality variations due to QP fluctuations among CTUs, the frame-level ODRC maintains consistent objective quality across CTUs within a frame. The ODRC is integrated within the standard reference software HM-16.20. Comparative evaluations with the default rate controller, RC-HM, in the same software, demonstrate the superior performance of ODRC in terms of both delay and bit error ratio. Experimental results indicate that ODRC achieves a notably lower average buffering delay of 0.02s and a lower bit error ratio of 11.25%, in contrast to RC-HM's 0.3s and 44.72%, respectively, emphasizing its effectiveness for HEVC low-delay applications.

Keywords

Main Subjects

[1] G. J. Sullivan, J.-R. Ohm, W.-J. Han, and T. Wiegand, "Overview of the high efficiency video coding (HEVC) standard," IEEE Transactions on circuits and systems for video technology, vol. 22, no. 12, pp. 1649-1668, 2012, doi: 10.1109/TCSVT.2012.2221191.

[2] T. Wiegand, G. J. Sullivan, G. Bjontegaard, and A. Luthra, "Overview of the H. 264/AVC video coding standard," IEEE Transactions on circuits and systems for video technology, vol. 13, no. 7, pp. 560-576, 2003, doi: 10.1109/TCSVT.2003.815165.

[3] I. Ahmad, V. Swaminathan, A. Aved, and S. Khalid, "An overview of rate control techniques in HEVC and SHVC video encoding," Multimedia Tools and Applications, vol. 81, no. 24, pp. 34919-34950, 2022, doi: 10.1007/s11042-021-11249-5.

[4] A. A. Ramanand, I. Ahmad, and V. Swaminathan, "A survey of rate control in HEVC and SHVC video encoding," in 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), pp. 145-150, 2017, doi: 10.1109/ICMEW.2017.8026268.

[5] J. Zou and B. Li, "Rate Control in HEVC: A Survey," in Signal and Information Processing, Networking and Computers: Proceedings of the 5th International Conference on Signal and Information Processing, Networking and Computers (ICSINC), pp. 578-583, 2019, doi: 10.1007/978-981-13-7123-3_67.

[6] H. Esmaeeli and M. Rezaei, "Methods and Criteria for Evaluating Controllability of Video Bit Rate in HEVC-SCC," International Journal of Industrial Electronics Control and Optimization, vol. 4, no. 3, pp. 313-320, 2021, doi: 10.22111/ieco.2021.37880.1344.

[7] X. Wei, M. Zhou, H. Wang, H. Yang, L. Chen, and S. Kwong, "Recent advances in rate control: From optimization to implementation and beyond," IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, no. 1, pp. 17-33, 2023, doi: 10.1109/TCSVT.2023.3287561.

[8] C. Yang, Y. Liu, Q. Liu, C. Zhang, and W. Wang, "Joint frame-level and CTU-level rate control based on constant perceptual quality," Multimedia Systems, vol. 30, no. 3, p. 133, 2024, doi: 10.1007/s00530-024-01329-5.

[9] Y. Yan, G. Xiang, H. Jia, J. Chen, X. Huang, and X. Xie, "Two-stage perceptual quality oriented rate control algorithm for HEVC," ACM Transactions on Multimedia Computing, Communications and Applications, vol. 20, no. 5, pp. 1-20, 2024, doi: 10.1145/3636510.

[10] C. Zhang and W. Gao, "Learned rate control for frame-level adaptive neural video compression via dynamic neural network," in European conference on computer vision, Springer, pp. 239-255, 2025, doi: 10.1007/978-3-031-73013-9_14.

[11] L. Jia, H. Ren, Z. Zhang, L. Song, and K. Jia, "Visual information fidelity based frame level rate control for H. 265/HEVC," Signal Processing: Image Communication, vol. 131, p. 117245, 2025, doi: 10.1016/j.image.2024.117245.

[12] Y. Zhou, L. Tian, and X. Ning, "Intra frame constant rate control scheme for high efficiency video coding," in 2013 International Conference on Computing, Networking and Communications (ICNC), pp. 648-652, 2013, doi: 10.1109/ICCNC.2013.6504163.

[13] C. Sheng, F. Chen, Z. Peng, and W. Chen, "An adaptive bit mismatch rectification algorithm for intra frame rate control in HEVC," in 2015 8th International Congress on Image and Signal Processing (CISP), pp. 80-84, 2015, doi: 10.1109/CISP.2015.7407854.

[14] B. Hosking, D. Agrafiotis, D. Bull, and N. Eastern, "An adaptive resolution rate control method for intra coding in HEVC," in 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1486-1490, 2016, doi: 10.1109/ICASSP.2016.7471924.

[15] Z. Ma, J. Xiao, T. Tian, and S. Sun, "Research and optimization for adaptive intra frame complexity rate control based on HEVC," in 2017 17th International Symposium on Communications and Information Technologies (ISCIT), pp. 1-5, 2017, doi: 10.1109/ISCIT.2017.8261179.

[16] F. Cen, Q. Lu, and W. Xu, "Efficient rate control for intraframe coding in high efficiency video coding," in 2014 International Conference on Signal Processing and Multimedia Applications (SIGMAP), pp. 54-59, 2014, doi: 10.5220/000506710054005.

[17] M. Wang, K. N. Ngan, and H. Li, "An efficient frame-content based intra frame rate control for high efficiency video coding," IEEE Signal Processing Letters, vol. 22, no. 7, pp. 896-900, 2014, doi: 10.1109/LSP.2014.2377032.

[18] P. Wang, C. Ni, G. Zhang, and K. Li, "R-Lambda model based CTU-level rate control for intra frames in HEVC," Multimedia Tools and Applications, vol. 78, no. 1, pp. 125-139, 2019, doi: 10.1007/s11042-017-5507-y.

[19] V. Sanchez, "Rate control for HEVC intra-coding based on piecewise linear approximations," in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1782-1786, 2018, doi: 10.1109/ICASSP.2018.8461970.

[20] Y. Li, B. Li, D. Liu, and Z. Chen, "A convolutional neural network-based approach to rate control in HEVC intra coding," in 2017 IEEE Visual Communications and Image Processing (VCIP), pp. 1-4, 2017, doi: 10.1109/VCIP.2017.8305050.

[21] X. Lu, B. Zhou, X. Jin, and G. Martin, "A rate control scheme for HEVC intra coding using convolution neural network (CNN)," in 2020 Data Compression Conference (DCC), pp. 382-382, 2020, doi: 10.1109/DCC47342.2020.00055.

[22] L. Wei, Z. Yang, Z. Wang, and G. Wang, "A CNN-based optimal CTU λ decision for HEVC intra rate control," IEICE TRANSACTIONS on Information and Systems, vol. 104, no. 10, pp. 1766-1769, 2021, doi: 10.1587/transinf.2021EDL8047.

[23] B. Xu, X. Pan, Y. Zhou, Y. Li, D. Yang, and Z. Chen, "CNNbased rate-distortion modeling for H. 265/HEVC," in 2017 IEEE Visual Communications and Image Processing (VCIP), pp. 1-4, 2017, doi: 10.1109/VCIP.2017.8305151.

[24] M. Santamaria, E. Izquierdo, S. Blasi, and M. Mrak, "Estimation of rate control parameters for video coding using CNN," in 2018 IEEE Visual Communications and Image Processing (VCIP), pp. 1-4, 2018, doi: 10.1109/VCIP.2018.8698721.

[25] Y. Rahimi, M. Rezaei, and P. Jafari, "Content-based coding tree unit level rate-quantization model for intra-coding in high efficiency video coding standard using convolutional neural network," Journal of Electronic Imaging, vol. 31, no. 3, pp. 033026-033026, 2022, doi: 10.1117/1.JEI.31.3.033026.

[26] J.-H. Hu, W.-H. Peng, and C.-H. Chung, "Reinforcement learning for HEVC/H. 265 intra-frame rate control," in 2018 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5, 2018, doi: 10.1109/ISCAS.2018.8351575.

[27] https://vcgit.hhi.fraunhofer.de/jvet/HM/-/tree/HM-16.20, ed.

[28] D.-T. Dang-Nguyen, C. Pasquini, V. Conotter, and G. Boato, "Raise: A raw images dataset for digital image forensics," in Proceedings of the 6th ACM multimedia systems conference, pp. 219-224, 2015, doi: 10.1145/2713168.2713194.