ILCAS: Imitation Learning-Based Configuration- Adaptive Streaming for Live Video Analytics With Cross-Camera Collaboration

Duo Wu, Dayou Zhang, Miao Zhang, Ruoyu Zhang, Fangxin Wang, Shuguang Cui

Type: Article

Publication Date: 2023-10-25

Citations: 2

DOI: https://doi.org/10.1109/tmc.2023.3327097

Abstract

The high-accuracy and resource-intensive deep neural networks (DNNs) have been widely adopted by live video analytics (VA), where camera videos are streamed over the network to resource-rich edge/cloud servers for DNN inference. Common video encoding configurations (e.g., resolution and frame rate) have been identified with significant impacts on striking the balance between bandwidth consumption and inference accuracy and therefore their adaption scheme has been a focus of optimization. However, previous profiling-based solutions suffer from high profiling cost, while existing deep reinforcement learning (DRL) based solutions may achieve poor performance due to the usage of fixed reward function for training the agent, which fails to craft the application goals in various scenarios. In this paper, we propose <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> , the first imitation learning (IL) based configuration-adaptive VA streaming system. Unlike DRL-based solutions, <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> trains the agent with demonstrations collected from the expert which is designed as an offline optimal policy that solves the configuration adaption problem through dynamic programming. To tackle the challenge of video content dynamics, <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> derives motion feature maps based on motion vectors which allow <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> to visually "perceive" video content changes. Moreover, <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> incorporates a cross-camera collaboration scheme to exploit the spatio-temporal correlations of cameras for more proper configuration selection. Extensive experiments confirm the superiority of <monospace xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">ILCAS</monospace> compared with state-of-the-art solutions, with 2-20.9% improvement of mean accuracy and 19.9–85.3% reduction of chunk upload lag.

Locations

arXiv (Cornell University) - View - PDF
DataCite API - View
IEEE Transactions on Mobile Computing - View

Similar Works

Action	Title	Year	Authors
+ PDF Chat	Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning	2019	Tianchi Huang Chao Zhou Rui-Xiao Zhang Chenglei Wu Xin Yao Lifeng Sun
+	Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction	2022	Chia-Chi Chuang Donglin Yang Chuan Wen Yang Gao
+ PDF Chat	Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle	2024	Shuoyao Wang Jiawei Lin Fangwei Ye
+	System-status-aware Adaptive Network for Online Streaming Video Understanding	2023	Lin Geng Foo Jia Gong Zhipeng Fan Jun Liu
+ PDF Chat	System-Status-Aware Adaptive Network for Online Streaming Video Understanding	2023	Lin Geng Foo Jia Gong Zhipeng Fan Jun Liu
+	Meta-Reinforcement Learning via Buffering Graph Signatures for Live Video Streaming Events	2021	Stefanos Antaris Dimitrios Rafailidis Šarūnas Girdzijauskas
+	Meta-reinforcement learning via buffering graph signatures for live video streaming events	2021	Stefanos Antaris Dimitrios Rafailidis Šarūnas Girdzijauskas
+ PDF Chat	CamTuner: Reinforcement-Learning based System for Camera Parameter Tuning to enhance Analytics	2021	Sibendu Paul Kunal Rao Giuseppe Coviello Murugan Sankaradas Oliver Po Y. Charlie Hu Srimat Chakradhar
+	Karma: Adaptive Video Streaming via Causal Sequence Modeling	2023	Bowei Xu Hao Chen Zhan Ma
+	APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning	2022	Sibendu Paul Kunal Rao Giuseppe Coviello Murugan Sankaradas Oliver Po Y. Charlie Hu Srimat Chakradhar
+	Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning	2022	Sibendu Paul Kunal Rao Giuseppe Coviello Murugan Sankaradas Oliver Po Yuanming Hu Srimat Chakradhar
+ PDF Chat	APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning	2022	Sibendu Paul Kunal Rao Giuseppe Coviello Murugan Sankaradas Oliver Po Yuanming Hu Srimat Chakradhar
+	Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning	2021	Sibendu Paul Kunal Rao Giuseppe Coviello Murugan Sankaradas Oliver Po Yuanming Hu Srimat Chakradhar
+	Collaborative Video Analytics on Distributed Edges with Multiagent Deep Reinforcement Learning	2022	Yuqi Dong Guanyu Gao Ran Wang Zhisheng Yan
+ PDF Chat	Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle	2024	Shuoyao Wang Jiawei Lin Fangwei Ye
+	Visual Imitation Learning with Calibrated Contrastive Representation	2024	Yunke Wang Linwei Tao Bo Du Yutian Lin Chang Xu
+ PDF Chat	A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline	2021	Yingying Zhao Mingzhi Dong Yujiang Wang Da Yong Feng Qin Lv Robert P. Dick Dongsheng Li Tun Lu Ning Gu Li Shang
+ PDF Chat	Heterogeneous 360 Degree Videos in Metaverse: Differentiated Reinforcement Learning Approaches	2023	Wenhan Yu Jun Zhao
+	Heterogeneous 360 Degree Videos in Metaverse: Differentiated Reinforcement Learning Approaches	2023	Wenhan Yu Jun Zhao
+	From Ember to Blaze: Swift Interactive Video Adaptation via Meta-Reinforcement Learning	2023	Xuedou Xiao Mingxuan Yan Yingying Zuo Boxi Liu Paul Ruan Yang Cao Wei Wang

Works That Cite This (0)

Action	Title	Year	Authors

Works Cited by This (12)

Action	Title	Year	Authors
+	Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting	2015	Xingjian Shi Zhourong Chen Hao Wang Dit‐Yan Yeung Wai Kin Wong Wang‐chun Woo
+ PDF Chat	Learning Spatiotemporal Features with 3D Convolutional Networks	2015	Du Tran Lubomir Bourdev Rob Fergus Lorenzo Torresani Manohar Paluri
+ PDF Chat	Real-Time Action Recognition with Enhanced Motion Vector CNNs	2016	Bowen Zhang Limin Wang Zhe Wang Yu Qiao Hanli Wang
+	Proximal Policy Optimization Algorithms	2017	John Schulman Filip Wolski Prafulla Dhariwal Alec Radford Oleg Klimov
+	Focus: Querying Large Video Datasets with Low Latency and Low Cost	2018	Kevin Hsieh Ganesh Ananthanarayanan Peter Bodík Paramvir Bahl Matthai Philipose Phillip B. Gibbons Onur Mutlu
+	Scaling Video Analytics Systems to Large Camera Deployments	2019	Samvit Jain Ganesh Ananthanarayanan Junchen Jiang Yuanchao Shu Joseph E. Gonzalez
+ PDF Chat	You Only Look Once: Unified, Real-Time Object Detection	2016	Joseph Redmon Santosh Divvala Ross Girshick Ali Farhadi
+	Generative Adversarial Imitation Learning	2016	Jonathan Ho Stefano Ermon
+ PDF Chat	ICNet for Real-Time Semantic Segmentation on High-Resolution Images	2018	Hengshuang Zhao Xiaojuan Qi Xiaoyong Shen Jianping Shi Jiaya Jia
+	CrossRoI	2021	Hongpeng Guo Shuochao Yao Zhe Yang Qian Zhou Klara Nahrstedt