How to Train Your Deep Multi-Object Tracker

Type: Preprint

Publication Date: 2020-06-01

Citations: 188

DOI: https://doi.org/10.1109/cvpr42600.2020.00682

Abstract

The recent trend in vision-based multi-object tracking (MOT) is heading towards leveraging the representational power of deep learning to jointly learn to detect and track objects. However, existing methods train only certain sub-modules using loss functions that often do not correlate with established tracking evaluation measures such as Multi-Object Tracking Accuracy (MOTA) and Precision (MOTP). As these measures are not differentiable, the choice of appropriate loss functions for end-to-end training of multi-object tracking methods is still an open research problem. In this paper, we bridge this gap by proposing a differentiable proxy of MOTA and MOTP, which we combine in a loss function suitable for end-to-end training of deep multi-object trackers. As a key ingredient, we propose a Deep Hungarian Net (DHN) module that approximates the Hungarian matching algorithm. DHN allows estimating the correspondence between object tracks and ground truth objects to compute differentiable proxies of MOTA and MOTP, which are in turn used to optimize deep trackers directly. We experimentally demonstrate that the proposed differentiable framework improves the performance of existing multi-object trackers, and we establish a new state of the art on the MOTChallenge benchmark. Our code is publicly available from https://github.com/yihongXU/deepMOT.

Locations

  • arXiv (Cornell University) - View - PDF
  • HAL (Le Centre pour la Communication Scientifique Directe) - View - PDF
  • 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) - View

Similar Works

Action Title Year Authors
+ How To Train Your Deep Multi-Object Tracker 2019 Yihong Xu
Aljoša Ošep
Yutong Ban
Radu Horaud
Laura Leal-Taixé
Xavier Alameda-Pineda
+ DeepMOT: A Differentiable Framework for Training Multiple Object Trackers 2019 Yihong Xu
Yutong Ban
Xavier Alameda-Pineda
Radu Horaud
+ FANTrack: 3D Multi-Object Tracking with Feature Association Network 2019 Doruk Erkan
Venkateshwaran Balasubramanian
Prarthana Bhattacharyya
Krzysztof Czarnecki
+ PDF Chat FANTrack: 3D Multi-Object Tracking with Feature Association Network 2019 Doruk Erkan
Venkateshwaran Balasubramanian
Prarthana Bhattacharyya
Krzysztof Czarnecki
+ FANTrack: 3D Multi-Object Tracking with Feature Association Network 2019 Doruk Erkan
Venkateshwaran Balasubramanian
Prarthana Bhattacharyya
Krzysztof Czarnecki
+ PDF Chat MCTR: Multi Camera Tracking Transformer 2024 Alexandru Niculescu-Mizil
Deep Patel
Iain Melvin
+ Quasi-Dense Similarity Learning for Multiple Object Tracking 2020 Jiangmiao Pang
Linlu Qiu
Xia Li
Haofeng Chen
Qi Li
Trevor Darrell
Fisher Yu
+ PDF Chat Quasi-Dense Similarity Learning for Multiple Object Tracking 2021 Jiangmiao Pang
Linlu Qiu
Xia Li
Haofeng Chen
Qi Li
Trevor Darrell
Fisher Yu
+ Online Multi-Object Tracking with Dual Matching Attention Networks 2019 Ji Zhu
Hua Yang
Nian Liu
Minyoung Kim
Wenjun Zhang
Ming–Hsuan Yang
+ PDF Chat FAMNet: Joint Learning of Feature, Affinity and Multi-Dimensional Assignment for Online Multiple Object Tracking 2019 Peng Chu
Haibin Ling
+ FAMNet: Joint Learning of Feature, Affinity and Multi-dimensional Assignment for Online Multiple Object Tracking 2019 Peng Chu
Haibin Ling
+ QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking 2022 Tobias Fischer
Jiangmiao Pang
Thomas E. Huang
Linlu Qiu
Haofeng Chen
Trevor Darrell
Fisher Yu
+ PDF Chat QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking 2023 Tobias Fischer
T. Huang
Jiangmiao Pang
Linlu Qiu
Haofeng Chen
Trevor Darrell
Fisher Yu
+ Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer 2022 Siddharth Sagar Nijhawan
Leo Hoshikawa
Atsushi Irie
Masakazu Yoshimura
Junji Otsuka
Takeshi Ohashi
+ End-to-End Learning Deep CRF models for Multi-Object Tracking 2019 Jun Xiang
Chao Ma
Guohan Xu
Jianhua Hou
+ DEFT: Detection Embeddings for Tracking 2021 Mohamed Chaabane
Peter Zhang
J. Ross Beveridge
Stephen O′Hara
+ PDF Chat Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking 2022 Gaoang Wang
Yizhou Wang
Renshu Gu
Weijie Hu
Jenq‐Neng Hwang
+ GCNNMatch: Graph Convolutional Neural Networks for Multi-Object Tracking via Sinkhorn Normalization 2020 Ioannis Papakis
Abhijit Sarkar
Anuj Karpatne
+ GCNNMatch: Graph Convolutional Neural Networks for Multi-Object Tracking via Sinkhorn Normalization. 2020 Ioannis Papakis
Abhijit Sarkar
Anuj Karpatne
+ PDF Chat TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model 2020 Bo Pang
Yizhuo Li
Yifan Zhang
Muchen Li
Cewu Lu

Works Cited by This (26)

Action Title Year Authors
+ PDF Chat Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor 2015 Wongun Choi
+ PDF Chat An on-line variational Bayesian model for multi-person tracking from cluttered scenes 2016 Silèye Ba
Xavier Alameda-Pineda
Alessio Xompero
Radu Horaud
+ MOT16: A Benchmark for Multi-Object Tracking 2016 Anton Milan
Laura Leal-Taixé
Ian Reid
Stefan Roth
Konrad Schindler
+ PDF Chat Performance Measures and a Data Set for Multi-target, Multi-camera Tracking 2016 Ergys Ristani
Francesco Solera
Roger S. Zou
Rita Cucchiara
Carlo Tomasi
+ Improvements to Frank-Wolfe optimization for multi-detector multi-object tracking. 2017 Roberto Henschel
Laura Leal-Taixé
Daniel Cremers
Bodo Rosenhahn
+ PDF Chat Combined image- and world-space tracking in traffic scenes 2017 Aljoša Ošep
Wolfgang Mehner
Markus Mathias
Bastian Leibe
+ Trajectory Factory: Tracklet Cleaving and Re-connection by Deep Siamese Bi-GRU for Multiple Object Tracking 2018 Cong Ma
Changshui Yang
Fan Yang
Yueqing Zhuang
Ziwei Zhang
Huizhu Jia
Xiaodong Xie
+ PDF Chat Online Multi-Object Tracking with Dual Matching Attention Networks 2018 Ji Zhu
Hua Yang
Nian Liu
Minyoung Kim
Wenjun Zhang
Ming–Hsuan Yang
+ FAMNet: Joint Learning of Feature, Affinity and Multi-dimensional Assignment for Online Multiple Object Tracking 2019 Peng Chu
Haibin Ling
+ Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks 2015 Shaoqing Ren
Kaiming He
Ross Girshick
Jian Sun