Citation: | ZHANG Yongze, DA Feipeng. A Multi-object Tracking Method Based on Dilatation Region Matching and Adaptive Trajectory Management Strategy[J]. Geomatics and Information Science of Wuhan University, 2024, 49(4): 572-581. DOI: 10.13203/j.whugis20230359 |
Multi-object tracking (MOT) is a pivotal research area within the computer vision domain. Despite significant strides in MOT research, the field continues to grapple with formidable challenges: Indistinct appearance attributes of objects,objects exhibit irregular motion, anomalies in tracking arising from rigid trajectory lifecycle management strategies. These elements substantially undermine the precision and robustness of multi-object tracking endeavors.
In response to these challenges, we present an advanced multi-object tracking algorithm that integrates dilatation intersection over union (DIOU) matching with an adaptive trajectory management approach. Initially, we introduce a metric based on a refined DIOU area for the primary matching between active trajectories and high-confidence detections, thereby improving the direct matching performance for high-quality detection boxes. Subsequently, for the re-matching of active trajectories with low-confidence detections, we implement a metric centered on a moderately dilated DIOU area, enhancing the tracking continuity of these detections. Furthermore, for reconnecting inactive trajectories with unmatched high-confidence detections, we employ a metric utilizing an extensively dilated DIOU area to bolster the probability of reactivating dormant trajectories. Lastly, an adaptive trajectory management strategy predicated on detection confidence scores is deployed to dynamically modulate the lifespan of trajectories, thereby mitigating the incidence of tracking anomalies and identity switches induced by occlusions and misidentifications.
(1) The application of the DIOU-based matching framework has yielded 5.4% increase in HOTA(higher order tracking accuracy) and a 1.5% increase in MOTA(multiple object tracking accuracy) on the DanceTrack dataset, corroborating the method's efficacy in densely populated scenes and complex motion environments. (2) The implementation of the adaptive trajectory management module has further resulted in 4.6% rise in HOTA, 0.8% elevation in MOTA, and 2.1% improvement in IDF1(identification F-score) on the DanceTrack dataset, demonstrating its capacity to efficiently counteract the limitations of fixed lifecycle sensitivities to false detections and missed detections.
Although the refinement of data association and trajectory management strategies has led to a surge in tracking accuracy, the layering of multiple strategies has introduced a trade-off with computational efficiency, curtailing the peak performance of the tracking system.
[1] |
邹北骥, 李伯洲, 刘姝. 基于中心点检测和重识别的多行人跟踪算法[J]. 武汉大学学报(信息科学版), 2021, 46(9): 1345-1353.
Zou Beiji, Li Bozhou, Liu Shu. A Multi-pedestrian Tracking Algorithm Based on Center Point Detection and Person Re-identification[J]. Geomatics and Information Science of Wuhan University, 2021, 46(9): 1345-1353.
|
[2] |
罗霄月, 王艳慧, 张兴国. 视频与GIS协同的交通违规行为分析方法[J]. 武汉大学学报(信息科学版), 2023, 48(4): 647-655.
Luo Xiaoyue,Wang Yanhui, Zhang Xingguo.A Violation Analysis Method of Traffic Targets Based on Video and GIS[J].Geomatics and Information Scien‑ce of Wuhan University, 2023, 48(4): 647-655.
|
[3] |
张星, 刘涛, 孙龙培, 等. 一种视觉与惯性协同的室内多行人目标定位方法[J].武汉大学学报(信息科学版),2021,46(5): 672-680.
Zhang Xing,Liu Tao,Sun Longpei, et al. A Visual-Inertial Collaborative Indoor Localization Method for Multiple Moving Pedestrian Targets[J]. Geomatics and Information Science of Wuhan University, 2021, 46(5): 672-680.
|
[4] |
Bai H X, Cheng W S, Chu P, et al. GMOT-40: A Benchmark for Generic Multiple Object Tracking[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, USA, 2021.
|
[5] |
Cao J K, Pang J M, Weng X S, et al. Observation-Centric SORT: Rethinking SORT for Robust Multi-object Tracking[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, Canada, 2023.
|
[6] |
Zeng F G, Dong B, Zhang Y A, et al. MOTR: End-to-End Multiple-Object Tracking with Transformer[M]//The 17th European Conference on Computer Vision, Tel-Aviv, Israel, 2022.
|
[7] |
Yu F W, Li W B, Li Q Q, et al. POI: Multiple Object Tracking with High Performance Detection and Appearance Feature[M]//The 14th European Conference on Computer Vision,Amsterdam,Netherlands, 2016.
|
[8] |
Ren S,He K,Girshick R,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. Advances in Neural Information Processing Systems, 2015, 28(10):489-502.
|
[9] |
Zhang Y F, Wang C Y, Wang X G, et al. FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking[J]. International Journal of Computer Vision,2021,129(11):3069-3087.
|
[10] |
Bochinski E, Eiselein V, Sikora T. High-Speed Tracking-by-Detection Without Using Image Information[C]//The 14th IEEE International Conferen‑ce on Advanced Video and Signal Based Surveillance, Lecce, Italy, 2017.
|
[11] |
Zheng L, Bie Z, Sun Y F, et al. MARS: A Video Benchmark for Large-Scale Person Re-identification[C]//The14th European Conference on Computer Vision, Amsterdam, Netherlands, 2016.
|
[12] |
Bewley A,Ge Z Y,Ott L,et al.Simple Online and Realtime Tracking[C]//The 23rd IEEE International Conference on Image Processing,Phoenix,USA, 2016.
|
[13] |
Wojke N, Bewley A, Paulus D. Simple Online and Realtime Tracking with a Deep Association Metric[C]//IEEE International Conference on Image Processing, Beijing, China, 2017.
|
[14] |
Chen L, Ai H Z, Zhuang Z J, et al. Real-Time Multiple People Tracking with Deeply Learned Candidate Selection and Person Re-Identification[C]//IEEE International Conference on Multimedia and Expo, San Diego, USA, 2018.
|
[15] |
Wang Z D, Zheng L, Liu Y X, et al.Towards Real-Time Multi-object Tracking[C]//The 18th European Conference on Computer Vision, Munich, Germa
|