TrackRefine: A Plug-and-Play Decoupled Enhancement Framework for Online Multi-Object Tracking and Segmentation
Highlights
- We present TrackRefine, a plug-and-play, decoupled framework that enhances online multi-object tracking and segmentation without requiring modifications to the front-end instance segmenter or additional end-to-end joint training.
- Improving mask quality, memory reliability, and progressive association jointly is beneficial for enhancing robustness in complex scenes with occlusion, dense targets, and long-term trajectory interruptions.
- The experimental results demonstrate that online MOTS performance can be effectively improved through modular back-end enhancement rather than relying solely on end-to-end joint training.
- Owing to its plug-and-play and decoupled characteristics, TrackRefine is easier to deploy, migrate, and extend in real-world applications such as autonomous driving, surveillance, and robotic perception.
Abstract
1. Introduction
- (1)
- A lightweight mask refinement module based on Fast GrabCut is introduced, locally refining initial masks without altering the front-end instance segmenter, thereby providing more stable foreground representations for subsequent feature extraction and cross-frame association.
- (2)
- An enhanced multimodal long-short-term memory bank is constructed to jointly model appearance, semantic, and shape information. By integrating selective updating with a quality-aware gating mechanism, memory contamination caused by occlusion and background interference is alleviated, thereby improving identity discrimination when targets reappear after long-term occlusion.
- (3)
- A progressive three-stage association strategy is designed, in which a dynamic buffering and candidate filtering mechanism is introduced in the second stage, and additional spatial constraints are imposed during long-term recovery, enabling stable progressive association from high-confidence geometric matching to long-term lost trajectory recovery.
2. Background and Motivation
2.1. Multi-Object Tracking and Segmentation Methods
2.2. Mask Quality and Mask Refinement in Online MOTS
2.3. Memory Mechanisms and Multimodal Feature Representations
2.4. Data Association and Lost Trajectory Recovery
3. Methods
3.1. Overall Framework
- a lightweight mask refinement module for improving the quality of initial instance masks;
- an enhanced multimodal long-short-term memory bank for maintaining more stable identity representations during long-term occlusion and target reappearance; and
- a progressive three-stage association strategy for achieving hierarchical association from high-confidence geometric matching to long-term lost-track recovery.
| Algorithm 1. Overall Pipeline of TrackRefine |
| Input: Video sequence , front-end instance segmenter , memory bank , thresholds Output: Trajectory set 1 Initialize and 2 for to do 3 Obtain frame-wise detections: |
| 4 Suppress highly overlapped duplicate masks in 5 Refine initial masks by local Fast GrabCut: |
| 6 Extract appearance feature , semantic feature , and shape feature from 7 Fuse semantic and shape cues: |
| 8 Construct multimodal feature: |
| 9 Construct the multimodal feature set: |
| 10 Predict existing trajectory states using Kalman filtering and camera motion compensation 11 Partition trajectories into active, recently unmatched, and lost states 12 Perform progressive three-stage association: |
| 13 for each matched pair do 14 Update the state, box, mask, and identity of using 15 Estimate the observation quality 16 Selectively update short-term memory and long-term prototypes: |
| 17 end for 18 Mark unmatched trajectories as occluded or lost according to missing duration 19 Initialize new tentative trajectories for unmatched detections in 20 Remove expired trajectories from 21 end for 22 |
3.2. Lightweight Mask Refinement
3.2.1. Instance Overlap Suppression
3.2.2. Mask Refinement Based on Fast GrabCut
3.3. Enhanced Multimodal Long Short-Term Memory Bank
3.3.1. Multimodal Feature Representation
3.3.2. Long-Short-Term Memory Structure
3.3.3. Selective Update
3.3.4. Overall Pipeline of Memory Bank Update Mechanism
3.4. Progressive Three-Stage Association Strategy
Progressive Three-Stage Matching Strategy
4. Experiments
4.1. Experimental Settings
4.2. Datasets and Evaluation Metrics
4.3. Comparison Experiments on the MOTS20 and KITTI MOTS Dataset
4.3.1. MOTS20 Dataset
4.3.2. KITTI MOTS Dataset
4.3.3. Statistical Reliability Analysis on MOTS20
4.3.4. Qualitative Visualization Results
4.4. Performance Analysis with Different Front-End Instance Segmenters
4.5. Component Ablation Study
4.5.1. Results Analysis
4.5.2. Discussion
4.6. Hyperparameter Ablation Study
4.7. Computational Complexity and Latency Analysis
5. Conclusions and Future Work
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| MOTS | Multi-Object Tracking and Segmentation |
| MOT | Multi-Object Tracking |
| IoU | Intersection over Union |
| HOTA | Higher Order Tracking Accuracy |
| MOTSA | Multi-Object Tracking and Segmentation Accuracy |
| sMOTSA | soft Multi-Object Tracking and Segmentation Accuracy |
| IDSW | Identity Switches |
| Frag | Fragmentations |
References
- Zhang, W.; Li, J.; Xia, M.; Gao, X.; Tan, X.; Shi, Y.; Huang, Z.; Li, G. OffsetNet: Towards Efficient Multiple Object Tracking, Detection, and Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 949–960. [Google Scholar] [CrossRef] [PubMed]
- Qian, H.; Wang, M.; Zhu, M.; Wang, H. A Review of Multi-Sensor Fusion in Autonomous Driving. Sensors 2025, 25, 6033. [Google Scholar] [CrossRef] [PubMed]
- Mirzaei, B.; Nezamabadi-Pour, H.; Raoof, A.; Derakhshani, R. Small Object Detection and Tracking: A Comprehensive Review. Sensors 2023, 23, 6887. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Zhu, R.; Ye, Z.; Li, Y. Robot Object Detection and Tracking Based on Image-Point Cloud Instance Matching. Sensors 2026, 26, 718. [Google Scholar] [CrossRef] [PubMed]
- Voigtlaender, P.; Krause, M.; Osep, A.; Luiten, J.; Sekar, B.B.G.; Geiger, A.; Leibe, B. MOTS: Multi-Object Tracking and Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019. [Google Scholar]
- Qin, Z.; Wang, L.; Zhou, S.; Fu, P.; Hua, G.; Tang, W. Towards Generalizable Multi-Object Tracking. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 17–21 June 2024; pp. 18995–19004. [Google Scholar]
- Li, S.; Ke, L.; Danelljan, M.; Piccinelli, L.; Segu, M.; Van Gool, L.; Yu, F. Matching Anything by Segmenting Anything. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 17–21 June 2024; pp. 18963–18973. [Google Scholar]
- Ravi, N.; Gabeur, V.; Hu, Y.-T.; Hu, R.; Ryali, C.; Ma, T.; Khedr, H.; Rädle, R.; Rolland, C.; Gustafson, L.; et al. SAM 2: Segment Anything in Images and Videos. In Proceedings of the International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]
- Jiang, J.; Wang, Z.; Zhao, M.; Li, Y.; Jiang, D. SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation. Proc. AAAI Conf. Artif. Intell. 2026, 40, 5388–5396. [Google Scholar] [CrossRef]
- Zhang, T.; Zhang, Q.; Ding, G.; Han, J. Tracking and Segmenting Anything in Any Modality. Proc. AAAI Conf. Artif. Intell. 2026, 40, 12762–12770. [Google Scholar] [CrossRef]
- Gao, R.; Wang, L. MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 2–6 October 2023. [Google Scholar]
- Cheng, H.K.; Oh, S.W.; Price, B.; Schwing, A.; Lee, J.Y. Putting the Object Back into Video Object Segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 17–21 June 2024. [Google Scholar]
- Yang, M.; Han, G.; Yan, B.; Zhang, W.; Qi, J.; Lu, H.; Wang, D. Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 20–27 February 2024. [Google Scholar]
- Segu, M.; Piccinelli, L.; Li, S.; Yang, Y.-H.; Van Gool, L.; Schiele, B. Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking. In Proceedings of the International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]
- Luo, W.; Yan, F.; Zhong, Y.; Gan, Y.; Ma, L. CO-MOT: Boosting End-to-End Transformer-Based Multi-Object Tracking via Coopetition Label Assignment and Shadow Sets. In Proceedings of the International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]
- Brasó, G.; Cetintas, O.; Leal-Taixé, L. Multi-Object Tracking and Segmentation via Neural Message Passing. Int. J. Comput. Vis. 2022, 130, 3035–3053. [Google Scholar] [CrossRef]
- Xu, Z.; Yang, W.; Zhang, W.; Tan, X.; Huang, H.; Huang, L. Segment as Points for Efficient and Effective Online Multi-Object Tracking and Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 6424–6437. [Google Scholar] [CrossRef] [PubMed]
- Gao, Y.; Xu, H.; Zheng, Y.; Li, J.; Gao, X. An Object Point Set Inductive Tracker for Multi-Object Tracking and Segmentation. IEEE Trans. Image Process. 2022, 31, 6083–6096. [Google Scholar] [CrossRef] [PubMed]
- Galoaa, B.; Bai, X.; Nandi, U.; Rangoju, S.S.V.D.; Amraee, S.; Ostadabbas, S. UniTrack: Differentiable Graph Representation Learning for Multi-Object Tracking. In Proceedings of the International Conference on Learning Representations, Rio de Janeiro, Brazil, 23–27 April 2026. [Google Scholar]
- Kirillov, A.; Mintun, E.; Ravi, N.; Mao, H.; Rolland, C.; Gustafson, L.; Xiao, T.; Whitehead, S.; Berg, A.C.; Lo, W.-Y.; et al. Segment Anything. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 2–6 October 2023; pp. 4015–4026. [Google Scholar]
- Ding, S.; Schuster, R.; Lin, W.; Cremers, D. SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Honolulu, HI, USA, 19–20 October 2025. [Google Scholar]
- Krähenbühl, P.; Koltun, V. Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials. In Proceedings of the Advances in Neural Information Processing Systems, Granada, Spain, 12–17 December 2011. [Google Scholar]
- Rother, C.; Kolmogorov, V.; Blake, A. “GrabCut”: Interactive Foreground Extraction Using Iterated Graph Cuts. ACM Trans. Graph. 2004, 23, 309–314. [Google Scholar] [CrossRef]
- Cai, J.; Xu, M.; Li, W.; Xiong, Y.; Xia, W.; Tu, Z.; Soatto, S. MeMOT: Multi-Object Tracking with Memory. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022. [Google Scholar]
- Wojke, N.; Bewley, A.; Paulus, D. Simple Online and Realtime Tracking with a Deep Association Metric. In Proceedings of the IEEE International Conference on Image Processing, Beijing, China, 17–20 September 2017. [Google Scholar]
- Li, Y.; Li, Q.; Wang, H.; Ma, X.; Yao, J.; Dong, S.; Fan, H.; Zhang, L. Beyond MOT: Semantic Multi-Object Tracking. In Proceedings of the European Conference on Computer Vision, Milan, Italy, 29 September–4 October 2024; pp. 276–293. [Google Scholar]
- Sun, P.; Cao, J.; Jiang, Y.; Yuan, Z.; Bai, S.; Kitani, K.; Luo, P. DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022. [Google Scholar]
- Ren, H.; Han, S.; Ding, H.; Zhang, Z.; Wang, H.; Wang, F. Focus on Details: Online Multi-Object Tracking with Diverse Fine-Grained Representation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 18–22 June 2023. [Google Scholar]
- Gao, R.; Qi, J.; Wang, L. Multiple Object Tracking as ID Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville TN, USA, 11–15 June 2025; pp. 27883–27893. [Google Scholar]
- Zhang, Y.; Sun, P.; Jiang, Y.; Yu, D.; Weng, F.; Yuan, Z.; Luo, P.; Liu, W.; Wang, X. ByteTrack: Multi-Object Tracking by Associating Every Detection Box. In Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel, 23–27 October 2022. [Google Scholar]
- Aharon, N.; Orfaig, R.; Bobrovsky, B.-Z. BoT-SORT: Robust Associations Multi-Pedestrian Tracking. arXiv 2022, arXiv:2206.14651. [Google Scholar]
- Zhu, Z.; Nie, J.; Wu, H.; He, Z.; Gao, M. MSA-MOT: Multi-Stage Association for 3D Multimodality Multi-Object Tracking. Sensors 2022, 22, 8650. [Google Scholar]
- Li, Y.; Zhan, L.; Wu, L.; Liu, H.; Min, J.; Wang, X. Re-Identification Assistance and Multi-Stage Association for Pedestrian Multi-Object Tracking. Sci. Rep. 2025, 15, 22533. [Google Scholar] [CrossRef] [PubMed]
- Ultralytics. Ultralytics YOLO26. Available online: https://www.ultralytics.com/ (accessed on 29 April 2026).
- Ultralytics. YOLO26 Training Recipe. Available online: https://docs.ultralytics.com/guides/yolo26-training-recipe/ (accessed on 29 April 2026).
- MOTSChallenge Dataset. MOTS: Multi-Object Tracking and Segmentation. Available online: https://www.vision.rwth-aachen.de/page/mots (accessed on 29 April 2026).
- KITTI MOTS Dataset. Multi-Object Tracking and Segmentation. Available online: https://www.cvlibs.net/datasets/kitti/eval_mots.php (accessed on 29 April 2026).
- Codabench. Competition 10051. Available online: https://www.codabench.org/competitions/10051/ (accessed on 29 April 2026).
- Xu, Z.; Meng, A.; Shi, Z.; Yang, W.; Chen, Z.; Huang, L. Continuous Copy-Paste for One-Stage Multi-Object Tracking and Segmentation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 15323–15332. [Google Scholar]
- Yan, L.; Wang, Q.; Ma, S.; Wang, J.; Yu, C. Solve the Puzzle of Instance Segmentation in Videos: A Weakly Supervised Framework with Spatio-Temporal Collaboration. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 393–406. [Google Scholar] [CrossRef]
- Mendonça, D.; Barros, T.; Premebida, C.; Nunes, U.J. Seg2Track-SAM2: SAM2-Based Multi-Object Tracking and Segmentation. arXiv 2026, arXiv:2509.11772. [Google Scholar]








| Method | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|
| TrackRCNN [5] | 40.6 | 42.4 | 55.2 | 576 | 868 |
| MPNTrackSeg [16] | 58.7 | 68.8 | 73.7 | 202 | 858 |
| CCPNet [39] | 59.3 | 58.1 | 75.5 | 484 | 645 |
| PointTrack [17] | 62.3 | 42.9 | 76.8 | 541 | 868 |
| OPITrack [18] | 63.5 | 45.4 | 75.5 | 342 | 769 |
| TrackRefine (YOLO11x) | 67.3 | 65.8 | 83.3 | 324 | 508 |
| TrackRefine (YOLO26x) | 69.4 | 63.9 | 82.7 | 325 | 478 |
| Method | sMOTSA () | MOTSP () | MOTSA () | IDSW () | HOTA () | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Car | Ped | Car | Ped | Car | Ped | Car | Ped | Car | Ped | |
| TrackRCNN [5] | 67.0 | 47.3 | 85.1 | 74.6 | 79.7 | 66.1 | 692 | 482 | 56.6 | 41.9 |
| STC-Seg [40] | 66.2 | 42.6 | 82.8 | 75.6 | 81.1 | 57.7 | 676 | 408 | 62.8 | 43.9 |
| Seg2Track-SAM2 [41] | 68.8 | 49.7 | 86.2 | 77.4 | 81.0 | 68.1 | 95 | 79 | 74.1 | 60.0 |
| OPITrack [18] | 78.0 | 61.1 | 87.2 | 81.3 | 90.4 | 75.8 | 542 | 234 | 73.0 | 60.4 |
| PointTrack [17] | 78.5 | 61.5 | 87.1 | 81.0 | 90.9 | 76.5 | 346 | 176 | 62.0 | 54.4 |
| TrackRefine (YOLO11x) | 74.1 | 59.8 | 87.0 | 78.8 | 85.5 | 76.7 | 366 | 374 | 66.7 | 51.4 |
| TrackRefine (YOLO26x) | 73.7 | 62.4 | 88.0 | 80.7 | 85.4 | 78.0 | 494 | 363 | 66.9 | 53.5 |
| Sequence | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|
| MOTS20-01 | 67.8 | 62.4 | 85.5 | 23 | 33 |
| MOTS20-06 | 73.1 | 64.0 | 85.7 | 164 | 203 |
| MOTS20-07 | 66.6 | 57.5 | 80.1 | 96 | 176 |
| MOTS20-12 | 69.8 | 76.2 | 81.7 | 44 | 69 |
| Mean ± Std. | 69.3 ± 2.8 | 65.0 ± 7.9 | 83.3 ± 2.8 | 81.8 ± 62.8 | 120.3 ± 82.0 |
| CV (%) | 4.1 | 12.2 | 3.4 | - | - |
| Official Combined | 69.4 | 63.9 | 82.7 | 325 | 478 |
| Segmenter | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|
| RF-DETR 2XL | 58.2 | 62.0 | 74.2 | 543 | 571 |
| YOLOv8x-seg | 65.8 | 65.5 | 81.6 | 312 | 490 |
| YOLOv9c-seg | 64.5 | 64.6 | 80.1 | 337 | 574 |
| YOLOv9e-seg | 66.4 | 66.6 | 81.9 | 302 | 466 |
| YOLO11x-seg | 67.3 | 65.8 | 83.3 | 324 | 508 |
| YOLO26x-seg | 69.4 | 63.9 | 82.7 | 325 | 478 |
| Mask Refinement | Memory Bank | Association | sMOTSA (↑) | IDF1 (↑) | MOTSA (↑) | IDSW (↓) | Frag (↓) | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Appearance | Semantic | Shape | Two-Stage | Three-Stage | ||||||
| ✗ | ✓ | ✗ | ✗ | ✓ | ✗ | 62.2 | 56.5 | 75.0 | 626 | 763 |
| ✗ | ✓ | ✗ | ✗ | ✗ | ✓ | 65.8 | 57.4 | 79.7 | 512 | 513 |
| ✓ | ✓ | ✗ | ✗ | ✗ | ✓ | 68.0 | 62.4 | 80.9 | 359 | 509 |
| ✓ | ✓ | ✓ | ✗ | ✗ | ✓ | 69.1 | 63.4 | 82.4 | 362 | 511 |
| ✓ | ✓ | ✓ | ✓ | ✗ | ✓ | 69.4 | 63.9 | 82.7 | 325 | 478 |
| ✗ | ✓ | ✓ | ✓ | ✗ | ✓ | 68.4 | 63.6 | 82.6 | 322 | 491 |
| ✓ | ✓ | ✓ | ✓ | ✓ | ✗ | 65.1 | 63.9 | 77.7 | 401 | 472 |
| Memory Bank Update Threshold Setting | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|
| (0.3, 0.4) | 69.4 | 63.9 | 82.7 | 325 | 478 |
| (0.4, 0.5) | 68.5 | 64.6 | 81.5 | 312 | 460 |
| (0.5, 0.6) | 66.7 | 64.2 | 79.1 | 291 | 428 |
| Dynamic IoU Weights () | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|
| (0.25, 0.20, 0.15) | 67.5 | 63.1 | 80.3 | 305 | 438 |
| (0.30, 0.20, 0.10) | 68.5 | 63.9 | 81.5 | 315 | 464 |
| (0.45, 0.30, 0.15) | 69.4 | 63.9 | 82.7 | 325 | 478 |
| Parameter | Value | sMOTSA () | IDF1 () | MOTSA () | IDSW () | Frag () |
|---|---|---|---|---|---|---|
| 0.1 | 69.3 | 63.5 | 82.7 | 327 | 478 | |
| 0.5 | 69.3 | 63.9 | 82.6 | 330 | 485 | |
| 0.9 (default) | 69.4 | 63.9 | 82.7 | 325 | 478 | |
| 0.5 | 69.3 | 64.5 | 82.6 | 322 | 487 | |
| 0.7 (default) | 69.4 | 63.9 | 82.7 | 325 | 478 | |
| 0.9 | 69.0 | 63.9 | 82.4 | 362 | 477 | |
| 128 | 68.7 | 63.1 | 82.5 | 331 | 483 | |
| 256 (default) | 69.4 | 63.9 | 82.7 | 325 | 478 | |
| 384 | 69.4 | 64.0 | 82.7 | 327 | 485 |
| Module | Min Latency (ms) | Max Latency (ms) | Mean Latency (ms) |
|---|---|---|---|
| Fast GrabCut Refinement | 67.4 | 172.5 | 105.9 |
| Memory Bank | 5.01 | 33.5 | 9.6 |
| Three-Stage Association | 113.6 | 583.6 | 285.4 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Qie, L.; Chai, C.; Wang, R.; Bi, C.; Ma, R.; Zhang, A.; Tang, J. TrackRefine: A Plug-and-Play Decoupled Enhancement Framework for Online Multi-Object Tracking and Segmentation. Sensors 2026, 26, 3696. https://doi.org/10.3390/s26123696
Qie L, Chai C, Wang R, Bi C, Ma R, Zhang A, Tang J. TrackRefine: A Plug-and-Play Decoupled Enhancement Framework for Online Multi-Object Tracking and Segmentation. Sensors. 2026; 26(12):3696. https://doi.org/10.3390/s26123696
Chicago/Turabian StyleQie, Longfei, Chunlei Chai, Ruixue Wang, Chao Bi, Ruiqi Ma, Aijun Zhang, and Jiakui Tang. 2026. "TrackRefine: A Plug-and-Play Decoupled Enhancement Framework for Online Multi-Object Tracking and Segmentation" Sensors 26, no. 12: 3696. https://doi.org/10.3390/s26123696
APA StyleQie, L., Chai, C., Wang, R., Bi, C., Ma, R., Zhang, A., & Tang, J. (2026). TrackRefine: A Plug-and-Play Decoupled Enhancement Framework for Online Multi-Object Tracking and Segmentation. Sensors, 26(12), 3696. https://doi.org/10.3390/s26123696

