MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network
Abstract
:1. Introduction
2. Related Works
2.1. Detection-Based Methods
2.2. Regression-Based Methods
2.3. Density Map Estimation Methods
3. Methods
3.1. Multi-Resolution Network
3.2. Multi-Scale Feature Selection Module
3.3. Dynamic Sparse Attention Mechanism
4. Experimentation
4.1. Datasets
4.2. Metrics
4.3. Environment of the Experiment
4.4. Analysis of the Results of Dense Crowd Counting
4.5. More Dense Counting Application Results
4.6. Ablation Experiment
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Handte, M.; Iqbal, M.U.; Wagner, S.; Apolinarski, W.; Marrón, P.J.; Navarro, E.M.M.; Martinez, S.; Barthelemy, S.I.; Fernández, M.G. Crowd Density Estimation for Public Transport Vehicles. In Proceedings of the EDBT/ICDT Workshops, Athens, Greece, 24–28 March 2014; pp. 315–322. [Google Scholar]
- Hussain, N.; Yatim, H.S.M.; Hussain, N.L.; Yan, J.L.S.; Haron, F.J.S.S. CDES: A pixel-based crowd density estimation system for Masjid al-Haram. Saf. Sci. 2011, 49, 824–833. [Google Scholar] [CrossRef]
- Wang, Z.; Liu, H.; Qian, Y.; Xu, T. Crowd density estimation based on local binary pattern co-occurrence matrix. In Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, Melbourne, VIC, Australia, 9–13 July 2012; pp. 372–377. [Google Scholar]
- Yuan, Y.; Qiu, C.; Xi, W.; Zhao, J. Crowd density estimation using wireless sensor networks. In Proceedings of the 2011 Seventh International Conference on Mobile Ad-Hoc and Sensor Networks, Beijing, China, 16–18 December 2011; pp. 138–145. [Google Scholar]
- Marsden, M.; McGuinness, K.; Little, S.; Keogh, C.E.; O’Connor, N.E. People, penguins and petri dishes: Adapting object counting models to new visual domains and object types without forgetting. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 8070–8079. [Google Scholar]
- Mundhenk, T.N.; Konjevod, G.; Sakla, W.A.; Boakye, K. A large contextual dataset for classification, detection and counting of cars with deep learning. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Part III 14. pp. 785–800. [Google Scholar]
- Arteta, C.; Lempitsky, V.; Zisserman, A. Counting in the wild. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Part VII 14. pp. 483–498. [Google Scholar]
- Barbedo, J.G.A.; Koenigkan, L.V.; Santos, P.M.; Ribeiro, A.R.B.J.S. Counting cattle in UAV images—Dealing with clustered animals and animal/background contrast changes. Sensors 2020, 20, 2126. [Google Scholar] [CrossRef] [PubMed]
- Laradji, I.H.; Rostamzadeh, N.; Pinheiro, P.O.; Vazquez, D.; Schmidt, M. Where are the blobs: Counting by localization with point supervision. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 547–562. [Google Scholar]
- Lu, H.; Cao, Z.; Xiao, Y.; Zhuang, B.; Shen, C.J.P.m. TasselNet: Counting maize tassels in the wild via local counts regression network. Plant Methods 2017, 13, 79. [Google Scholar] [CrossRef] [PubMed]
- Ma, Y.; Sanchez, V.; Guha, T. Fusioncount: Efficient crowd counting via multiscale feature fusion. In Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16–19 October 2022; pp. 3256–3260. [Google Scholar]
- Meng, C.; Kang, C.; Lyu, L. Hierarchical feature aggregation network with semantic attention for counting large-scale crowd. Int. J. Intell. Syst. 2022, 37, 9957–9981. [Google Scholar] [CrossRef]
- Wang, M.; Cai, H.; Han, X.-F.; Zhou, J.; Gong, M. STNet: Scale tree network with multi-level auxiliator for crowd counting. IEEE Trans. Multimed. 2022, 25, 2074–2084. [Google Scholar] [CrossRef]
- Cheng, Z.-Q.; Li, J.-X.; Dai, Q.; Wu, X.; Hauptmann, A.G. Learning spatial awareness to improve crowd counting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6152–6161. [Google Scholar]
- Du, Z.; Shi, M.; Deng, J.; Zafeiriou, S. Redesigning multi-scale neural network for crowd counting. IEEE Trans. Image Process. 2023, 32, 3664–3678. [Google Scholar] [CrossRef]
- Ma, Z.; Wei, X.; Hong, X.; Gong, Y. Learning scales from points: A scale-aware probabilistic model for crowd counting. In Proceedings of the 28th ACM International Conference on Multimedia, Seattle, WA, USA, 12–16 October 2020; pp. 220–228. [Google Scholar]
- Newell, A.; Yang, K.; Deng, J. Stacked hourglass networks for human pose estimation. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Part VIII 14. pp. 483–499. [Google Scholar]
- Xiao, B.; Wu, H.; Wei, Y. Simple baselines for human pose estimation and tracking. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 466–481. [Google Scholar]
- Liu, X.; Li, G.; Han, Z.; Zhang, W.; Yang, Y.; Huang, Q.; Sebe, N. Exploiting sample correlation for crowd counting with multi-expert network. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 3215–3224. [Google Scholar]
- Song, Q.; Wang, C.; Wang, Y.; Tai, Y.; Wang, C.; Li, J.; Wu, J.; Ma, J. To choose or to fuse? Scale selection for crowd counting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; pp. 2576–2583. [Google Scholar]
- Subburaman, V.B.; Descamps, A.; Carincotte, C. Counting people in the crowd using a generic head detector. In Proceedings of the 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance, Beijing, China, 18–21 September 2012; pp. 470–475. [Google Scholar]
- Viola; Jones; Snow. Detecting pedestrians using patterns of motion and appearance. In Proceedings of the Ninth IEEE International Conference on Computer Vision, Nice, France, 13–16 October 2003; Volume 732, pp. 734–741. [Google Scholar]
- Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–25 June 2005; pp. 886–893. [Google Scholar]
- Enzweiler, M.; Gavrila, D.M. Monocular pedestrian detection: Survey and experiments. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 31, 2179–2195. [Google Scholar] [CrossRef]
- Leibe, B.; Seemann, E.; Schiele, B. Pedestrian detection in crowded scenes. In Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA, 20–25 June 2005; pp. 878–885. [Google Scholar]
- Tuzel, O.; Porikli, F.; Meer, P. Pedestrian detection via classification on riemannian manifolds. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 1713–1727. [Google Scholar] [CrossRef]
- Felzenszwalb, P.F.; Girshick, R.B.; McAllester, D.; Ramanan, D. Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 32, 1627–1645. [Google Scholar] [CrossRef]
- Lin, S.-F.; Chen, J.-Y.; Chao, H.-X. Estimation of number of people in crowded scenes using perspective transformation. IEEE Trans. Syst. Man Cybern. Part A Syst. Humans 2001, 31, 645–654. [Google Scholar]
- Wu, B.; Nevatia, R. Detection and tracking of multiple, partially occluded humans by bayesian combination of edgelet based part detectors. Int. J. Comput. Vis. 2007, 75, 247–266. [Google Scholar] [CrossRef]
- Li, M.; Zhang, Z.; Huang, K.; Tan, T. Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection. In Proceedings of the 2008 19th International Conference on Pattern Recognition, Tampa, FL, USA, 8–11 December 2008; pp. 1–4. [Google Scholar]
- Zhao, T.; Nevatia, R.; Wu, B. Segmentation and tracking of multiple humans in crowded environments. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 1198–1211. [Google Scholar] [CrossRef]
- Ge, W.; Collins, R.T. Marked point processes for crowd counting. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 2913–2920. [Google Scholar]
- Zhao, T.; Nevatia, R. Bayesian human segmentation in crowded situations. In Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Madison, WI, USA, 18–20 June 2003; pp. II–459. [Google Scholar]
- Cheng, Z.; Qin, L.; Huang, Q.; Yan, S.; Tian, Q.J.N. Recognizing human group action by layered model with multiple cues. Neurocomputing 2014, 136, 124–135. [Google Scholar] [CrossRef]
- Paragios, N.; Ramesh, V. A MRF-based approach for real-time subway monitoring. In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Kauai, HI, USA, 8–14 December 2001; pp. I-1034–I-1040. [Google Scholar]
- Chan, A.B.; Liang, Z.-S.J.; Vasconcelos, N. Privacy preserving crowd monitoring: Counting people without people models or tracking. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; pp. 1–7. [Google Scholar]
- Bilmes, J.A.; Bartels, C. Graphical model architectures for speech recognition. IEEE Signal Process. Mag. 2005, 22, 89–100. [Google Scholar] [CrossRef]
- Lempitsky, V.; Zisserman, A. Learning to count objects in images. Proc. Adv. Neural Inf. Process. Syst. 2010, 23, 1324–1332. [Google Scholar]
- Rodriguez, M.; Laptev, I.; Sivic, J.; Audibert, J.-Y. Density-aware person detection and tracking in crowds. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; pp. 2423–2430. [Google Scholar]
- Pham, V.-Q.; Kozakaya, T.; Yamaguchi, O.; Okada, R. Count forest: Co-voting uncertain number of targets using random forest for crowd density estimation. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 3253–3261. [Google Scholar]
- Wang, Y.; Zou, Y. Fast visual object counting via example-based density estimation. In Proceedings of the 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25–28 September 2016; pp. 3653–3657. [Google Scholar]
- Zhu, L.; Wang, X.; Ke, Z.; Zhang, W.; Lau, R.W. Biformer: Vision transformer with bi-level routing attention. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 10323–10333. [Google Scholar]
- Sun, K.; Xiao, B.; Liu, D.; Wang, J. Deep high-resolution representation learning for human pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 5693–5703. [Google Scholar]
- Zhang, Y.; Zhou, D.; Chen, S.; Gao, S.; Ma, Y. Single-image crowd counting via multi-column convolutional neural network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 589–597. [Google Scholar]
- Idrees, H.; Tayyab, M.; Athrey, K.; Zhang, D.; Al-Maadeed, S.; Rajpoot, N.; Shah, M. Composition loss for counting, density map estimation and localization in dense crowds. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 532–546. [Google Scholar]
- Wang, Q.; Gao, J.; Lin, W.; Li, X. NWPU-crowd: A large-scale benchmark for crowd counting and localization. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 2141–2149. [Google Scholar] [CrossRef]
- Guerrero-Gómez-Olmedo, R.; Torre-Jiménez, B.; López-Sastre, R.; Maldonado-Bascón, S.; Onoro-Rubio, D. Extremely overlapping vehicle counting. In Proceedings of the Pattern Recognition and Image Analysis: 7th Iberian Conference, IbPRIA 2015, Santiago de Compostela, Spain, 17–19 June 2015; Proceedings 7. pp. 423–431. [Google Scholar]
- Liu, W.; Salzmann, M.; Fua, P. Context-aware crowd counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 5099–5108. [Google Scholar]
- Wang, Q.; Gao, J.; Lin, W.; Yuan, Y. Learning from synthetic data for crowd counting in the wild. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 8198–8207. [Google Scholar]
- Xiong, H.; Lu, H.; Liu, C.; Liu, L.; Cao, Z.; Shen, C. From open set to closed set: Counting objects by spatial divide-and-conquer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Long Beach, CA, USA, 15–20 June 2019; pp. 8362–8371. [Google Scholar]
- Ma, Z.; Wei, X.; Hong, X.; Gong, Y. Bayesian loss for crowd count estimation with point supervision. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Long Beach, CA, USA, 15–20 June 2019; pp. 6142–6151. [Google Scholar]
- Sindagi, V.A.; Patel, V.M. Multi-level bottom-top and top-bottom feature fusion for crowd counting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Long Beach, CA, USA, 15–20 June 2019; pp. 1002–1012. [Google Scholar]
- Wan, J.; Wang, Q.; Chan, A.B. Kernel-based density map generation for dense object counting. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 44, 1357–1370. [Google Scholar] [CrossRef]
- Sam, D.B.; Peri, S.V.; Sundararaman, M.N.; Kamath, A.; Babu, R.V. Locate, size, and count: Accurately resolving people in dense crowds via detection. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 2739–2751. [Google Scholar]
- Jiang, X.; Zhang, L.; Xu, M.; Zhang, T.; Lv, P.; Zhou, B.; Yang, X.; Pang, Y. Attention scaling for crowd counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 4706–4715. [Google Scholar]
- Liu, X.; Yang, J.; Ding, W.; Wang, T.; Wang, Z.; Xiong, J. Adaptive mixture regression network with local counting map for crowd counting. In Proceedings of the Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, 23–28 August 2020; Part XXIV 16. pp. 241–257. [Google Scholar]
- Wan, J.; Chan, A. Modeling noisy annotations for crowd counting. Adv. Neural Inf. Process. Syst. 2020, 33, 3386–3396. [Google Scholar]
- Wang, B.; Liu, H.; Samaras, D.; Nguyen, M.H. Distribution matching for crowd counting. Adv. Neural Inf. Process. Syst. 2020, 33, 1595–1607. [Google Scholar]
- Zhou, J.T.; Zhang, L.; Du, J.; Peng, X.; Fang, Z.; Xiao, Z.; Zhu, H. Locality-aware crowd counting. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 3602–3613. [Google Scholar] [CrossRef] [PubMed]
- Xu, C.; Liang, D.; Xu, Y.; Bai, S.; Zhan, W.; Bai, X.; Tomizuka, M. Autoscale: Learning to scale for crowd counting. Int. J. Comput. Vis. 2022, 130, 405–434. [Google Scholar] [CrossRef]
- Wan, J.; Liu, Z.; Chan, A.B. A generalized loss function for crowd counting and localization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 1974–1983. [Google Scholar]
- Cheng, J.; Xiong, H.; Cao, Z.; Lu, H. Decoupled two-stage crowd counting and beyond. IEEE Trans. Image Process. 2021, 30, 2862–2875. [Google Scholar] [CrossRef] [PubMed]
- Song, Q.; Wang, C.; Jiang, Z.; Wang, Y.; Tai, Y.; Wang, C.; Li, J.; Huang, F.; Wu, Y. Rethinking counting and localization in crowds: A purely point-based framework. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 3365–3374. [Google Scholar]
- Ma, Z.; Hong, X.; Wei, X.; Qiu, Y.; Gong, Y. Towards a universal model for cross-dataset crowd counting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 3205–3214. [Google Scholar]
- Shu, W.; Wan, J.; Tan, K.C.; Kwong, S.; Chan, A.B. Crowd counting in the frequency domain. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 19618–19627. [Google Scholar]
- Cheng, Z.-Q.; Dai, Q.; Li, H.; Song, J.; Wu, X.; Hauptmann, A.G. Rethinking spatial invariance of convolutional networks for object counting. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 19638–19648. [Google Scholar]
- Wang, M.; Cai, H.; Dai, Y.; Gong, M. Dynamic mixture of counter network for location-agnostic crowd counting. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA, 2–7 January 2023; pp. 167–177. [Google Scholar]
- Guo, X.; Song, K.; Gao, M.; Zhai, W.; Li, Q.; Jeon, G. Crowd counting in smart city via lightweight ghost attention pyramid network. Future Gener. Comput. Syst. 2023, 147, 328–338. [Google Scholar] [CrossRef]
- Alhawsawi, A.N.; Khan, S.D.; Ur Rehman, F.J.I. Crowd Counting in Diverse Environments Using a Deep Routing Mechanism Informed by Crowd Density Levels. Information 2024, 15, 275. [Google Scholar] [CrossRef]
- Zhang, S.; Wu, G.; Costeira, J.P.; Moura, J.M. Fcn-rlstm: Deep spatio-temporal neural networks for vehicle counting in city cameras. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 3667–3676. [Google Scholar]
- Xiong, H.; Cao, Z.; Lu, H.; Madec, S.; Liu, L.; Shen, C. TasselNetv2: In-field counting of wheat spikes with context-augmented local regression networks. Plant Methods 2019, 15, 150. [Google Scholar] [CrossRef]
- Li, Y.; Zhang, X.; Chen, D. Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 1091–1100. [Google Scholar]
Methods | SHHA | SHHB | UCF-QNRF | NWPU | ||||
---|---|---|---|---|---|---|---|---|
MAE | MSE | MAE | MSE | MAE | MSE | MAE | MSE | |
CAN [48] | 62.3 | 100.0 | 7.8 | 12.2 | 107.0 | 183.0 | 106.3 | 386.5 |
SFCN [49] | 64.8 | 107.5 | 7.6 | 13.0 | 102.0 | 171.4 | 105.7 | 424.1 |
S-DCNet [50] | 58.3 | 95.0 | 6.7 | 10.7 | 104.4 | 176.1 | - | - |
BL [51] | 62.8 | 101.8 | 7.7 | 12.7 | 88.7 | 154.8 | 105.4 | 454.2 |
MBTTBF [52] | 60.2 | 94.1 | 8.0 | 15.5 | 97.5 | 165.2 | - | - |
KDMG [53] | 63.8 | 99.2 | 7.8 | 12.7 | 99.5 | 173.0 | 100.5 | 415.5 |
LSCCNN [54] | 66.5 | 101.8 | 7.7 | 12.7 | 120.5 | 218.2 | - | - |
ASNet [55] | 57.8 | 90.1 | - | - | 91.6 | 159.7 | - | - |
AMRNet [56] | 61.5 | 98.3 | 7.0 | 11.0 | 86.6 | 152.2 | - | - |
NoiseCC [57] | 61.9 | 99.6 | 7.4 | 11.3 | 85.8 | 150.6 | 96.9 | 534.2 |
DM-Count [58] | 59.7 | 95.7 | 7.4 | 11.8 | 85.6 | 148.3 | 88.4 | 388.6 |
LB-Batch [59] | 65.8 | 103.6 | 8.6 | 13.9 | 113.0 | 210.0 | - | - |
AutoScale [60] | 65.8 | 112.1 | 8.6 | 13.9 | 104.4 | 174.2 | 94.1 | 388.2 |
GL [61] | 61.3 | 95.4 | 7.3 | 11.7 | 84.3 | 147.5 | 79.3 | 346.1 |
D2CNet [62] | 57.2 | 93.0 | 6.3 | 10.7 | 81.7 | 137.9 | 85.5 | 361.5 |
P2PNet [63] | 52.7 | 85.1 | 6.3 | 9.9 | 85.3 | 154.5 | 77.4 | 362.0 |
SDA+DM [64] | 55.0 | 92.7 | - | - | 80.7 | 146.3 | - | - |
Chfl [65] | 57.5 | 94.3 | 6.9 | 11.0 | 80.3 | 137.6 | 76.8 | 343.0 |
RSI-ResNet50 [66] | 54.8 | 89.1 | 6.2 | 9.9 | 81.6 | 153.7 | - | - |
DMCNet [67] | 58.5 | 84.6 | 8.6 | 13.7 | 96.5 | 164.0 | - | - |
GAPNet [68] | 67.1 | 110.4 | 9.8 | 15.2 | 118.5 | 217.2 | 174.1 | 514.7 |
DRMICrowd [69] | 57.7 | 97.5 | - | - | 97.2 | 156.4 | - | - |
MRSNet (ours) | 54.2 | 88.5 | 6.3 | 9.7 | 78.5 | 130.4 | 69.3 | 319.7 |
Methods | TRANCOS | MTC | ||
---|---|---|---|---|
MAE | MSE | MAE | MSE | |
FCN-HA [70] | 4.2 | - | - | - |
TasselNetv2 [71] | - | - | 5.4 | 8.8 |
S-DCNet [50] | 2.9 | - | 5.6 | 9.1 |
CSRNet [72] | 3.6 | - | 9.4 | 14.4 |
RSI-ResNet [66] | 2.1 | 2.6 | 3.1 | 4.3 |
MRSNet (ours) | 1.7 | 3.0 | 2.6 | 3.7 |
Imbedding | Val Set | Test Set | |||
---|---|---|---|---|---|
Block1 | Block2 | MAE↓ | MSE↓ | MAE↓ | MSE↓ |
× | × | 8.3 | 12.8 | 11.9 | 16.6 |
√ | × | 7.8 | 12.1 | 7.8 | 12.0 |
× | √ | 7.6 | 11.3 | 9.7 | 13.4 |
√ | √ | 6.8 | 10.1 | 6.3 | 9.7 |
Methods | Val Set | Test Set | ||
---|---|---|---|---|
MAE↓ | MSE↓ | MAE↓ | MSE↓ | |
PGO | 8.5 | 12.4 | 9.8 | 14.6 |
POG | 37.4 | 58.2 | 38.2 | 56.8 |
OPG | 20.7 | 17.6 | 18.7 | 17.4 |
OGP | 7.6 | 11.3 | 9.7 | 13.4 |
GPO | 9.0 | 15.8 | 10.5 | 15.5 |
GOP | 6.8 | 10.1 | 6.3 | 9.7 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, Y.; Song, W.; Shao, M.; Liu, X. MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network. Sensors 2024, 24, 5974. https://doi.org/10.3390/s24185974
Zhang Y, Song W, Shao M, Liu X. MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network. Sensors. 2024; 24(18):5974. https://doi.org/10.3390/s24185974
Chicago/Turabian StyleZhang, Yi, Wei Song, Mingyue Shao, and Xiangchun Liu. 2024. "MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network" Sensors 24, no. 18: 5974. https://doi.org/10.3390/s24185974
APA StyleZhang, Y., Song, W., Shao, M., & Liu, X. (2024). MRSNet: Multi-Resolution Scale Feature Fusion-Based Universal Density Counting Network. Sensors, 24(18), 5974. https://doi.org/10.3390/s24185974