DARE-YOLO: A Lightweight Object Detection Algorithm and Its FPGA Acceleration for Sustainable PV Panel Inspection
Abstract
1. Introduction
- The DARE-YOLO model employs RepConv to construct a hardware-friendly single-path backbone network, and integrates a Dilated Context Block (DCB) and a Dual-scale Decoupled Head (DDH) to achieve high-precision photovoltaic panel defect detection while reducing the number of model parameters.
- Under the on-chip resource constraints of Zynq, we design a custom convolution IP core, which pushes the hardware throughput of single-path forward inference toward the physical limit.
- We adopt a mixed fixed-point quantization strategy on the Zynq-7020 platform to improve data transfer efficiency and reduce on-chip resource usage, thereby providing practical support for PV defect detection deployment.
2. DARE-YOLO Algorithm Architecture Design
2.1. Mathematical Formalization of Hardware and Software Co-Design
2.2. Overall Architecture of DARE-YOLO
2.3. RepConv-Based Backbone
2.4. Dilated Context Block
2.5. Dual-Scale Decoupled Head
3. Zynq-Based Hardware Accelerator and System Design
3.1. Overall Accelerator Architecture and Dataflow Scheduling
3.2. Hardware–Software Co-Designed Quantization Strategy
3.2.1. Batch Normalization Fusion
3.2.2. Mixed Fixed-Point Strategy for the PL
3.3. Design of the Custom Convolution IP Core
4. Experiments and Results Analysis
4.1. Photovoltaic Dataset and Experimental Deployment Environment
4.1.1. Photovoltaic Panel Defect Dataset and Software Platform for Model Training
4.1.2. Edge Hardware Acceleration Deployment Platform
4.2. Ablation Study
4.3. Comprehensive Comparison with Mainstream YOLO Series
4.4. FPGA Resource Utilization and Power Analysis
4.5. Comparison Across Different Hardware Platforms
4.6. Performance Comparison with Existing Advanced Hardware Acceleration Schemes
4.7. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| DARE | Dilated Aggregation and Reparameterized Edge |
| DCB | Dilated Context Block |
| DDH | Dual-scale Decoupled Head |
| CNNs | convolutional neural networks |
| cls | classification |
| reg | regression |
| BN | Batch Normalization |
| ICNN | Improved Convolutional Neural Network Algorithm |
| PV | photovoltaic |
| PL | programmable logic |
| PS | processing system |
| RAM | Random Access Memory |
| BRAM | block random access memory |
| DSPs | digital signal processors |
| FFs | flip-flops |
| LUTs | lookup tables |
References
- Wang, J.; Bi, L.; Sun, P.; Jiao, X.; Ma, X.; Lei, X.; Luo, Y. Deep-learning-based automatic detection of photovoltaic cell defects in electroluminescence images. Sensors 2022, 23, 297. [Google Scholar] [CrossRef] [PubMed]
- Karakan, A. Detection of Defective Solar Panel Cells in Electroluminescence Images with Deep Learning. Sustainability 2025, 17, 1141. [Google Scholar] [CrossRef]
- Lu, H.; Huang, X.; Shi, H.; He, H. Defect detection of photovoltaic panels based on deep learning. In Proceedings of the 2023 3rd International Conference on Neural Networks, Information and Communication Engineering (NNICE); IEEE: New York, NY, USA, 2023; pp. 99–103. [Google Scholar]
- Islam, M.; Rashel, M.R.; Ahmed, M.T.; Islam, A.K.M.K.; Tlemçani, M. Artificial intelligence in photovoltaic fault identification and diagnosis: A systematic review. Energies 2023, 16, 7417. [Google Scholar] [CrossRef]
- Su, B.; Chen, H.; Chen, P.; Bian, G.-B.; Liu, K.; Liu, W. Deep learning-based solar-cell manufacturing defect detection with complementary attention network. IEEE Trans. Ind. Inform. 2020, 17, 4084–4095. [Google Scholar] [CrossRef]
- Kuznetsov, P.; Kotelnikov, D.; Yuferev, L.; Panchenko, V.; Bolshev, V.; Jasiński, M.; Flah, A. Method for the automated inspection of the surfaces of photovoltaic modules. Sustainability 2022, 14, 11930. [Google Scholar] [CrossRef]
- Fan, Z.; Yan, Z.; Wen, S. Deep learning and artificial intelligence in sustainability: A review of SDGs, renewable energy, and environmental health. Sustainability 2023, 15, 13493. [Google Scholar] [CrossRef]
- Alhazmi, A.; Kholoud, M.; Eke, C.I. A Systematic Review of Advances in Deep Learning Architectures for Efficient and Sustainable Photovoltaic Solar Tracking: Research Challenges and Future Directions. Sustainability 2025, 17, 9625. [Google Scholar] [CrossRef]
- Czimmermann, T.; Ciuti, G.; Milazzo, M.; Chiurazzi, M.; Roccella, S.; Oddo, C.M.; Dario, P. Visual-based defect detection and classification approaches for industrial applications—A survey. Sensors 2020, 20, 1459. [Google Scholar] [CrossRef]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Parhi, K.K. VLSI Digital Signal Processing Systems: Design and Implementation; John Wiley & Sons: Hoboken, NJ, USA, 2007. [Google Scholar]
- Guo, K.; Zeng, S.; Yu, J.; Wang, Y.; Yang, H. A survey of FPGA-based neural network inference accelerators. ACM Trans. Reconfigurable Technol. Syst. 2019, 12, 2. [Google Scholar] [CrossRef]
- Tang, J.; Liu, S.; Zhao, D.; Tang, L.; Zou, W.; Zheng, B. PCB-YOLO: An improved detection algorithm of PCB surface defects based on YOLOv5. Sustainability 2023, 15, 5963. [Google Scholar] [CrossRef]
- Jiang, P.; Ergu, D.; Liu, F.; Cai, Y.; Ma, B. A review of YOLO algorithm developments. Procedia Comput. Sci. 2022, 199, 1066–1073. [Google Scholar] [CrossRef]
- Wang, C.Y.; Yeh, I.H.; Mark Liao, H.Y. YOLOv9: Learning what you want to learn using programmable gradient information. In Proceedings of the European Conference on Computer Vision; Springer: Cham, Switzerland, 2024; pp. 1–21. [Google Scholar]
- Liang, Q.; Tian, W.; Han, S.; Sun, J.; Jia, M.; Chi, J. ACN-YOLO: Solar cell surface defect detection based on improved YOLOv8s. AIP Adv. 2025, 15, 105132. [Google Scholar] [CrossRef]
- Li, W.; Li, J.; Cao, B.; Zhu, J.; Tian, M. LW-YOLO: A Lightweight Network for Defect Detection in Photovoltaic Modules. In Proceedings of the 2024 4th International Conference on Robotics, Automation and Intelligent Control (ICRAIC); IEEE: New York, NY, USA, 2024; pp. 538–543. [Google Scholar]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2023; pp. 7464–7475. [Google Scholar]
- Tong, K.; Wu, Y.; Zhou, F. Recent advances in small object detection based on deep learning: A review. Image Vis. Comput. 2020, 97, 103910. [Google Scholar] [CrossRef]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision; IEEE: New York, NY, USA, 2017; pp. 2980–2988. [Google Scholar]
- Zheng, Z.; Wang, P.; Ren, D.; Liu, W.; Ye, R.; Hu, Q.; Zuo, W. Enhancing geometric factors in model learning and inference for object detection and instance segmentation. IEEE Trans. Cybern. 2021, 52, 8574–8586. [Google Scholar] [CrossRef] [PubMed]
- Qiao, S.; Chen, L.C.; Yuille, A. Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2021; pp. 10213–10224. [Google Scholar]
- Ding, X.; Zhang, X.; Ma, N.; Han, J.; Ding, G.; Sun, J. Repvgg: Making vgg-style convnets great again. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2021; pp. 13733–13742. [Google Scholar]
- Qiu, S.; Yang, C.H.; Wu, L.; Song, W.; Pan, J.-Z. Improved Reparameterization You-Only-Look-Once v5 Model for Strip-Steel Surface Defect Detection. Sens. Mater. 2024, 36, 4881–4902. [Google Scholar] [CrossRef]
- Hou, Q.; Zhou, D.; Feng, J. Coordinate attention for efficient mobile network design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2021; pp. 13713–13722. [Google Scholar]
- Ding, X.; Zhang, X.; Han, J.; Ding, G. Scaling up your kernels to 31 × 31: Revisiting large kernel design in CNNs. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2022; pp. 11963–11975. [Google Scholar]
- Saidani, T.; Ghodhbani, R.; Alhomoud, A.; Alshammari, A.; Zayani, H.; Ben Ammar, M. Hardware acceleration for object detection using yolov5 deep learning algorithm on xilinx zynq fpga platform. Eng. Technol. Appl. Sci. Res. 2024, 14, 13066–13071. [Google Scholar] [CrossRef]
- Wang, Z.; Xu, K.; Wu, S.; Liu, L.; Liu, L.; Wang, D. Sparse-YOLO: Hardware/software co-design of an FPGA accelerator for YOLOv2. IEEE Access 2020, 8, 116569–116585. [Google Scholar] [CrossRef]
- Mani, P.; Komarasamy, P.R.G.; Rajamanickam, N.; Shorfuzzaman, M.; Abdelfattah, W.M. Enhancing sustainable transportation infrastructure management: A high-accuracy, FPGA-based system for emergency vehicle classification. Sustainability 2024, 16, 6917. [Google Scholar] [CrossRef]
- Ponzina, F.; Machetti, S.; Rios, M.; Denkinger, B.W.; Levisse, A.; Ansaloni, G.; Peón-Quirós, M.; Atienza, D. A hardware/software co-design vision for deep learning at the edge. IEEE Micro 2022, 42, 48–54. [Google Scholar] [CrossRef]
- Kim, H.; Kim, T.K. Design and Implementation of a YOLOv2 Accelerator on a Zynq-7000 FPGA. Sensors 2025, 25, 6359. [Google Scholar] [CrossRef] [PubMed]
- Xu, G.; Zhao, W.; Ren, Z.; Denkinger, B.W.; Levisse, A.; Ansaloni, G.; Peon-Quiros, M.; Atienza, D. Design and Implementation of the High-Performance YOLO Accelerator Based on Zynq FPGA. In Proceedings of the 2024 3rd International Conference on Electronics and Information Technology (EIT); IEEE: New York, NY, USA, 2024; pp. 246–250. [Google Scholar]
- Lv, Q.; Yu, X.; Liu, Y.; Hu, T.; Wu, H.; Wang, A. Design and implementation of object detection system based on Zynq hardware acceleration. In Proceedings of the International Conference on Measurement, Communication, and Virtual Reality (MCVR 2024); SPIE: San Francisco, CA, USA, 2025; Volume 13634, pp. 229–239. [Google Scholar]
- Wang, J.; Gu, S. Fpga implementation of object detection accelerator based on vitis-ai. In Proceedings of the 2021 11th International Conference on Information Science and Technology (ICIST); IEEE: New York, NY, USA, 2021; pp. 571–577. [Google Scholar]
- Li, S.; Yu, C.; Xie, T.; Feng, W. A power-efficient optimizing framework FPGA accelerator for YOLO. In Proceedings of the 2022 15th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI); IEEE: New York, NY, USA, 2022; pp. 1–6. [Google Scholar]
- Tan, A.; Yang, Y.; Duan, S.; Wang, L. Fpga-based convolution accelerator and memristor ip core for cooperative acceleration of the yolo network. In Proceedings of the 2024 4th International Conference on Computer Science, Electronic Information Engineering and Intelligent Control Technology (CEI); IEEE: New York, NY, USA, 2024; pp. 54–57. [Google Scholar]
- Ye, J.; Liu, Y.; Chen, H.; Wang, C.; Zhou, Y.; Yang, L.; Zhang, W. Edge computing accelerator for real-time defect detection of photovoltaic panel on lightweight FPGAs. IEEE Trans. Instrum. Meas. 2025, 74, 3001815. [Google Scholar] [CrossRef]
- Zhao, J.; Xiao, H.; Zhi, M.; Su, W. An Improved Convolutional Neural Network Algorithm (ICNN) with Channel Attention Mechanism and Hardware Acceleration Concept for Solar Panel Dust Detection. IEEE Access 2025, 13, 189814–189832. [Google Scholar] [CrossRef]
- Calì, R.; Falaschetti, L.; Biagetti, G. Optimized implementation of YOLOv3-tiny for real-time image and video recognition on FPGA. Electronics 2025, 14, 3993. [Google Scholar] [CrossRef]
- Fang, N.; Li, L.; Zhou, X.; Zhang, W.; Chen, F. An FPGA-based hybrid overlapping acceleration architecture for small-target remote sensing detection. Remote Sens. 2025, 17, 494. [Google Scholar] [CrossRef]












| Network Stage (Output) | DARE-YOLO | YOLOv6N/YOLOv6S |
|---|---|---|
| Stem | 1 RepConv | 1 RepConv |
| Layer1 (P2) | 2 RepConv | 3 RepConv |
| Layer2 (P3) | 4 RepConv | 5 RepConv |
| Layer3 (P4) | 4 RepConv | 7 RepConv |
| Layer4 (P5) | 0 (Truncated) | 3 RepConv |
| Total Blocks | 11 | 19 |
| DARE-YOLO | Weight Bit-Width | Feature Map Bit-Width | Accumulator Bit-Width | mAP@0.5 (%) | Accuracy Drop (%) |
|---|---|---|---|---|---|
| Full-Precision Baseline | FP32 | FP32 | FP32 | 95.2 | Baseline |
| Conventional Quantization | INT8 | INT8 | INT8 | 90.7 | 4.5 |
| Proposed Mixed Fixed-Point Strategy | INT8 | INT8 | INT32 | 93.84 | 1.36 |
| Component | Specification/Version |
|---|---|
| Operating System (OS) | Windows 11 64-bit |
| Central Processing Unit (CPU) | Intel Core i9-13980 |
| Graphics Processing Unit (GPU) | NVIDIA GeForce RTX 4070 Laptop GPU |
| Random Access Memory (RAM) | 16 GB DDR5 |
| Deep Learning Framework | PyTorch 2.6.0 |
| Parallel Computing Platform | CUDA 12.6 |
| Programming Language | Python 3.13.5 |
| Model | mAP@0.5 (%) | Params (M) | GFLOPs (G) |
|---|---|---|---|
| Baseline | 84.12 | 5.5 | 21.6 |
| Rep-YOLO | 87.8 | 5.5 | 21.6 |
| DCB-Rep-YOLO | 90.28 | 6.1 | 23.9 |
| DARE-YOLO | 93.84 | 6.4 | 24.8 |
| Model | Input Size | mAP@0.5 (%) | Params (M) | GFLOPs (G) |
|---|---|---|---|---|
| YOLOv5s | 640 × 640 | 86.4 | 7.0 | 16.5 |
| YOLOv8s | 640 × 640 | 89.3 | 11.1 | 28.6 |
| YOLOv10s | 640 × 640 | 91.6 | 7.2 | 21.6 |
| YOLO11s | 640 × 640 | 92.7 | 9.3 | 21.5 |
| DARE-YOLO | 640 × 640 | 93.84 | 6.4 | 24.8 |
| Module | LUT | FF | BRAM | DSP |
|---|---|---|---|---|
| Available on board | 53,200 | 106,400 | 140 | 220 |
| Conv Accelerators | 28,450 (53.5%) | 34,200 (32.1%) | 114 (81.4%) | 192 (87.3%) |
| Axi Interconnect | 3120 (5.9%) | 4250 (4.0%) | 4 (2.9%) | 0 |
| Miscellaneous | 237 (0.4%) | 405 (0.3%) | 0 | 0 |
| Total utilization | 31,807 (59.8%) | 38,855 (36.5%) | 118 (84.3%) | 192 (87.3%) |
| Platform | Power (W) | Throughputs (GOPs) | Energy Efficiency (GOPS/W) | Latency (ms) |
|---|---|---|---|---|
| Intel i9-13980 | 103.4 | 617.0 | 5.97 | 40.2 |
| RTX 4070 | 165.9 | 2137.4 | 12.88 | 11.6 |
| Zynq-7020 | 1.95 | 37.5 | 19.23 | 661.3 |
| [38] | [39] | [40] | [41] | This Work | |
|---|---|---|---|---|---|
| Year | 2025 | 2025 | 2025 | 2025 | 2026 |
| Platform | Zynq-7020 | ZU9EG | Zynq-7020 | Zynq-7020 | Zynq-7020 |
| Board cost | $150 | $1500 | $150 | $150 | $150 |
| Frequency (MHz) | 200 | - | 200 | 100 | 200 |
| Network | YOLOv4-Tiny | ICNN | YOLOv3-Tiny | YOLOv5s | DARE-YOLO |
| BRAM | 94 | 375 | 138 | 107 | 118 |
| DSP | 220 | 148 | 204 | 205 | 192 |
| LUT | 44.4 k | 25.1 k | 41.6 k | 46 k | 31.8 k |
| Throughput (GOPS) | 24.25 | - | 64.1 | 10.2 | 37.5 |
| Energy efficiency (GOPS/W) | 5.29 | - | 25.1 | 3.64 | 19.23 |
| Latency (ms) | 272 | 87 | 6.07 | 477 | 661.3 |
| Power (W) | 4.58 | 1.2 | 2.55 | 2.8 | 1.95 |
| mAP@0.5 (%) | 62.73 | - | 17.68 | 93.2 | 93.84 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yang, Y.; Xing, F.; Qin, C.; Chen, S.; Shin, H.; Lee, S. DARE-YOLO: A Lightweight Object Detection Algorithm and Its FPGA Acceleration for Sustainable PV Panel Inspection. Sustainability 2026, 18, 4999. https://doi.org/10.3390/su18104999
Yang Y, Xing F, Qin C, Chen S, Shin H, Lee S. DARE-YOLO: A Lightweight Object Detection Algorithm and Its FPGA Acceleration for Sustainable PV Panel Inspection. Sustainability. 2026; 18(10):4999. https://doi.org/10.3390/su18104999
Chicago/Turabian StyleYang, Yuchuan, Feng Xing, Caiyan Qin, Shuxu Chen, Hyundong Shin, and Sungyoung Lee. 2026. "DARE-YOLO: A Lightweight Object Detection Algorithm and Its FPGA Acceleration for Sustainable PV Panel Inspection" Sustainability 18, no. 10: 4999. https://doi.org/10.3390/su18104999
APA StyleYang, Y., Xing, F., Qin, C., Chen, S., Shin, H., & Lee, S. (2026). DARE-YOLO: A Lightweight Object Detection Algorithm and Its FPGA Acceleration for Sustainable PV Panel Inspection. Sustainability, 18(10), 4999. https://doi.org/10.3390/su18104999

