Point Cloud Quality Assessment via Complexity-Driven Patch Sampling and Attention-Enhanced Swin-Transformer
Abstract
1. Introduction
- Considering that the quality-related information density varies significantly across local patches in PC projection maps and that high-complexity patches contain richer distortion-sensitive cues, a complexity-driven patch sampling strategy is designed to preferentially select high-information-density patches and thereby enhance subsequent distortion-sensitive feature representation.
- Considering that the indistinct response strengths among critical and redundant channels may weaken the representation capability during feature extraction, an Attention-Enhanced Swin-Transformer is proposed to adaptively highlight informative channels and thereby enhance feature extraction performance.
- To address the inherent limitations of traditional linear regression heads in handling diverse distortion patterns, a gated regression head is constructed, which integrates global semantic features with channel statistical descriptors. Through a statistics-driven gating mechanism using channel-wise mean and standard deviation, the proposed model can adaptively balance the contributions of the two feature types, thereby improving prediction robustness and generalization.
2. Motivation
3. Proposed Method
3.1. Complexity-Driven Patch Sampling
3.2. Attention-Enhanced Swin-Transformer
3.3. Gated Regression Head
4. Experiments
4.1. PC Dataset
4.2. Experimental Settings
4.3. Results and Analysis
4.4. Ablation Study
4.5. Complexity Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| PCQA | PC Quality Assessment |
| NR-PCQA | No-Reference PC Quality Assessment |
| QMM | Quality Mapping Module |
| SJTU-PCQA | Shanghai Jiao Tong University PC Quality Assessment Database |
| WPC | Waterloo Point Cloud |
References
- Zhang, Y.; Yang, Q.; Zhou, Y.; Xu, X.; Yang, L.; Xu, Y. TCDM: Transformational Complexity Based Distortion Metric for Perceptual Point Cloud Quality Assessment. IEEE Trans. Vis. Comput. Graph. 2024, 30, 6707–6724. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z.; Wu, H.; Zhou, Y.; Li, C.; Sun, W.; Chen, C.; Min, X.; Liu, X.; Lin, W.; Zhai, G. LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM. arXiv 2024, arXiv:2404.18203v2. [Google Scholar] [CrossRef]
- Duan, H.; Fu, K.; Wu, S.; Li, Y.; Zhang, Z.; Hu, Q.; Min, X.; Zhai, G. BMPCQA: Bioinspired Metaverse Point Cloud Quality Assessment Based on Large Multimodal Models. Adv. Intell. Syst. 2025, 2500504. [Google Scholar] [CrossRef]
- Zhang, Z.; Sun, W.; Min, X.; Wang, T.; Lu, W.; Zhai, G. No-Reference Quality Assessment for 3D Colored Point Cloud and Mesh Models. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 7618–7631. [Google Scholar] [CrossRef]
- Yang, Q.; Ma, Z.; Xu, Y.; Li, Z.; Sun, J. Inferring Point Cloud Quality via Graph Similarity. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 44, 3015–3029. [Google Scholar] [CrossRef]
- Zhou, W.; Yang, Q.; Jiang, Q.; Zhai, G.; Lin, W. Blind Quality Assessment of 3D Dense Point Clouds with Structure Guided Resampling (SGR-PCQA). arXiv 2022, arXiv:2208.14603. [Google Scholar]
- Liu, Q.; Su, H.; Duanmu, Z.; Liu, W.; Wang, Z. PQA-Net: Deep No Reference PC Quality Assessment via Multi-View Projection. IEEE Trans. Circuits Syst. Video Technol. 2021, 31, 4645–4660. [Google Scholar] [CrossRef]
- Yang, Q.; Liu, Y.; Chen, S.; Xu, Y.; Sun, J. No-Reference Point Cloud Quality Assessment via Domain Adaptation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022; pp. 21147–21156. [Google Scholar] [CrossRef]
- Shan, Z.; Zhang, Y.; Yang, Q.; Yang, H.; Xu, Y.; Liu, S. PAME: Self-Supervised Masked Autoencoder for No-Reference Point Cloud Quality Assessment. arXiv 2024, arXiv:2403.10061. [Google Scholar]
- Chen, T.; Liu, Q.; Zhang, Y.; Xu, Y.; Tang, R.; Sun, J. Layered Projection-Based Quality Assessment of 3D Point Clouds. IEEE Access 2021, 9, 88108–88120. [Google Scholar] [CrossRef]
- Zhang, Z.; Zhou, Y.; Sun, W.; Min, X.; Zhai, G. Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment. arXiv 2023, arXiv:2310.17147. [Google Scholar] [CrossRef]
- Chai, X.; Shao, F.; Mu, B.; Chen, H.; Jiang, Q.; Ho, Y.-S. Plain-PCQA: No-Reference Point Cloud Quality Assessment by Analysis of Plain Visual and Geometrical Components. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 6207–6223. [Google Scholar] [CrossRef]
- Wang, Z.; Zhang, Y.; Yang, Q.; Xu, Y.; Sun, J.; Liu, S. Point Cloud Quality Assessment using 3D Saliency Maps. In Proceedings of the 2023 IEEE International Conference on Visual Communications and Image Processing (VCIP), Jeju, Republic of Korea, 4–7 December 2023; pp. 1–5. [Google Scholar] [CrossRef]
- Zhang, L.; Zhu, L.; Ma, K. GMS-3DQA: Geometry–Texture Joint Modeling for No-Reference Point Clouds Quality Assessment. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 3775–3788. [Google Scholar]
- Yang, Q.; Chen, H.; Ma, Z.; Xu, Y.; Tang, R.; Sun, J. Predicting the Perceptual Quality of Point Cloud: A 3D-to-2D Projection-Based Exploration. IEEE Trans. Multimed. 2021, 23, 3877–3891. [Google Scholar] [CrossRef]
- Liu, Q.; Su, H.; Duanmu, Z.; Liu, W.; Wang, Z. Perceptual Quality Assessment of Colored 3D Point Clouds. IEEE Trans. Vis. Comput. Graph. 2023, 29, 3642–3655. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Z.; Sun, W.; Zhu, Y.; Min, X.; Wu, W.; Chen, Y.; Zhai, G. Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric. arXiv 2023, arXiv:2208.14085v3. [Google Scholar] [CrossRef]
- Zhang, Z.; Sun, W.; Min, X.; Wang, Q.; He, J.; Zhou, Q.; Zhai, G. MM-PCQA: Multi-Modal Learning for No-Reference Point Clouds Quality Assessment. In Proceedings of the IJCAI, Macao, China, 19–25 August 2023; pp. 1759–1767. [Google Scholar]
- Zhang, Z.; Wu, W.; Min, X.; Zhu, Y.; Zhai, G.; Sun, W. Optimizing Projection-Based Point Cloud Quality Assessment with Human Preferred Viewpoints Selection. In Proceedings of the 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15–19 July 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Chen, W.; Wu, Q.; Zhou, W.; Shao, F.; Zhai, G.; Lin, W. No-Reference Point Cloud Quality Assessment via Graph Convolutional Network. IEEE Trans. Multimed. 2025, 27, 2489–2502. [Google Scholar] [CrossRef]




| Type | Description/Method | Effect |
|---|---|---|
| OT | Octree-based compression via MPEG PCC. | Geometry |
| CN | Gaussian noise on RGB channels. | Color |
| DS | Uniform down-sampling of points. | Geometry |
| DS + CN | DS followed by RGB Gaussian noise. | Geometry + Color |
| DS + CGN | DS followed by Gaussian geometry noise. | Hybrid Geometry |
| CGN | Gaussian noise on point coordinates. | Geometry |
| BN | Random brightness changes on RGB. | Color |
| CN + BN | Gaussian color + brightness noise. | Color Compound |
| OT + DS | DS after octree compression. | Geometry Compound |
| Model | SROCC | PLCC | KROCC | RMSE |
|---|---|---|---|---|
| PQA-Net [7] | 0.8500 | 0.8200 | – | – |
| IT-PCQA [8] | 0.5800 | 0.6300 | – | – |
| VQA-PC [17] | 0.8509 | 0.8635 | 0.6585 | 1.1334 |
| GMS-3DQA [14] | 0.9108 | 0.9177 | 0.7735 | 0.7872 |
| 3D-NSS [4] | 0.7382 | 0.7144 | 0.5174 | 1.7686 |
| MM-PCQA [18] | 0.8998 | 0.9202 | 0.7677 | 0.8801 |
| VPI-PCQA [19] | 0.9041 | 0.9155 | – | 0.9263 |
| GC-PCQA [20] | 0.9108 | 0.9301 | 0.7546 | 0.8691 |
| Proposed | 0.9203 | 0.9370 | 0.7753 | 0.8065 |
| Model | SROCC | PLCC | KROCC | RMSE |
|---|---|---|---|---|
| PQA-NET [7] | 0.7000 | 0.6900 | 0.5100 | 15.1800 |
| IT-PCQA [8] | 0.5500 | 0.5400 | - | - |
| 3D-NSS [4] | 0.6514 | 0.6479 | 0.4417 | 16.5745 |
| Proposed | 0.7689 | 0.7647 | 0.5817 | 14.7151 |
| Variant | SROCC | PLCC | KROCC | RMSE |
|---|---|---|---|---|
| w/o CDPS | 0.9188 | 0.9302 | 0.7702 | 0.8129 |
| w/o AEST | 0.9132 | 0.9257 | 0.7684 | 0.8352 |
| w/o GRH | 0.9160 | 0.9146 | 0.7704 | 0.8286 |
| Proposed | 0.9203 | 0.9370 | 0.7753 | 0.8065 |
| Metric | Value |
|---|---|
| Model File Size | 317.0 MB |
| GPU Memory Usage | 6.0 GB |
| Training Time | 7613.9 s |
| Testing Time | 8.5 s |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Shen, X.; Li, Q.; Tu, R.; Bai, Y.; Ge, D.; Zhu, Z. Point Cloud Quality Assessment via Complexity-Driven Patch Sampling and Attention-Enhanced Swin-Transformer. Information 2026, 17, 93. https://doi.org/10.3390/info17010093
Shen X, Li Q, Tu R, Bai Y, Ge D, Zhu Z. Point Cloud Quality Assessment via Complexity-Driven Patch Sampling and Attention-Enhanced Swin-Transformer. Information. 2026; 17(1):93. https://doi.org/10.3390/info17010093
Chicago/Turabian StyleShen, Xilei, Qiqi Li, Renwei Tu, Yongqiang Bai, Di Ge, and Zhongjie Zhu. 2026. "Point Cloud Quality Assessment via Complexity-Driven Patch Sampling and Attention-Enhanced Swin-Transformer" Information 17, no. 1: 93. https://doi.org/10.3390/info17010093
APA StyleShen, X., Li, Q., Tu, R., Bai, Y., Ge, D., & Zhu, Z. (2026). Point Cloud Quality Assessment via Complexity-Driven Patch Sampling and Attention-Enhanced Swin-Transformer. Information, 17(1), 93. https://doi.org/10.3390/info17010093

