GraphGPT-Patent: Time-Aware Graph Foundation Modeling on Semantic Similarity Document Graphs for Grant-Time Economic Impact Prediction
Abstract
1. Introduction
- We introduce a leakage-free benchmark protocol for grant-time impact prediction on large-scale patent document graphs, spanning three CPC domains and three evaluation horizons.
- We propose GraphGPT-Patent, a graph foundation adaptation method based on reversible graph-to-sequence serialization, augmented with time- and domain-conditioned edge reliability modeling for semantic graphs.
- We design a joint classification–ranking objective to support high-recall detection and stable within-domain prioritization under delayed, long-tailed supervision signals.
- We provide explainable graph evidence via subgraph attributions and structural diagnostics, enabling auditable model analysis across domains and time.
2. Related Work
2.1. Document Impact Prediction and Delayed Supervision
2.2. Semantic Similarity Graphs and Document Graph Learning
2.3. Graph Foundation Models and Graph-to-Sequence Transformers
2.4. Robust Graph Learning: Edge Noise, Denoising, Temporal/Domain Shift
2.5. Explainable Graph Learning and Subgraph-Based Explanations
3. Data, Document Graph Construction, and Evaluation Protocol
3.1. CPC Domains for Cross-Domain Evaluation
3.2. High-Impact Labels and Evaluation Windows
3.3. Semantic Similarity Network Construction
3.4. Scalability and Graph Construction Complexity
3.5. Training and Testing Split
4. Method
4.1. GraphGPT-Patent
4.2. Time- and Domain-Conditioned Edge Reliability Modeling
4.3. Joint Objective of Impact Ranking Consistency
4.4. Explanatory Subgraphs and Structural Diagnostics
5. Experiments and Results
5.1. Metrics and Baselines
5.2. Main Results: System Performance Across Nine Settings
Ablation of Edge Reliability and Joint Ranking
5.3. Temporal Drift: Systematic Impact of Training Windows on Performance
5.4. Explainability Diagnostics: Structural Differences in Explanatory Subgraphs
6. Discussion, Conclusions, and Future Directions
6.1. Discussion
6.2. Implications for Leakage-Free Impact Prediction
6.3. Conclusions
6.4. Future Directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Supplementary Quantitative Analysis
Appendix A.1. Decomposition of Prediction Window Difficulty
| Model | 3y Avg | 5y Avg | 10y Avg | 5y − 3y | 10y − 3y |
|---|---|---|---|---|---|
| Doc2Vec-MLP | 0.710 | 0.827 | 0.840 | 0.117 | 0.130 |
| PatentBERT-MLP | 0.707 | 0.813 | 0.817 | 0.107 | 0.110 |
| Doc2Vec-GCN | 0.763 | 0.853 | 0.867 | 0.090 | 0.103 |
| Doc2Vec-GTN | 0.657 | 0.850 | 0.890 | 0.193 | 0.233 |
| Doc2Vec-GSAGE | 0.703 | 0.883 | 0.900 | 0.180 | 0.197 |
| PatentBERT-GCN | 0.690 | 0.797 | 0.807 | 0.107 | 0.117 |
| PatentBERT-GTN | 0.727 | 0.863 | 0.880 | 0.137 | 0.153 |
| PatentBERT-GSAGE | 0.667 | 0.897 | 0.880 | 0.230 | 0.213 |
| GraphGPT-Patent (Ours) | 0.767 | 0.917 | 0.917 | 0.150 | 0.150 |
Appendix A.2. Effect Size of Temporal Drift Gains
| Model | ΔAcc | ΔPr | ΔRe | ΔF1 |
|---|---|---|---|---|
| Doc2Vec-GCN | 0.07 | 0.02 | 0.20 | 0.10 |
| Doc2Vec-GSAGE | 0.06 | −0.02 | 0.21 | 0.08 |
| Doc2Vec-GTN | 0.04 | −0.04 | 0.23 | 0.08 |
| PatentBERT-GCN | 0.06 | 0.00 | 0.18 | 0.08 |
| PatentBERT-GSAGE | 0.06 | −0.01 | 0.19 | 0.09 |
| PatentBERT-GTN | 0.06 | −0.01 | 0.17 | 0.08 |
| GraphGPT-Patent (Ours) | 0.04 | 0.01 | 0.10 | 0.06 |
Appendix A.3. Effect Size of Explanatory Structure Differences
| CPC | ΔDensity | ΔAvg Degree | ΔClustering Coeff |
|---|---|---|---|
| A61 | −0.093 | −0.527 | 0.037 |
| H04 | 0.035 | 5.394 | 0.129 |
| G06 | 0.000 | 5.158 | 0.147 |
Appendix A.4. Positive-Class Precision Across Nine Settings
| Model | A61 3y | A61 5y | A61 10y | H04 3y | H04 5y | H04 10y | G06 3y | G06 5y | G06 10y |
|---|---|---|---|---|---|---|---|---|---|
| GraphGPT-Patent (Ours) | 0.92 | 0.99 | 0.90 | 0.94 | 0.67 | 0.65 | 0.90 | 0.64 | 0.56 |
Appendix A.5. Quantile-Threshold Robustness
| Quantile Setting | Precision | Recall | F1 | Within-Group AUC | P@50 |
|---|---|---|---|---|---|
| Top/Bottom 5% | 0.823 | 0.892 | 0.856 | 0.915 | 0.679 |
| Top/Bottom 10% (main) | 0.798 | 0.867 | 0.817 | 0.904 | 0.652 |
| Top/Bottom 15% | 0.772 | 0.842 | 0.804 | 0.891 | 0.631 |
Appendix A.6. Future-Year/OOD Check (3y Horizon)
| Test Year | Precision | Recall | F1 | Within-Group AUC | P@50 |
|---|---|---|---|---|---|
| 2016 | 0.923 | 0.767 | 0.837 | 0.904 | 0.652 |
| 2017 | 0.914 | 0.751 | 0.824 | 0.896 | 0.641 |
| 2018 | 0.901 | 0.733 | 0.808 | 0.887 | 0.629 |
| 2019 | 0.889 | 0.712 | 0.791 | 0.876 | 0.614 |
Appendix A.7. Domain Imbalance Snapshot for the 2016 Test Slice
| Domain | Total Test Samples | Positive (Top 10%) | Negative (Bottom 10%) | Pos:Neg Ratio |
|---|---|---|---|---|
| A61 | 1788 | 179 | 179 | 1:1 |
| H04 | 1946 | 195 | 195 | 1:1 |
| G06 | 1768 | 177 | 177 | 1:1 |
| All domains | 5502 | 551 | 551 | 1:1 |
Appendix A.8. Hyperparameter Sensitivity of Graph Construction
| k-Hop | Top-K | t | Precision | Recall | F1 | Accuracy | Within-Group AUC | P@50 |
|---|---|---|---|---|---|---|---|---|
| 1 | 100 | 0.70 | 0.807 | 0.852 | 0.804 | 0.739 | 0.896 | 0.641 |
| 2 (default) | 100 | 0.70 | 0.798 | 0.867 | 0.817 | 0.744 | 0.904 | 0.652 |
| 3 | 100 | 0.70 | 0.789 | 0.868 | 0.815 | 0.741 | 0.901 | 0.648 |
| 2 | 150 | 0.68 | 0.793 | 0.869 | 0.816 | 0.742 | 0.902 | 0.649 |
| 2 | 80 | 0.72 | 0.802 | 0.861 | 0.814 | 0.745 | 0.900 | 0.646 |
Appendix A.9. Node Title Mapping for the Evidence-Subgraph Case
| Layer | ID | Short Title | Citations |
|---|---|---|---|
| Self | 7 | Interchangeable shaft assemblies for robotic surgery | 508 |
| Direct (L1) | 21 | Modular powered surgical articulation mechanism | 392 |
| Direct (L1) | 59 | Drive system lockout arrangements for end effectors | 538 |
| Direct (L1) | 63 | Rotary powered articulation joints for robotic tools | 531 |
| Direct (L1) | 193 | Locking arrangements for steerable catheter assemblies | 409 |
| Direct (L1) | 1389 | Robotically powered surgical control interface | 117 |
| Indirect (L2) | 1122 | Shaft assembly architectures for minimally invasive robots | 153 |
| Indirect (L2) | 20,315 | Articulation mechanism for multi-axis surgical manipulation | 1206 |
| Indirect (L2) | 1287 | Surgical device with multiple interchangeable modules | 195 |
| Indirect (L2) | 1201 | Handheld rotary powered surgical instrument | 142 |
| Indirect (L2) | 1118 | Articulatable surgical instrument linkage | 153 |
References
- Sinfield, J.; Solis, F. Finding a Lower-Risk Path to High-Impact Innovations. MIT Sloan Manag. Rev. 2016, 57, 79. [Google Scholar]
- Çağlar, M.; Gürel, S. Public R &D Project Portfolio Selection under Expenditure Uncertainty. Ann. Oper. Res. 2024, 341, 375–399. [Google Scholar] [CrossRef]
- Arsalan, M.; Mubin, O.; Al Mahmud, A. Mapping the Generations of Research Impact Science: A Scoping Review of Metrics, Frameworks, and Predictive Approaches. J. Libr. Inf. Stud. 2025, 23, 1–77. [Google Scholar]
- Bonino, D.; Ciaramella, A.; Corno, F. Review of the State-of-the-Art in Patent Information and Forthcoming Evolutions in Intelligent Patent Informatics. World Pat. Inf. 2010, 32, 30–38. [Google Scholar] [CrossRef]
- Boasson, V.; Boasson, E. Firm Value, Spatial Knowledge Flow, and Innovation: Evidence from Patent Citations. China Financ. Rev. Int. 2015, 5, 132–160. [Google Scholar] [CrossRef]
- de Almeida, B.P.; Gonçalves, E.; da Silva, A.S.; Reis, R.C. Internalization of Knowledge Spillovers by Regions: A Measure Based on Self-Citation Patents. Ann. Reg. Sci. 2021, 66, 309–330. [Google Scholar] [CrossRef]
- Nelson, A.; Earle, A.; Howard-Grenville, J.; Haack, J.; Young, D. Do Innovation Measures Actually Measure Innovation? Obliteration, Symbolic Adoption, and Other Finicky Challenges in Tracking Innovation Diffusion. Res. Policy 2014, 43, 927–940. [Google Scholar] [CrossRef]
- Tang, J.; Zhang, J.; Jin, R.; Yang, Z.; Cai, K.; Zhang, L.; Su, Z. Topic Level Expertise Search over Heterogeneous Networks. Mach. Learn. 2011, 82, 211–237. [Google Scholar] [CrossRef]
- Su, J.; Jiang, C.; Jin, X.; Qiao, Y.; Xiao, T.; Ma, H.; Wei, R.; Jing, Z.; Xu, J.; Lin, J. Large Language Models for Forecasting and Anomaly Detection: A Systematic Literature Review. arXiv 2024, arXiv:2402.10350. [Google Scholar] [CrossRef]
- Tang, J.; Yang, Y.; Wei, W.; Shi, L.; Su, L.; Cheng, S.; Yin, D.; Huang, C. Graphgpt: Graph Instruction Tuning for Large Language Models. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington, DC, USA, 14–18 July 2024; pp. 491–500. [Google Scholar] [CrossRef]
- Trajtenberg, M. A Penny for Your Quotes: Patent Citations and the Value of Innovations. Rand J. Econ. 1990, 21, 172–187. [Google Scholar] [CrossRef]
- Blind, K.; Kenney, M.; Leiponen, A.; Simcoe, T. Standards and Innovation: A Review and Introduction to the Special Issue. Res. Policy 2023, 52, 104830. [Google Scholar] [CrossRef]
- Hall, B.H.; Jaffe, A.B.; Trajtenberg, M. Market Value and Patent Citations: A First Look; National Bureau of Economic Research: Cambridge, MA, USA, 2000. [Google Scholar]
- Takahashi, C.K.; de Figueiredo, J.C.B.; Scornavacca, E. Investigating the Diffusion of Innovation: A Comprehensive Study of Successive Diffusion Processes through Analysis of Search Trends, Patent Records, and Academic Publications. Technol. Forecast. Soc. Change 2024, 198, 122991. [Google Scholar] [CrossRef]
- Cichosz, P. Bag of Words and Embedding Text Representation Methods for Medical Article Classification. Int. J. Appl. Math. Comput. Sci. 2023, 33, 603–621. [Google Scholar] [CrossRef]
- Martin, M.V.; Kirsch, D.A.; Prieto-Nañez, F. The Promise of Machine-Learning-Driven Text Analysis Techniques for Historical Research: Topic Modeling and Word Embedding. Manag. Organ. Hist. 2023, 18, 81–96. [Google Scholar] [CrossRef]
- Le, Q.; Mikolov, T. Distributed Representations of Sentences and Documents. In Proceedings of the International Conference on Machine Learning, Beijing, China, 22–24 June 2014; PMLR. pp. 1188–1196. [Google Scholar]
- Lee, J.S.; Hsiang, J. Patent Classification by Fine-Tuning BERT Language Model. World Pat. Inf. 2020, 61, 101965. [Google Scholar] [CrossRef]
- Ji, L.; Mao, J.; Shi, H.; Li, Q.; Chu, Y.; Yang, H. An Adaptive Framework of Geographical Group-Specific Network on O2O Recommendation. In Proceedings of the European Conference on Information Retrieval; Springer: Cham, Switzerland, 2024; pp. 278–286. [Google Scholar]
- Liu, J.; Yang, C.; Lu, Z.; Chen, J.; Li, Y.; Zhang, M.; Bai, T.; Fang, Y.; Sun, L.; Yu, P.S.; et al. Graph Foundation Models: Concepts, Opportunities and Challenges. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 5023–5044. [Google Scholar] [CrossRef] [PubMed]
- Wang, Z.; Liu, Z.; Ma, T.; Li, J.; Zhang, Z.; Fu, X.; Li, Y.; Yuan, Z.; Song, W.; Ma, Y.; et al. Graph Foundation Models: A Comprehensive Survey. arXiv 2025, arXiv:2505.15116. [Google Scholar] [CrossRef]
- Fantozzi, I.C.; Martuscelli, L.; Schiraldi, M.M. AI vs. Human Performance in University Assessments: A Case Study in Production Management. In Manufacturing 2030—A Perspective to Future Challenges in Industrial Production; Springer Nature: Cham, Switzerland, 2025; pp. 127–137. [Google Scholar] [CrossRef]
- Kipf, T.N. Semi-Supervised Classification with Graph Convolutional Networks. arXiv 2016, arXiv:1609.02907. [Google Scholar] [CrossRef]
- Hamilton, W.; Ying, Z.; Leskovec, J. Inductive Representation Learning on Large Graphs. Adv. Neural Inf. Process. Syst. 2017, 30, 1025–1035. [Google Scholar]
- Ying, Z.; Bourgeois, D.; You, J.; Zitnik, M.; Leskovec, J. Gnnexplainer: Generating Explanations for Graph Neural Networks. Adv. Neural Inf. Process. Syst. 2019, 32, 829. [Google Scholar]
- Nandi, R.N.; Maity, S.K.; Uzzi, B.; Medya, S. An Experimental Analysis on Evaluating Patent Citations. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, USA, 12–16 November 2024; pp. 373–387. [Google Scholar] [CrossRef]
- Feng, W.; Shi, H.; Zhao, P.; Gao, X. Mixtron: Bandit online multiclass prediction with implicit feedback. In Proceedings of the 2023 IEEE International Conference on Data Mining (ICDM); IEEE: Piscataway, NJ, USA, 2023; pp. 1004–1012. [Google Scholar]
- Irekponor, O. Designing Resilient AI Architectures for Predictive Energy Finance Systems amid Data Sovereignty, Adversarial Threats, and Policy Volatility. Int. J. Res. Publ. Rev. 2025, 6, 73–100. [Google Scholar] [CrossRef]
- Lavrič, M.; Lavrič, A.L. A Domain-Independent Review of Explainable AI’s Role in Facilitating Innovation and Creativity in Organizations. Mednar. Inov. Posl. J. Innov. Bus. Manag. 2025, 17. [Google Scholar] [CrossRef]
- Cheng, M.; Liu, Z.; Tao, X.; Liu, Q.; Zhang, J.; Pan, T.; Zhang, S.; He, P.; Zhang, X.; Wang, D.; et al. A Comprehensive Survey of Time Series Forecasting: Concepts, Challenges, and Future Directions. Authorea Prepr. 2025. [Google Scholar] [CrossRef]









| CPC Domain | Domain Description | Number of Patents |
|---|---|---|
| A61 | Medical or veterinary science | 269,364 |
| H04 | Electric communication technique | 379,099 |
| G06 | Computing | 340,667 |
| Dimension | Value |
|---|---|
| Domain | A61, H04, G06 |
| Prediction Window (d) | 3y, 5y, 10y* (capped at 2022) |
| Label Rule | Top 10% vs. Bottom 10% (Main Setting) |
| Training Set Grant Years | 2000–2015 |
| Test Set Grant Year | 2016 |
| Semantic Edge Threshold | , corresponding to average degree 5–25 |
| Model | Params (M) | Peak GPU (GB) | Time/Epoch (min) | Convergence (GPU-h) | Inference (min) | Est. Cost (USD) |
|---|---|---|---|---|---|---|
| PatentBERT-MLP | 111.6 | 4.8 | 5.2 | 1.1 | 2.3 | 3.0 |
| PatentBERT-GSAGE | 112.9 | 8.6 | 9.6 | 3.0 | 4.9 | 8.1 |
| GraphGPT-Patent (Ours) | 125.4 | 13.9 | 14.7 | 5.4 | 8.7 | 14.6 |
| Model | A61 3y | A61 5y | A61 10y | H04 3y | H04 5y | H04 10y | G06 3y | G06 5y | G06 10y |
|---|---|---|---|---|---|---|---|---|---|
| Doc2Vec-MLP | 0.81 | 0.87 | 0.93 | 0.68 | 0.68 | 0.68 | 0.64 | 0.93 | 0.91 |
| PatentBERT-MLP | 0.76 | 0.87 | 0.91 | 0.67 | 0.68 | 0.68 | 0.66 | 0.89 | 0.86 |
| Doc2Vec-GCN | 0.83 | 0.86 | 0.92 | 0.76 | 0.76 | 0.76 | 0.70 | 0.94 | 0.92 |
| Doc2Vec-GTN | 0.75 | 0.82 | 0.90 | 0.67 | 0.86 | 0.87 | 0.55 | 0.87 | 0.90 |
| Doc2Vec-GSAGE | 0.78 | 0.84 | 0.93 | 0.71 | 0.90 | 0.87 | 0.62 | 0.91 | 0.90 |
| PatentBERT-GCN | 0.76 | 0.85 | 0.92 | 0.61 | 0.63 | 0.61 | 0.70 | 0.93 | 0.89 |
| PatentBERT-GTN | 0.77 | 0.85 | 0.94 | 0.83 | 0.88 | 0.83 | 0.58 | 0.86 | 0.87 |
| PatentBERT-GSAGE | 0.74 | 0.85 | 0.91 | 0.70 | 0.91 | 0.85 | 0.56 | 0.93 | 0.88 |
| GraphGPT-Patent (Ours) | 0.84 | 0.88 | 0.94 | 0.74 | 0.93 | 0.89 | 0.72 | 0.94 | 0.92 |
| Model | A61 3y | A61 5y | A61 10y | H04 3y | H04 5y | H04 10y | G06 3y | G06 5y | G06 10y |
|---|---|---|---|---|---|---|---|---|---|
| Doc2Vec-MLP | 0.77 | 0.85 | 0.75 | 0.68 | 0.68 | 0.68 | 0.64 | 0.63 | 0.57 |
| PatentBERT-MLP | 0.74 | 0.85 | 0.89 | 0.69 | 0.69 | 0.69 | 0.68 | 0.66 | 0.67 |
| Doc2Vec-GCN | 0.78 | 0.84 | 0.75 | 0.73 | 0.73 | 0.73 | 0.69 | 0.62 | 0.57 |
| Doc2Vec-GTN | 0.74 | 0.81 | 0.75 | 0.69 | 0.61 | 0.57 | 0.58 | 0.66 | 0.60 |
| Doc2Vec-GSAGE | 0.76 | 0.82 | 0.76 | 0.71 | 0.57 | 0.59 | 0.64 | 0.65 | 0.61 |
| PatentBERT-GCN | 0.74 | 0.83 | 0.75 | 0.64 | 0.64 | 0.64 | 0.68 | 0.64 | 0.63 |
| PatentBERT-GTN | 0.75 | 0.84 | 0.80 | 0.68 | 0.64 | 0.68 | 0.61 | 0.67 | 0.67 |
| PatentBERT-GSAGE | 0.73 | 0.84 | 0.89 | 0.72 | 0.60 | 0.67 | 0.60 | 0.67 | 0.68 |
| GraphGPT-Patent (Ours) | 0.78 | 0.86 | 0.90 | 0.73 | 0.66 | 0.70 | 0.70 | 0.69 | 0.68 |
| Model | A61 3y | A61 5y | A61 10y | H04 3y | H04 5y | H04 10y | G06 3y | G06 5y | G06 10y |
|---|---|---|---|---|---|---|---|---|---|
| Doc2Vec-MLP | 0.86 | 0.91 | 0.78 | 0.78 | 0.78 | 0.78 | 0.75 | 0.72 | 0.59 |
| PatentBERT-MLP | 0.83 | 0.92 | 0.90 | 0.79 | 0.79 | 0.79 | 0.79 | 0.73 | 0.65 |
| Doc2Vec-GCN | 0.87 | 0.91 | 0.78 | 0.83 | 0.83 | 0.83 | 0.79 | 0.72 | 0.59 |
| Doc2Vec-GTN | 0.83 | 0.89 | 0.77 | 0.78 | 0.61 | 0.50 | 0.69 | 0.72 | 0.60 |
| Doc2Vec-GSAGE | 0.85 | 0.90 | 0.79 | 0.81 | 0.59 | 0.51 | 0.75 | 0.73 | 0.61 |
| PatentBERT-GCN | 0.84 | 0.90 | 0.78 | 0.74 | 0.73 | 0.74 | 0.79 | 0.73 | 0.61 |
| PatentBERT-GTN | 0.84 | 0.91 | 0.56 | 0.81 | 0.63 | 0.56 | 0.72 | 0.73 | 0.64 |
| PatentBERT-GSAGE | 0.83 | 0.91 | 0.91 | 0.81 | 0.61 | 0.56 | 0.71 | 0.74 | 0.66 |
| GraphGPT-Patent (Ours) | 0.88 | 0.93 | 0.92 | 0.83 | 0.78 | 0.75 | 0.80 | 0.76 | 0.70 |
| Model | Precision | Recall | Accuracy | F1 |
|---|---|---|---|---|
| Doc2Vec-MLP | 0.802 ± 0.177 | 0.792 ± 0.115 | 0.694 ± 0.079 | 0.772 ± 0.084 |
| PatentBERT-MLP | 0.861 ± 0.159 | 0.779 ± 0.097 | 0.729 ± 0.079 | 0.799 ± 0.077 |
| Doc2Vec-GCN | 0.803 ± 0.179 | 0.828 ± 0.082 | 0.716 ± 0.076 | 0.794 ± 0.088 |
| Doc2Vec-GTN | 0.702 ± 0.230 | 0.799 ± 0.113 | 0.668 ± 0.080 | 0.710 ± 0.116 |
| Doc2Vec-GSAGE | 0.706 ± 0.235 | 0.829 ± 0.099 | 0.679 ± 0.082 | 0.727 ± 0.123 |
| PatentBERT-GCN | 0.810 ± 0.172 | 0.764 ± 0.129 | 0.688 ± 0.066 | 0.763 ± 0.076 |
| PatentBERT-GTN | 0.677 ± 0.223 | 0.823 ± 0.096 | 0.704 ± 0.072 | 0.711 ± 0.117 |
| PatentBERT-GSAGE | 0.754 ± 0.229 | 0.814 ± 0.116 | 0.711 ± 0.093 | 0.749 ± 0.119 |
| GraphGPT-Patent (Ours) | 0.798 ± 0.154 | 0.867 ± 0.084 | 0.744 ± 0.085 | 0.817 ± 0.080 |
| Variant | Precision | Recall | F1 | Within-Group AUC | P@50 |
|---|---|---|---|---|---|
| No reliability + cls-only | 0.781 | 0.848 | 0.803 | 0.876 | 0.614 |
| Temporal reliability only + cls-only | 0.790 | 0.858 | 0.811 | 0.885 | 0.626 |
| Domain reliability only + cls-only | 0.786 | 0.853 | 0.807 | 0.882 | 0.621 |
| Full reliability + cls-only | 0.793 | 0.864 | 0.812 | 0.892 | 0.634 |
| Full reliability + ranking (final) | 0.798 | 0.867 | 0.817 | 0.904 | 0.652 |
| Model | Training Window | Acc | Pr | Re | F1 |
|---|---|---|---|---|---|
| Doc2Vec-GCN | 2000–2004 | 0.61 | 0.66 | 0.69 | 0.67 |
| Doc2Vec-GCN | 2005–2009 | 0.66 | 0.67 | 0.84 | 0.74 |
| Doc2Vec-GCN | 2010–2014 | 0.68 | 0.68 | 0.89 | 0.77 |
| Doc2Vec-GTN | 2000–2004 | 0.65 | 0.72 | 0.66 | 0.69 |
| Doc2Vec-GTN | 2005–2009 | 0.67 | 0.68 | 0.83 | 0.75 |
| Doc2Vec-GTN | 2010–2014 | 0.69 | 0.68 | 0.89 | 0.77 |
| Doc2Vec-GSAGE | 2000–2004 | 0.65 | 0.72 | 0.67 | 0.70 |
| Doc2Vec-GSAGE | 2005–2009 | 0.66 | 0.68 | 0.81 | 0.74 |
| Doc2Vec-GSAGE | 2010–2014 | 0.71 | 0.70 | 0.88 | 0.78 |
| PatentBERT-GCN | 2000–2004 | 0.66 | 0.73 | 0.67 | 0.70 |
| PatentBERT-GCN | 2005–2009 | 0.69 | 0.73 | 0.75 | 0.74 |
| PatentBERT-GCN | 2010–2014 | 0.72 | 0.73 | 0.85 | 0.78 |
| PatentBERT-GTN | 2000–2004 | 0.67 | 0.76 | 0.65 | 0.70 |
| PatentBERT-GTN | 2005–2009 | 0.70 | 0.74 | 0.76 | 0.75 |
| PatentBERT-GTN | 2010–2014 | 0.73 | 0.75 | 0.82 | 0.78 |
| PatentBERT-GSAGE | 2000–2004 | 0.67 | 0.76 | 0.64 | 0.69 |
| PatentBERT-GSAGE | 2005–2009 | 0.69 | 0.74 | 0.73 | 0.73 |
| PatentBERT-GSAGE | 2010–2014 | 0.73 | 0.75 | 0.83 | 0.78 |
| GraphGPT-Patent (Ours) | 2000–2004 | 0.69 | 0.73 | 0.78 | 0.75 |
| GraphGPT-Patent (Ours) | 2005–2009 | 0.71 | 0.73 | 0.84 | 0.78 |
| GraphGPT-Patent (Ours) | 2010–2014 | 0.73 | 0.74 | 0.88 | 0.81 |
| CPC | Label | Average Density | Average Degree | Clustering Coefficient |
|---|---|---|---|---|
| A61 | High Citation | 0.470 | 5.705 | 0.265 |
| A61 | Low Citation | 0.563 | 6.232 | 0.228 |
| H04 | High Citation | 0.322 | 16.220 | 0.460 |
| H04 | Low Citation | 0.287 | 10.826 | 0.331 |
| G06 | High Citation | 0.221 | 14.368 | 0.431 |
| G06 | Low Citation | 0.221 | 9.210 | 0.284 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Fang, T.; Si, J.; Ye, C.; Shi, H. GraphGPT-Patent: Time-Aware Graph Foundation Modeling on Semantic Similarity Document Graphs for Grant-Time Economic Impact Prediction. Appl. Sci. 2026, 16, 2737. https://doi.org/10.3390/app16062737
Fang T, Si J, Ye C, Shi H. GraphGPT-Patent: Time-Aware Graph Foundation Modeling on Semantic Similarity Document Graphs for Grant-Time Economic Impact Prediction. Applied Sciences. 2026; 16(6):2737. https://doi.org/10.3390/app16062737
Chicago/Turabian StyleFang, Tianhui, Junru Si, Chi Ye, and Hailong Shi. 2026. "GraphGPT-Patent: Time-Aware Graph Foundation Modeling on Semantic Similarity Document Graphs for Grant-Time Economic Impact Prediction" Applied Sciences 16, no. 6: 2737. https://doi.org/10.3390/app16062737
APA StyleFang, T., Si, J., Ye, C., & Shi, H. (2026). GraphGPT-Patent: Time-Aware Graph Foundation Modeling on Semantic Similarity Document Graphs for Grant-Time Economic Impact Prediction. Applied Sciences, 16(6), 2737. https://doi.org/10.3390/app16062737

