Next Article in Journal
Protein Adsorption on Nano- and Microparticles: Dependence on Morphological and Physicochemical Properties of Particles and Effect on Particle–Cell Interactions
Previous Article in Journal
Plant-Based ZnO Nanoparticles for Green Nanobiocontrol of a Highly Virulent Bacterial Leaf Blight Pathogen: Mechanistic Insights and Biocompatibility Evaluation
Previous Article in Special Issue
Improving the Performance of Arsenene Nanoribbon Gate-All-Around Tunnel Field-Effect Transistors Using H Defects
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Graph Neural Network Determine the Ground State Structures of Boron or Nitride Substitute C60 Fullerenes

1
College of Mathematics, Hohai University, Changzhou 213200, China
2
Guangdong Basic Research Center of Excellence for Structure and Fundamental Interactions of Matter, Guangdong Provincial Key Laboratory of Quantum Engineering and Quantum Materials, School of Physics, South China Normal University, Guangzhou 510006, China
3
College of Mechanics and Engineering Sciences, Hohai University, Nanjing 210098, China
*
Authors to whom correspondence should be addressed.
Nanomaterials 2025, 15(13), 1012; https://doi.org/10.3390/nano15131012
Submission received: 7 May 2025 / Revised: 17 June 2025 / Accepted: 19 June 2025 / Published: 30 June 2025

Abstract

Substitutional doping of fullerenes represents a significant category of heterofullerenes. Due to the vast number of isomers, confirming the ground state structure poses considerable challenges. In this study, we generated isomers of C60−nBn and C60−nNn with n ranging from 2 to 12. To avoid overlooking the ground state structures, we applied specific filtering rules: no adjacent nitrogen (N) or boron (B) atoms are allowed, and substitutions in meta-positions within pentagons are prohibited when the substitution number n exceeds nine. Approximately 15,000 isomers across various values of n within the range of 2 to 12 for B and N substituted fullerenes were selected and optimized using density functional theory (DFT) calculations, forming our dataset. We developed a Graph Neural Network (GNN) that aggregates both topological connections and its dual graph with ring types as input information to predict their binding energies. The GNN achieved high accuracy, reaching a root mean square error (RMSE) of 1.713 meV. Furthermore, it operates efficiently; indeed, it can predict over six thousand isomers per second on an eight-core PC. Several predicted stable structures were further optimized by DFT to confirm their ground state configurations. The energy cutoffs of each composition were determined through statistical simulations to ensure that the selected ground state structures possess high confidence levels. Notably, new lower-energy structures have been discovered for boron-substituted fullerenes with substitution number ranging from seven to twelve and nitride-substituted fullerenes with substitution number ranging from seven to eleven.

1. Introduction

Heterofullerenes, which involve the incorporation of heteroatoms into the cage structures of fullerenes, can be classified into three categories: exo-doping, substitutional doping, and endo-doping. In comparison to pristine carbon fullerenes, these doped nanocages exhibit distinct electronic, magnetic, and optical properties, thereby expanding their potential applications in various fields such as catalysis [1], hydrogen storage materials [2], lithium battery cathodes [3], nonlinear optical materials [4,5], and molecular electronics [6,7,8]. Among all types of heterofullerenes, nitrogen- and boron-doped fullerenes have attracted significant attention due to the trivalent nature of nitrogen and boron atoms along with their comparable size and electronegativity to carbon. Notably, the chemical inertness of BCN materials surpasses that of diamond. Consequently, they are suitable for use in high-temperature semiconductor devices as well as short-wavelength optoelectronic devices [9,10].
Since the synthesis of carbon fullerene cages [11,12], there has been considerable interest in exploring their novel structures alongside their electronic and optical properties. In a pioneering experiment conducted in 1991 by Guo et al., boron-substituted fullerenes C60−nBn (n = 1–6) were reported [13]. Chen et al. performed semiempirical and ab initio calculations on C60 substituted with two to eight B or N atoms [14] as well as on C70 substituted with two to ten B or N atoms [15]. They found that fullerene structures doped with fewer heteroatoms tend to exhibit greater stability.
In 2011, Garg et al. empirically identified several structures of C60−nBn (where n = 1–12) and discovered potential lower-energy configurations [16]. They noted that boron-doped fullerenes contain no more than one B atom on a pentagon, while two B atoms can coexist in either para or meta positions on the same hexagon. Xie investigated a lower isomer of C48B12, which demonstrates enhanced third-order optical non-linearity, thereby suggesting its promising application in photonic devices [8]. Cheng conducted a systematic study of the C60−nBn systems for n = 1–6 and uncovered new low-energy structures [17].
Comparatively, C60−nNn azafullerenes have been studied more extensively than boron-doped fullerenes. Several azafullerenes, including C59N+ and C69N+ [18] and C57N3 [19], have been successfully synthesized in laboratory settings. Hultman et al. [20] discovered a cage structure of C48N12 exhibiting S6 symmetry and synthesized carbon-nitrogen nano-onions with a nitrogen content of 20%. A more stable configuration of C48N12 was theoretically proposed by Manna et al. [21], which retains the same S6 symmetry as reported by Hultman et al. [20]. Chen found that nitrogen substitution in host carbon fullerenes is generally more stable than boron substitution [14,15]. Sharma et al. [22] investigated the structural, electronic, and vibrational properties of C60−nNn (n = 1–12). Srinivasu et al. [23] calculated the structure, stability, and nonlinear optical properties of C60−2nN2n (for n = 1–12). In 2019, Cheng et al. [24] conducted a systematic investigation into the structures of C60−nNn up to n = 12. They calculated all possible isomers for n = 1–4, estimated isomer energies for n = 5–9, and proposed a classification method to filter out unstable isomers for n = 10–12.
Despite the fact that some groups have investigated the substitution of B or N for carbon in C60 and other sized carbon fullerenes, the ground state structures of these heterofullerenes remain uncertain. This uncertainty primarily stems from the exponential increase in the number of isomers of C60−nXn with respect to n [25], rendering it nearly impossible to calculate all possible isomers using first-principles methods. Generally speaking, previous studies have employed semiempirical approaches or applied strict filtering conditions to compute a limited set of isomers. However, excessive filtering may result in the exclusion of critical ground state structures [24].
In the past decade, machine learning (ML) techniques have been successfully employed to accelerate the structural prediction of clusters, including main-group clusters [26,27,28], coinage metal clusters [29], and specific cage clusters [30]. A ML model for atomistic simulations of boron and carbon, constructed using Gaussian approximation potential, can expedite the global minimum search for Bn (n = 36, 40, 84) and Cn (n up to 720) clusters. The global minimum structures of small clusters such as Ptn (n = 8–14), Tan (n = 9–13), and Agn (n = 14–26) have been successfully obtained using various deep neural networks. Furthermore, global optimization efforts for larger Agn (n = 30–60) and Aln clusters (n = 21–55) have demonstrated the scalability and transferability of machine learning methods.
Combining optimization techniques with machine learning could significantly enhance the current state-of-the-art in structural search methodologies. However, since these models rely on atomic coordinates as inputs, their accuracy is contingent upon a reasonable initial structure that typically necessitates DFT calculations. To further improve efficiency, it is desirable to develop effective methods capable of directly predicting energy based on topological connections within cage structures. A notable attempt in this direction was made by Liu et al., who utilized a neural network potential based on SchNet [31] to investigate the exohedral functionalization of fullerenes [30], achieving a mean absolute error (MAE) of just 0.37 eV. Moreover, in our previous work, we developed a graph neural network (GNN) model that directly predicts the binding energy of boron-nitride fullerene cages solely from their topological connections [32].
In this paper, we generated all possible isomers of C60−nBn and C60−nNn (n = 2–12) utilizing a recursive algorithm combined with isomorphic judgment techniques. Subsequently, we developed a modified GNN model to predict their binding energies both rapidly and accurately. This model employs ring type as initial features and performs convolution on both the source graph and dual graph to aggregate vertex and ring characteristics effectively. As a result, we confirmed several previous findings while also identifying some lower-energy structures.

2. Data Generation

In this study, we began with the well-known C60 buckyball with Ih symmetry and subsequently replace a portion of the C atoms with B and N atoms. Taking azafullerenes as an example, we first substituted a specified number (n) of N atoms into C60. The B-substituted fullerenes can be readily obtained by replacing all N atoms in the azafullerenes with B atoms. Given that all 60 C atoms are equivalent, we will consistently replace the first C atom with an N atom. Following this substitution, we will generate combinations represented by C(59, n − 1). Due to the high symmetry inherent in the structure of C60, some substitutions may yield equivalent configurations.
The symmetry group of C60 has been determined to be a group of order 120, consisting of permutations of the 60 atoms. For each permutation, a hash value is computed by encoding the replacement position using sexagesimal notation. The minimal hash among all 120 equivalent replacements is utilized to uniquely identify each substitution. During the generation of each combination, we verified whether it already exists in the hash set. If it does not exist, a new substitution is identified and its minimal hash is added to the set. Additionally, the replacement position is recorded as a valid structure. Each generated structure retains the minimum replacement position among its equivalent replacement isomers.
The theoretical number of isomers can be determined by Pólya’s enumeration theorem based on the symmetry group of the C60 structure. We generated the atom permutation group of C60 and then deduced that the number of replacement of k B or N atoms corresponds to coefficient of xk in following polynomial:
1 120 ( 1 + x ) 60 + 16 ( 1 + x 2 ) 60 + 24 ( 1 + x 10 ) 6 + 20 ( 1 + x 6 ) 10 + 24 ( 1 + x 5 ) 12 + 20 ( 1 + x 3 ) 20 + 15 ( 1 + x 2 ) 28 ( 1 + x ) 4
The number of isomers for C60 substituted by N (or B) atoms is summarized in Table 1. As noted in previous studies [14,16], structures containing adjacent N or B atoms are energetically unfavorable. Consequently, we restrict our analysis to configurations where N and B atoms are not adjacent. When introducing a new N atom, its position and neighboring positions are excluded from the possible positions already in the set. The subsequent N atom is then placed in one of the remaining available positions. However, as the number of N atoms increases, the number of isomers without adjacent N atoms grows excessively large. For instance, when there are 10 N atoms, the number of isomers exceeds 10 million (see Table 1). Garg et al. also observed that a pentagon cannot accommodate more than one boron atom [16]. Furthermore, Srinivasu et al. [23] and Cheng et al. [24] excluded isomers with meta-position substitutions in pentagons and introduced additional filtering criteria to further reduce the number of isomers. In this study, we applied a rule prohibiting meta-position substitutions on pentagons for n = 10–12, which results in a tenfold reduction in the number of isomers (see Table 1). This filtering significantly enhances computational efficiency, allowing GNN to predict all isomer energies within an acceptable time frame.
Some structures of C60−nBn and C60−nNn were selected, with the number n ranging from 3 to 12. All isomers of C57B3 and C56N4 were included, while a subset of isomers for n > 4 was randomly selected. These structures were optimized with spin unrestricted for both open and close shell clusters using DFT as implemented in the DMol3 package [33], forming our dataset. The double numerical basis set along with the Perdew–Burke–Ernzerhof (PBE) functional within the generalized gradient approximation (GGA) [34] was employed for self-consistent field calculations. The relaxed C60-Ih structure yielded two distinct types of C-C bond lengths: 1.401 Å and 1.460 Å, which align reasonably well with experimental values of 1.401 Å and 1.458 Å [35,36]. Our dataset encompasses six compositions of C-B and C-N cluster systems: C57B3, C54B6, C56N4, C54N6, C52N8, C51N9, C49N11, and C48N12, and two test datasets—C49N11 and C48N12—while the remaining are designated as training data (Table 2). In total, the training and testing datasets comprise 11,594 and 1916 structures, respectively.

3. Train and Test

GNN is employed to predict the energies of C60−nBn and C60−nNn heterofullerenes. Based on our previous research, we found that utilizing the dual graph simplifies handling compared to its original graph [32]. Therefore, in this study, we adopted the dual graph as input. The first critical step is to the initial features for the vertices. We should emphasize that only topology information can be used as input because using coordinates would require optimizing structures first. However, optimizing millions of isomers using DFT calculations is impractical. Our task is to predict binding energies without relying on DFT calculations. In a simple graph, the degree or element type serves as a natural feature of a vertex; however, degree alone cannot differentiate between various isomers. Additionally, there is no element type present in the dual graph since each vertex corresponds to a ring in the original graph. In our prior investigation of (BN)n fullerenes [32], we established that initial vertex features could be represented by permutations of neighboring vertices for each vertex, effectively capturing different rings. For C-B and C-N cage clusters, there are only 14 distinct types of pentagons and hexagons, as illustrated in Figure 1. Consequently, an integer ranging from 0 to 13 can be utilized to represent a vertex within the dual graph. This integer is then mapped to a vector through an embedding layer to form the feature vector corresponding to that specific vertex. Following this process, vertex features are updated by aggregating features from neighboring vertices into central vertices using various methods. Below are two approaches: one method involves summing all neighbor features [32,37] according to the following formula:
H l + 1 = σ ( A ^ H l W l )
Here, W represents a weight matrix used for transferring feature dimensions; Hl denotes l-th layer of vertices’ features; σ is the active function; and I is identity matrix with the same order of A. A ^ signifies a matrix obtained from adjacent matrix A.
A ˜ = A + I D ˜ i i = j A ˜ i j A ^ = D ˜ 1 2 A ˜ D ˜ 1 2
An enhanced approach entails aggregating neighboring features with weighted coefficients, a technique commonly referred to as the attention mechanism [38]:
α i j = Softmax ( LeakyReLU ( h i | | h j ) ) v i l + 1 = Elu ( h i + β j N ( i ) α i j h j )
where hi represents feature of node i. αij is attention coefficients of node i. Softmax, LeakyReLU, and Elu are activation functions.
We initially constructed a model that comprises a feature embedding layer, three graph convolution layers, a fully connected layer, and a final readout function. The feature embedding layer transforms the input ring type into vector features. The graph convolution layers capture local structural information. The fully connected layer maps each vertex feature to one dimension and subsequently sums all atom values into a single value that serves as the prediction for average binding energy. This model (referred to as Model-1) achieved a test mean square error (MSE) of 1.974 meV for average binding energy (see Table 3). Model-2 introduces an additional graph convolution layer following the three graph convolution layers from Model-1; this new layer aggregates dual-graph features onto the original graph vertices. As a result, this model attained a lower MSE of 1.932 meV (Figure 2). A more effective model (Model-3) was developed by incorporating element information: the elemental data from all 60 sites were embedded into vectors, followed by application of a tanh activation function and integration with another fully connected layer. Elemental information was added to vertex features prior to the readout function. Consequently, Model-3 achieved an even lower MSE of 1.822 meV.
When varying the number of graph convolution layers to two or four, we observed an increase in test MSEs to 1.862 and 1.872 meV, respectively; both results exceeded the test MSE obtained with three graph convolution layers. Furthermore, we compared the performance of graph attention layers against traditional graph convolution layers. By substituting two graph attention layers for the original ones in our architecture, we recorded a minimum test error of 1.713 meV for the resulting Model-4; however, employing three graph attention layers led to an increased test error of 1.868 meV (as shown in Figure 2), suggesting that two attention layers are sufficient. Overall, these findings indicate that networks utilizing ring type as input topology information alongside graph convolutional mechanisms for substructure feature extraction demonstrate robustness and exhibit minimal sensitivity to specific architectural details.

4. Prediction

Our GNN model has successfully and rapidly predicted all isomers of C60−nBn and C60−nNn for n = 4–12. The prediction speed reaches 6000 isomers per second on a PC equipped with an 8-core Xeon Gold 6139 CPU, allowing for the prediction of up to 2 million isomers within just 5 min. For any given system, the subsequent step involves selecting stable structures from the top list generated by the GNN predictions for further examination using DFT calculations, aimed at identifying the true ground state structure. Taking C52B8 as an example, we identified a total of 4,158,712 isomers, with the highest predicted binding energy being 8.5348 eV according to our GNN model. It is essential to determine a cutoff value Ec such that those isomers exhibiting predicted binding energies greater than 8.5348–Ec may potentially include the true ground state structure. Hence, only structures with predicted energies exceeding 8.5348–Ec will be considered for further analysis via DFT calculations.
It should be noted that both the distribution of predicted energies and the number of isomers can vary across different systems. To address this variability, we conducted 1000 simulations for each composition in order to ascertain its corresponding Ec value. In each simulation, we introduced random errors into the predicted energies of all isomers based on a distribution derived from sample statistics representing their true binding energy. Subsequently, we establish an Ec value such that in 5 out of 1000 simulations, the difference between the lowest predicted energy and simulated true energy remains less than Ec.
In Figure 3, we illustrated the energy differences between GNN predictions and DFT results specifically for C49N11 obtained from these simulations; this reveals a sample standard deviation (σ) of energy difference equal to 0.002147 eV. The prediction error was assessed using the Kolmogorov–Smirnov test, which yielded a p-value of 0.9802, significantly higher than our significance level set at 0.05. This indicates that we cannot reject the hypothesis asserting conformity to a normal distribution. We assume that discrepancies between GNN predictions and DFT results follow a normal distribution denoted as N(0, σ2). Accordingly, each GNN-predicted energy receives an associated random error ε~N(0, σ2). By selecting Ec = 3.5σ, only 91 isomers are predicted to have energies exceeding 8.5348–Ec, leading to just four instances where the lowest DFT energy does not fall within this cutoff range. As a consequence, we can assert with 99.6% confidence that one of these 91 isomers corresponds to the ground state structure.
Table 4 presents all cutoff values and the number of isomers to be examined through DFT calculations for C60−nBn and C60−nNn with n = 4–12. In most cases, less than 1% of all possible isomers require evaluation. The selected combinations of cutoff values and isomers were optimized using DMol3 program with high precision. For n ≤ 6, our results align with previously reported ground state structures; however, for n > 7, new lower-energy structures were identified except in the case of C48N12.
The previously reported ground state structures and the top three lowest energy configurations identified by GNN for C60−nBn and C60−nNn with n = 7–12 are shown in Figure 4. The ground state structure of C53B7, as determined in this study, features C atom indices (1, 7, 11, 16, 24, 27, 36) that have been substituted with boron atoms. This configuration exhibits a flower-like substructure composed of five petal-shaped hexagons surrounding a pentagon; specifically, five B atoms are symmetrically arranged on the petals of these hexagons. Additionally, two other B atoms occupy para-positions within another hexagon, resulting in the formation of five pairs of para-positioned B-B bonds. Notably, this structure is energetically favored by 2.057 eV compared to that reported by Garg et al. [16]. The second lowest-energy configuration for C53B7 identified here has B positions at (1, 7, 11, 14, 24, 27, 31), which also maintains a similar arrangement involving five B positions.
Regarding the fullerene C60−nBn series for n = 7, 9, 10, and 11, we have discovered several additional isomers exhibiting greater stability than those previously reported. The top five stable isomers are detailed in Table S1. For instance, the lowest-energy structure found for C52B8 corresponds to B positions (1, 7, 11, 15, 24, 27, 36, 39). It shares a comparable arrangement of five B atoms akin to that observed in C53B7 7 while incorporating three additional boron atoms organized into two pairs located at para-positions within their respective hexagons. This particular configuration demonstrates an energy reduction of approximately 0.083 eV relative to isomer(b), which contains two B atoms situated within a pentagon as reported by Chen et al. [14], and it is also lower in energy by 1.014 eV compared to the structure presented by Garget al. [16].
The lowest-energy structure of C51B9 identified by GNN features B atoms located at positions (1, 7, 11, 14, 24, 27, 31, 36, and 39), exhibiting an energy reduction of 2.034 eV compared to the previously reported configuration [16]. This structure also includes a flower-like sub-structure akin to that found in C53B7; additionally, the remaining four B atoms are arranged in two para-positioned hexagons.
For C50B10, the ground state structure reveals a substitution pattern for B atoms at positions (1, 7, 11, 24, 27, 34, 37, and 50). It comprises two groups of five B atom substructures similar to those observed in C53B7 and demonstrates D5d symmetry. Notably, this specific structure is energetically more favorable by approximately 1.969 eV when compared with earlier reported structure [16].
In the case of n = 11, the lowest-energy structure involves substituting B atoms at positions (1, 7, 11, 14, 17, 24, 27, 31, 35, 41, 57). This configuration showcases five B atoms positioned analogously to petal arrangements seen in C53B7. The remaining six B atoms are organized into three pairs situated in para positions. This particular geometry exhibits greater stability than the isomeric structure documented by Garg et al. [16], which contains a pair of meta-positioned B atoms within a pentagon and is less stable by approximately 1.315 eV.
For C48B12, we found that the most energetically favorable configuration consists of B atoms placed at locations (1, 6, 8, 11, 16, 18, 23, 28, 31, 36, 54, 60). In contrast to previous structures examined, it incorporates a hexagon containing three B atoms. Furthermore, the S6 symmetric structure reported by Manna et al. [39] is 0.147 eV higher in energy than our current ground state configuration and ranks fifth on our list.
In this study, we found lower energy structures for all C60−nNn systems with n = 7–11, while the previously reported ground state structure for n = 12 has been confirmed (Figure 5). Our predicted lowest energy configuration of C53N7 features nitrogen atoms located at locations (1, 7, 26, 31, 37, 51, 54). This structure includes three pairs of para-positioned N and is energetically favored by 0.048 eV compared to the structure with two pairs of para-positioned N [24].
For C52N8, we identified eight structures (Table S1) that possess lower energies than those previously reported [24]. Among these configurations, the lowest-energy structure has N positioned at (1, 7, 26, 31, 37, 46, 51, 54), which is found to be energetically more favorable by 0.095 eV relative to that in literature [24]. The current ground state structure of C51N9 contains N atoms situated at positions (1, 7, 11, 14, 24, 27, 35, 54, 60). This configuration comprises four pairs of para-positioned N; this is in contrast to the previously reported one that contained only three pairs [24]. Our proposed structure is lower in energy by 0.102 eV in comparison to the previously reported structure featuring just three pairs of para-positioned N [24].
Furthermore, we have identified four additional structures with even lower energies. For C50N10, we discovered a total of sixteen structures that exhibit greater energetic favorability compared to those previously reported. Among these, the lowest-energy structure is favored by as much as 0.177 eV. This specific configuration comprises three pairs each of meta-positioned and para-positioned N atoms, which are substituted at positions (1, 6, 11, 15, 18, 43, 46, 49, 52, 56). In the case of C49N11, we found six isomers with lower energy. Notably, the ground state structure is 0.012 eV lower than the previously reported structure [24]. In this configuration, N atoms are positioned at (1, 6, 11, 18, 23, 27, 33, 40, 48, 51, 59), consisting of two pairs of meta-positioned N and six pairs of para-positioned N atoms.

5. Conclusions

To summarize, all possible cage structures of C60−nBn and C60−nNn with n = 2–12 have been generated. A graph neural network has been trained on over 10,000 data points, achieving a test mean squared error of 1.713 meV. This enables the rapid prediction of binding energies for a substantial number of isomers. Through meticulous statistical analysis, only the top several to thousands of isomers are selected to ascertain the true ground state structure for each system. New lower-energy structures have been identified for C60−nBn with n = 7–12 and C60−nNn with n = 7–11. Our methodology significantly accelerates the search for ground state structures in cage-like molecules or clusters and contributes to advancements in fullerene research.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/nano15131012/s1, Table S1: List and relative energy of isomers lower in energy than Reference. When isomer numbers larger than 5, only list top 5 isomers. Table S2: Cartesian coordinates (Å) of ground state C53B7, C52B8, C51B9, C50B10, C49B11 and C48B12 clusters. Table S3: Cartesian coordinates (Å) of ground state C53N7, C52N8, C51N9, C50N10, and C49N11 clusters.

Author Contributions

Methodology, L.S.; Software, L.S.; Validation, B.D.; Formal analysis, C.T.; Data curation, S.A.; Writing—original draft, L.S. and B.D.; Writing—review & editing, J.Z.; Visualization, L.F.; Supervision, J.Z.; Funding acquisition, L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (12374254).

Data Availability Statement

The code used in this paper is available in Gitee repository: https://gitee.com/saieuler/bcn_code. This open-sourced repository contains code generate structures, network definition, train file and other relevant codes. List of top 5 structures can be seen in Supplemental Information.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Agnihotri, D.; Sharma, H. Boron Doped Fullerene C60 as Catalyst for NaAlH4. Adv. Nanomat. Nanotech. 2013, 143, 499–502. [Google Scholar]
  2. Sun, Q.; Wang, Q.; Jena, P. Functionalized heterofullerenes for hydrogen storage. Appl. Phys. Lett. 2009, 94, 013111. [Google Scholar] [CrossRef]
  3. Sood, P.; Kim, K.C.; Jang, S.S. Electrochemical Properties of Boron-Doped Fullerene Derivatives for Lithium-Ion Battery Applications. ChemPhysChem 2018, 19, 753–758. [Google Scholar] [CrossRef] [PubMed]
  4. Zhou, X.; Li, W.-Q.; Shao, B.; Tian, W.Q. Nonlinear optical properties of fullerene C96 (D3d) and related heterofullerenes. J. Phys. Chem. C 2013, 117, 23172–23177. [Google Scholar] [CrossRef]
  5. Xie, R.-H.; Bryant, G.W.; Zhao, J.; Smith, V.H., Jr.; Di Carlo, A.; Pecchia, A. Tailorable Acceptor C60−nBn and Donor C60−mNm Pairs for Molecular Electronics. Phys. Rev. Lett. 2003, 90, 206602. [Google Scholar] [CrossRef]
  6. Rotas, G.; Tagmatarchis, N. Azafullerene C59N in Donor-Acceptor Dyads: Synthetic Approaches and Properties. Chem. Eur. J. 2016, 22, 1206–1214. [Google Scholar] [CrossRef]
  7. Zhao, J.; Zeng, C.; Cheng, X.; Wang, K.; Wang, G.; Yang, J.; Hou, J.G.; Zhu, Q. Single C59N Molecule as a Molecular Rectifier. Phys. Rev. Lett. 2005, 95, 045502. [Google Scholar] [CrossRef]
  8. Xie, R.-H.; Jensen, L.; Bryant, G.W.; Zhao, J.; Smith, V.H. Structural, electronic, and magnetic properties of heterofullerene C48B12. Chem. Phys. Lett. 2003, 375, 445–451. [Google Scholar] [CrossRef]
  9. Liang, Z.; Tu, H.; Shi, D.; Chen, F.; Jiang, H.; Shao, Y.; Wu, Y.; Hao, X. In Situ Growing BCN Nanotubes on Carbon Fibers for Novel High-Temperature Supercapacitor with Excellent Cycling Performance. Small 2021, 17, 2102899. [Google Scholar] [CrossRef]
  10. Chakraborty, H.; Mogurampelly, S.; Yadav, V.K.; Waghmare, U.V.; Klein, M.L. Phonons and thermal conducting properties of borocarbonitride (BCN) nanosheets. Nanoscale 2018, 10, 22148–22154. [Google Scholar] [CrossRef]
  11. Kroto, H.W.; Heath, J.R.; O’Brien, S.C.; Curl, R.F.; Smally, R.E. C60: Buckminsterfullerene. Nature 1985, 318, 162–163. [Google Scholar] [CrossRef]
  12. Krätschmer, W.; Lamb, L.D.; Fostiropoulos, K.; Huffman, D.R. Solid C60: A new form of carbon. Nature 1990, 347, 354–358. [Google Scholar] [CrossRef]
  13. Guo, T.; Jin, C.; Smalley, R.E. Doping Bucky: Formation and Properties of Boron-Doped Buckminsterfullerene. J. Phys. Chem. 1991, 95, 4948–4950. [Google Scholar] [CrossRef]
  14. Chen, Z.; Zhao, X.; Tang, A. Theoretical Studies of the Substitution Patterns in Heterofullerenes C60−xNx and C60−xBx (x = 2–8). J. Phys. Chem. A 1999, 103, 10961–10968. [Google Scholar] [CrossRef]
  15. Chen, Z.; Reuther, U.; Hirsch, A.; Thiel, W. Theoretical Studies on the Substitution Patterns in Heterofullerenes C70−xNx and C70−xBx (x = 2–10). J. Phys. Chem. A 2001, 105, 8105–8110. [Google Scholar] [CrossRef]
  16. Garg, I.; Sharma, H.; Dharamvir, K.; Jindal, V.K. Substitutional Patterns in Boron Doped Heterofullerenes C60−nBn (n = 1–12). J. Comput. Theor. Nanosci. 2011, 8, 642–655. [Google Scholar] [CrossRef]
  17. Cheng, Y.-H.; Liao, J.-H.; Zhao, Y.-J.; Yang, X.-B. An extended cluster expansion for ground states of heterofullerenes. Sci. Rep. 2017, 7, 16211. [Google Scholar] [CrossRef] [PubMed]
  18. Llamparth, I.; Nuber, B.; Schick, G.; Skiebe, A.; Grösser, T.; Hirsch, A. C59N+ and C59N+: Isoelectronic Heteroanalogues of C60 and C70. Angew. Chem. Int. Ed. Engl. 1995, 34, 2257. [Google Scholar] [CrossRef]
  19. Otero, G.; Biddau, G.; Sánchez-Sánchez, C.; Caillard, R.; López, M.F.; Rogero, C.; Palomares, F.J.; Cabello, N.; Basanta, M.A.; Ortega, J.; et al. Fullerenes from aromatic precursors by sur-face-catalysed cyclodehydrogenation. Nature 2008, 454, 865. [Google Scholar] [CrossRef]
  20. Hultman, L.; Stafström, S.; Czigány, Z.; Neidhardt, J.; Hellgren, N.; Brunell, I.F.; Suenaga, K.; Cooliex, C. Cross-Linked Nano-onions of Carbon Nitride in the Solid Phase: Existence of a Novel C48N12 Aza-Fullerene. Phys. Rev. Lett. 2001, 87, 225503. [Google Scholar] [CrossRef]
  21. Manaa, M.R.; Sprehn, D.W.; Ichord, H.A. Prediction of Extended Aromaticity for a Novel C48N12 Azafullerene Structure. J. Am. Chem. Soc. 2002, 124, 13990. [Google Scholar] [CrossRef] [PubMed]
  22. Sharma, H.; Garg, I.; Dharamvir, K.; Jindal, V.K. Structural, Electronic, and Vibrational Properties of C60−nNn (n = 1–12). J. Phys. Chem. A 2009, 113, 9002. [Google Scholar] [CrossRef] [PubMed]
  23. Srinivasu, K.; Jena, N.K.; Ghosh, S.K. Electronic structure, stability and non-linear optical properties of aza-fullerenes C60−nNn (n = 1–12). Aip. Adv. 2012, 2, 042111. [Google Scholar] [CrossRef]
  24. Cheng, Y.; Liao, J.; Zhao, Y.; Ni, J.; Yang, X. Theoretical investigations on stable structures of C60−nNn (n = 2–12): Symmetry, model interaction, and global optimization. Carbon 2019, 154, 140–149. [Google Scholar] [CrossRef]
  25. Balasubramanian, K. Enumeration of Chiral and Positional Isomers of Substituted Fullerene Cages (C20–C70). J. Phys. Chem. 1993, 97, 6990–6998. [Google Scholar] [CrossRef]
  26. Karasulu, B.; Leyssale, J.-M.; Rowe, P.; Weber, C.; de Tomas, C. Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials. Carbon 2022, 191, 255–266. [Google Scholar] [CrossRef]
  27. Wang, Y.; Liu, S.; Lile, P.; Norwood, S.; Hernandez, A.; Manna, S.; Mueller, T. Accelerated prediction of atomically precise cluster structures using on-the-fly machine learning. Npj Comput. Mater. 2022, 8, 173. [Google Scholar] [CrossRef]
  28. Tong, Q.; Xue, L.; Lv, J.; Wang, Y.; Ma, Y. Accelerating CALYPSO Structure Prediction by Data-driven Learning of Potential Energy Surface. Faraday Discuss. 2018, 211, 31–43. [Google Scholar] [CrossRef]
  29. Rodríguez-Kessler, P.L.; Muñoz-Castro, A. Structure search for transition metal clusters. Towards a rational understanding of their size-dependent properties. Inorganica Chim. Acta 2024, 574, 122376. [Google Scholar] [CrossRef]
  30. Liu, M.; Han, Y.; Cheng, Y.; Zhao, X.; Zheng, H. Exploring exohedral functionalization of fullerene with automation and Neural Network Potential. Carbon 2023, 213, 118180. [Google Scholar] [CrossRef]
  31. Schütt, K.T.; Sauceda, H.E.; Kindermans, P.-J.; Tkatchenko, A.; Müller, K.-R. SchNet—A deep learning architecture for molecules and materials. J. Chem. Phys. 2018, 148, 241722. [Google Scholar] [CrossRef]
  32. Sai, L.; Fu, L.; Zhao, J. Predicting Binding Energies and Electronic Properties of Boron Nitride Fullerenes Using a Graph Convolutional Network. J. Chem. Inf. Model. 2023, 64, 2645. [Google Scholar] [CrossRef] [PubMed]
  33. Delley, B. From molecules to solids with the DMol3 approach. J. Chem. Phys. 2000, 113, 7756–7764. [Google Scholar] [CrossRef]
  34. Perdew, J.P.; Burke, K.; Ernzerhof, M. Generalized gradient approximation made simple. Phys. Rev. Lett. 1996, 77, 3865–3868. [Google Scholar] [CrossRef] [PubMed]
  35. Pradeep, T.; Vijayakrishnan, V.; Santra, A.K.; Rao, C.N.R. Interaction of Nitrogen with Fullerenes: Nitrogen Derivatives of C60 and C70. J. Phys. Chem. 1991, 95, 10564–10565. [Google Scholar] [CrossRef]
  36. Hedberg, K.; Hedberg, L.; Bethune, D.S. Bond Lengths in Free Molecules of Buckminsterfullerene, C60, from Gas-Phase Electron Diffraction. Science 1991, 254, 410–412. [Google Scholar] [CrossRef]
  37. Kipf, T.N.; Welling, M. Semi-Supervised Classification with Graph Convolutional Networks. In Proceedings of the ICLR, Toulon, France, 24–26 April 2017. [Google Scholar]
  38. Sai, L.; Fu, L.; Du, Q.; Zhao, J. Graph attention network for global search of atomic clusters: A case study of Agn (n = 14−26) clusters. Front. Phys. 2022, 18, 13306. [Google Scholar] [CrossRef]
  39. Manaa, M.R.; Ichord, H.A.; Sprehn, D.W. Predicted molecular structure of novel C48B12. Chem. Phys. Lett. 2003, 378, 449–455. [Google Scholar] [CrossRef]
Figure 1. Ring types of all C-B and C-N pentagons and hexagons. Gray, pink and blue vertices are Carbon, Boron and Nitride atom respectively.
Figure 1. Ring types of all C-B and C-N pentagons and hexagons. Gray, pink and blue vertices are Carbon, Boron and Nitride atom respectively.
Nanomaterials 15 01012 g001
Figure 2. Comparison of test MSEs for four models.
Figure 2. Comparison of test MSEs for four models.
Nanomaterials 15 01012 g002
Figure 3. The distribution of energy difference between DFT calculation and GNN prediction for C49N11.
Figure 3. The distribution of energy difference between DFT calculation and GNN prediction for C49N11.
Nanomaterials 15 01012 g003
Figure 4. Predicted top three and reference structures of C60−nBn (n = 7–12).
Figure 4. Predicted top three and reference structures of C60−nBn (n = 7–12).
Nanomaterials 15 01012 g004
Figure 5. Lowest-energy structures of C60−nNn (n = 7–11) found by GNN (top panel) in comparison with the reported structures [24] (bottom panel). The relative energy is given after each structure.
Figure 5. Lowest-energy structures of C60−nNn (n = 7–11) found by GNN (top panel) in comparison with the reported structures [24] (bottom panel). The relative energy is given after each structure.
Nanomaterials 15 01012 g005
Table 1. Total and filtered number of C60−nNn cages with n = 2–12.
Table 1. Total and filtered number of C60−nNn cages with n = 2–12.
nTotal NumberAdjacent ForbiddenAdjacent Forbidden and Meta-Postion Forbidden of Pentagon
22321
3303257
441903019
545,71826,333
6418,470180,316
73,220,218967,944
821,330,5584,158,712
9123,204,92114,406,8891,667,833
10628,330,62940,549,0922,020,015
112,855,893,755 1,432,314
1211,661,527,055 449,788
Table 2. Constitution of the dataset.
Table 2. Constitution of the dataset.
C60−nXnC57B3C54B6C56N4C54N6C51N9C48N12C52N8C49N11Total
Number25715003019249131751152916100013,510
Table 3. Model parameters and test error measured by mean square error (MSE).
Table 3. Model parameters and test error measured by mean square error (MSE).
ModelFeature Embedding
Layer Size
Graph Aggregate LayerOrigin Graph Info.Element EmbeddingTest MSE (meV)
186,6,6××1.974
286,6,6×1.932
366,6,641.822
488,841.713
Table 4. Energy cutoff and numbers of isomers for DFT calculations of C60−nBn and C60−nNn with n = 4–12.
Table 4. Energy cutoff and numbers of isomers for DFT calculations of C60−nBn and C60−nNn with n = 4–12.
Cluster SystemCutoff (σ = 2.147 meV)Number of IsomersPercentage of Total Isomers
C56B44 σ190.629%
C56N43 σ832.749%
C55B55 σ9953.779%
C55N52 σ12174.622%
C54B64 σ410.023%
C54N63 σ12090.67%
C53B75 σ80.0008%
C53N73.5 σ9910.102%
C52B83.5 σ910.024%
C52N83.5 σ65800.158%
C51B94 σ760.005%
C51N94 σ35880.215%
C50B10σ10.00005%
C50N105 σ1640.008%
C49B1116 σ2940.021%
C49N113 σ13640.095%
C48B128 σ5450.121%
C48N123 σ64601.436%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sai, L.; Du, B.; Fu, L.; Akter, S.; Tang, C.; Zhao, J. Graph Neural Network Determine the Ground State Structures of Boron or Nitride Substitute C60 Fullerenes. Nanomaterials 2025, 15, 1012. https://doi.org/10.3390/nano15131012

AMA Style

Sai L, Du B, Fu L, Akter S, Tang C, Zhao J. Graph Neural Network Determine the Ground State Structures of Boron or Nitride Substitute C60 Fullerenes. Nanomaterials. 2025; 15(13):1012. https://doi.org/10.3390/nano15131012

Chicago/Turabian Style

Sai, Linwei, Beiran Du, Li Fu, Sultana Akter, Chunmei Tang, and Jijun Zhao. 2025. "Graph Neural Network Determine the Ground State Structures of Boron or Nitride Substitute C60 Fullerenes" Nanomaterials 15, no. 13: 1012. https://doi.org/10.3390/nano15131012

APA Style

Sai, L., Du, B., Fu, L., Akter, S., Tang, C., & Zhao, J. (2025). Graph Neural Network Determine the Ground State Structures of Boron or Nitride Substitute C60 Fullerenes. Nanomaterials, 15(13), 1012. https://doi.org/10.3390/nano15131012

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop