Article

Lightweight Machine-Learning Model for Efficient Design of Graphene-Based Microwave Metasurfaces for Versatile Absorption Performance

Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
* Author to whom correspondence should be addressed.
Nanomaterials 2023, 13(2), 329; https://doi.org/10.3390/nano13020329
Submission received: 20 December 2022 / Revised: 5 January 2023 / Accepted: 10 January 2023 / Published: 12 January 2023
(This article belongs to the Special Issue Metasurfaces for Photonic Devices: Theory and Applications)

Abstract

Graphene, as a widely used nanomaterial, has shown great flexibility in designing optically transparent microwave metasurfaces with broadband absorption. However, the design of graphene-based microwave metasurfaces relies on cumbersome parameter sweeping as well as the expertise of researchers. In this paper, we propose a machine-learning network which enables the forward prediction of reflection spectra and the inverse design of versatile microwave absorbers. Techniques such as the normalization of inputs and transposed convolution layers are introduced to make the model lightweight and efficient. In particular, the tunable conductivity of graphene provides a new degree of freedom in the intelligent design of metasurfaces. An inverse design system based on an optimization method is proposed for the versatile design of microwave absorbers. Representative cases are demonstrated, showing very promising performance in satisfying various absorption requirements. The proposed machine-learning network has significant potential for the intelligent design of graphene-based metasurfaces for various microwave applications.

1. Introduction

Metasurfaces, composed of periodic or quasi-periodic two-dimensional (2D) arrays of subwavelength units, have emerged as one of the most thriving types of artificial electromagnetic surfaces, owing to their fascinating and tailorable electromagnetic properties [1,2]. In contrast to traditional bulk metamaterials [3,4,5,6], metasurfaces have extremely small thicknesses, enabling the engineering of electromagnetic waves in phase, amplitude, and polarization through a compact and easily fabricated system and providing great freedom in manipulating light-matter interactions at the subwavelength scale [7,8]. Such promising approaches have proven their feasibility in numerous applications, from basic devices such as holograms [9], electromagnetic absorbers [10], and polarizers [11], to more complex systems for information encryption [12,13], signal processing [14], and intelligent recognition [15].
Microwave absorption is one of the most important applications of metasurfaces and is extremely useful in many engineering contexts [16,17]. Metasurface absorbers [18,19,20] can provide tailorable bandwidth, ultra-thin thickness, and angular robustness compared to conventional microwave-absorbing materials or devices. Combining nanomaterials and metasurfaces provides a brand-new solution for excellent microwave absorption performance with optical transparency [21,22]. Recent advances in the study of 2D materials, particularly graphene, provide a novel viewpoint for the active control of electromagnetic waves over a wide spectrum [23,24]. Graphene possesses remarkable physical properties including monoatomic thickness, optical transparency, and unique electrical tunability attributable to its gapless and symmetric band structure [25,26]. Notably, the electrostatic control of carrier concentration in graphene allows the dynamic manipulation of electromagnetic waves by adjusting graphene’s Fermi energy [27,28]. For example, graphene has been experimentally implemented in microwave absorbers [21,29]. Most recently, Zhang et al. [30] proposed an optically transparent and flexible microwave metasurface absorber based on a patterned graphene sandwich structure, which can achieve dynamic microwave absorption under different bias voltages.
However, the traditional design process of graphene-based metasurfaces, including the design of the graphene pattern, the thickness of the dielectric layer, and the sheet resistance of graphene, depends on the expertise of researchers and time-consuming numerical simulations. Designing such metasurfaces requires considerable domain knowledge to guide iterative simulations that scan multi-dimensional parameter spaces. In recent years, with the development of machine-learning methods and a burst in computation power from GPU acceleration, the concept of artificial intelligence (AI) has been introduced and applied in various research areas, such as image classification [31], natural language processing [32], and wireless communication [33]. Introducing machine-learning technology as a tool to assist the efficient and rapid design of metasurfaces has also become a popular interdisciplinary subject [34,35,36]. Various neural networks have been used for designing metasurfaces, including multilayer perceptrons (MLP) [37], deep neural networks [38], convolutional neural networks (CNN) [39], auto-encoders [40], and generative adversarial networks [41]. Machine learning is also helpful in designing metasurface absorbers: the forward prediction of reflection spectra and the inverse design of microwave absorption metasurfaces can be achieved by building different machine-learning models. For example, a variational autoencoder and covariance matrix-adaptation evolution strategies were utilized to find the optimal absorber in the X band [40]. Recent research on the intelligent design of metasurfaces with machine-learning methods has focused on geometrical design, where the design degrees of freedom are typically a few geometrical parameters or the coding pattern of the resonant structures. The electromagnetic properties of the materials are not considered in these works, since the intrinsic material characteristics are not changeable in traditional designs. However, the tunability of graphene enables a new degree of freedom in intelligent metasurface design: the sheet resistance of patterned graphene can be adjusted by a bias voltage, and it is used as a new design degree of freedom in this paper. Moreover, the machine-learning models used in those studies are large and typically contain huge numbers of trainable parameters, some of which are redundant. A model designed with a suitable number of trainable parameters, which we call a lightweight model, is easier to train and still performs well and efficiently.
In this paper, a lightweight machine-learning model is proposed and trained to predict the absorption spectrum of a graphene-based metasurface in milliseconds, taking the geometrical parameters of the patterned graphene layer and the tunable sheet resistance of graphene as inputs. Transposed convolution layers are adopted in the network to increase the performance of the forward prediction and inverse design systems while reducing the number of trainable parameters. An inverse design system is constructed to give the optimized absorption result within the sampling space once the design requirements are specified. This system combines the knowledge of the trained machine-learning model with an optimization method to achieve quick and efficient design, giving the optimized spectrum, the optimized geometrical parameters, and the sheet resistance of graphene at the same time, within seconds.

2. Method

2.1. Graphene-Based Metasurface Absorber Model

The microwave metasurface absorber studied in this article consists of a patterned graphene sandwich structure [30]. The top layer of the absorber is a graphene sandwich structure built on a polyethylene terephthalate (PET) substrate with a dielectric constant of 3, and a thin ITO ground plane forms the bottom layer. It is worth noting that all of these materials are optically transparent, so that a metasurface made of such materials is also optically transparent. The structure exploits graphene’s dynamically tunable conductivity under different bias voltages and can be used for tunable broadband absorption. The graphene layer is modeled as an infinitesimally thin resistive surface characterized by a sheet resistance R_g given by the well-established Kubo formula [42]. The bias voltage directly changes the sheet resistance of the patterned graphene layer, resulting in dynamic changes in the absorption performance over different frequency ranges. The sheet resistance of the graphene layer can be simplified as [43]:
$$R_g = \frac{1}{\sigma_g} = \frac{\pi \hbar^2 \left( \omega + i 2\Gamma \right)}{i e^2 k_B T} \left[ \frac{E_F}{k_B T} + 2 \ln\!\left( 1 + e^{-E_F/(k_B T)} \right) \right]^{-1} \quad (1)$$
where e is the electron charge, ℏ and k_B are the reduced Planck constant and the Boltzmann constant, respectively, ω is the angular frequency of operation, T is the room temperature, and E_F is the Fermi energy of graphene, which is proportional to the external bias voltage. Γ = 1/(2τ) is the phenomenological scattering rate (τ is the electron-phonon relaxation time). We consider T = 300 K and τ = 0.2 ps. The electromagnetic performance of such a metasurface absorber critically relies on the patterned graphene layer [26,44]. By modifying the geometry of the patterned graphene layer and the sheet resistance of graphene, versatile absorption performances can be obtained. However, the conventional design of metasurface structures relies on massive numerical simulations computing the electromagnetic response of different parameter combinations, which is time-consuming and redundant. In this paper, we utilize neural networks and suitable machine-learning techniques to propose an efficient, user-friendly, and high-performance design system for graphene-based metasurface absorbers.
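For readers who wish to evaluate Equation (1) directly, the following is a minimal numerical sketch (not the authors' code) using the values stated above (T = 300 K, τ = 0.2 ps); the example Fermi energy of 0.2 eV is an assumption for illustration only.

```python
# Minimal sketch: evaluate the simplified sheet resistance R_g = 1/sigma_g of Eq. (1).
# The Fermi energy used in the example call is an assumed illustrative value.
import numpy as np

e    = 1.602176634e-19   # electron charge (C)
hbar = 1.054571817e-34   # reduced Planck constant (J*s)
kB   = 1.380649e-23      # Boltzmann constant (J/K)

def graphene_sheet_resistance(freq_hz, E_F_eV, T=300.0, tau=0.2e-12):
    """Complex sheet impedance (ohm/sq) of the graphene layer from Eq. (1)."""
    omega = 2.0 * np.pi * freq_hz
    Gamma = 1.0 / (2.0 * tau)                 # phenomenological scattering rate
    E_F = E_F_eV * e                          # Fermi energy in joules
    bracket = E_F / (kB * T) + 2.0 * np.log(1.0 + np.exp(-E_F / (kB * T)))
    sigma_g = 1j * e**2 * kB * T / (np.pi * hbar**2 * (omega + 1j * 2.0 * Gamma)) * bracket
    return 1.0 / sigma_g

# At microwave frequencies the result is nearly real and almost frequency-flat;
# this example gives roughly 2.1e2 ohm/sq, within the sampling range of Table 1.
print(abs(graphene_sheet_resistance(10e9, E_F_eV=0.2)))
```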

2.2. Machine-Learning Model

We propose a machine-learning model combining an MLP network and transposed convolution layers to realize the fast prediction of reflection coefficients in the range of 6–20 GHz from a combination of several geometrical parameters. In our work, a combination of 5 parameters is used as the input, and the reflection coefficients are inferred as the output of the machine-learning model. There are 4 geometrical parameters: p, d, l, and h, where p is the period of the meta unit, d is the length of the middle square hole, l is the length of the graphene patch along the y-axis, and h is the thickness of the PET substrate, as shown in Figure 1. The tunable sheet resistance R_g of the graphene layer is the fifth parameter considered in the machine-learning model.
The reflection coefficients are sampled evenly from the results of numerical simulations in CST Microwave Studio for each combination of the 5 parameters. A total of 281 points in the range of 6–20 GHz are used to approximate each spectrum. Therefore, every sample consists of two vectors, one from R^5 and one from R^281.
Samples in the dataset are uniformly distributed over a reasonable region of R^5, which restricts the parameters so that they do not violate the topology and remain physically meaningful. Moreover, all parameters are normalized to [0,1] before being fed into the model, as Equation (2) illustrates:
$$\bar{s} = \frac{s - s_{min}}{s_{max} - s_{min}} \quad (2)$$
where s is the value of any of the 5 parameters, s̄ is its normalized value, and s_max and s_min are the maximum and minimum values of that parameter, respectively. Normalization reduces the impact of numerical differences between the parameters and makes the training process more stable and effective. Thus, the excellent performance of the forward prediction network is guaranteed when taking these 5 normalized parameters as input.
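As a concrete illustration, a minimal normalization sketch is given below; the parameter bounds follow the sampling space listed in Table 1.

```python
# Minimal sketch of the min-max normalization of Eq. (2); bounds follow Table 1.
import numpy as np

#                 R_g (ohm)  d (mm)  l (mm)  p (mm)  h (mm)
s_min = np.array([132.0,     1.0,    5.0,    8.0,    2.0])
s_max = np.array([300.0,     6.0,    11.0,   14.0,   4.0])

def normalize(s):
    """Map a raw parameter vector component-wise onto [0, 1]."""
    return (np.asarray(s, dtype=float) - s_min) / (s_max - s_min)

# Example: the Case 1 parameter combination from Table 3
print(normalize([136.42, 1.0, 6.36, 11.63, 2.6]))
```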
Convolution techniques, as adopted in CNNs, are used in many fields such as digital image and speech processing [45,46]. They combine information from local receptive fields during learning and work as feature-extraction tools. The convolution operation decreases the spatial dimensions and produces an increasingly abstract representation of the input image as we go deeper into the network [47]. Recent research on the inverse design of metasurfaces uses 1D convolution to extract spectral features and build an inverse model that predicts the design parameters directly [48]. Here, we add transposed convolution layers to our forward prediction network to better serve the inverse design system. Transposed convolution is also known as deconvolution, although that term is not strictly appropriate, since deconvolution implies removing the effect of a convolution, which is not what we aim to achieve. It is used as an efficient upsampling tool in modern image semantic segmentation [49,50] and super-resolution algorithms [51]. Deconvolution can also be used to observe the feature-learning behavior of intermediate convolution layers and is mostly used in image processing and pattern recognition. In our work, since the network model is ultimately used for inverse design, transposed convolution layers are used to upsample the reshaped features from the hidden layers and reconstruct the reflective spectrum, which improves the inverse design to a certain extent.
This strategy also helps improve the prediction performance near the boundaries of the input sampling space and reduces the number of trainable parameters of the machine-learning model. That is, by introducing transposed convolution layers we make the model more lightweight and easier to train without performance degradation. The architecture of our deep-learning network is shown in Figure 2. There are two fully connected layers, with 100 neurons and 700 neurons, in the linear block and three 1D transposed convolution layers in the transposed convolution block. Neurons in the linear block are activated by the Leaky ReLU (rectified linear unit) activation function, while those in the transposed convolution block are activated by the ReLU activation function.
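To make the architecture concrete, a minimal PyTorch-style sketch of the FPN is shown below. The 5-100-700 fully connected widths and the Leaky ReLU/ReLU activations follow the text; the reshape size, channel counts, kernel sizes, and strides of the transposed convolution block are illustrative assumptions (the paper's exact settings appear in Figure 2b), chosen here only so that the output length is exactly 281. The resulting parameter count therefore differs from the figure of 82,250 quoted below.

```python
# Sketch of the forward prediction network (FPN); the deconvolution settings are
# assumed, not the authors' exact values, and chosen so the output has 281 points.
import torch
import torch.nn as nn

class FPN(nn.Module):
    def __init__(self):
        super().__init__()
        # Linear block: two fully connected layers with Leaky ReLU activations
        self.linear_block = nn.Sequential(
            nn.Linear(5, 100), nn.LeakyReLU(),
            nn.Linear(100, 700), nn.LeakyReLU(),
        )
        # Transposed-convolution block: upsamples reshaped features to the 281-point spectrum
        self.deconv_block = nn.Sequential(
            nn.ConvTranspose1d(20, 10, kernel_size=3, stride=2, padding=1, output_padding=1),
            nn.ReLU(),
            nn.ConvTranspose1d(10, 5, kernel_size=3, stride=2, padding=1, output_padding=1),
            nn.ReLU(),
            nn.ConvTranspose1d(5, 1, kernel_size=3, stride=2),
            nn.ReLU(),
        )

    def forward(self, x):            # x: (batch, 5) normalized design parameters
        h = self.linear_block(x)     # (batch, 700)
        h = h.view(-1, 20, 35)       # reshape to (batch, channels=20, length=35)
        y = self.deconv_block(h)     # (batch, 1, 281): lengths 35 -> 70 -> 140 -> 281
        return y.squeeze(1)          # (batch, 281) reflection spectrum
```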
The total number of trainable parameters in our model is 82,250, which is significantly fewer than in models from recent research on forward prediction of spectra by AI. The training process of this network is the optimization of a loss function. The mean square error (MSE) loss is a simple and suitable loss function for our network, defined as:
$$L_{\mathrm{MSE}} = \frac{1}{2n} \sum_{i}^{n} \left( \hat{y}_i - y_i \right)^2,$$
where ŷ_i denotes the output of the network when the input is x_i, y_i is the corresponding ground truth, and n denotes the number of training samples.
We call this machine-learning model the forward prediction network (FPN); it can perform accurate simulations in place of numerical electromagnetic simulations. The training progress is discussed in Section 3.2. The FPN learns knowledge of electromagnetic theory from data and gradually learns to reproduce the calculation correctly.
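A compact training-loop sketch for the FPN defined above is given here; it uses the Adam optimizer, the 3 × 10⁻⁴ learning rate, and the MSE loss listed in Table 2, while the batch size, epoch count, and placeholder data are assumptions (the learning-rate decay entry of Table 2 is omitted for brevity).

```python
# Hedged training sketch for the FPN sketched above; batch size, epoch count, and the
# random placeholder tensors are assumptions. In practice x/y come from the CST dataset.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

x_data = torch.rand(7000, 5)     # placeholder for the normalized parameter combinations
y_data = torch.rand(7000, 281)   # placeholder for the simulated reflection spectra
train_loader = DataLoader(TensorDataset(x_data, y_data), batch_size=64, shuffle=True)

model = FPN()                                                # network from Section 2.2
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)    # Table 2: Adam, lr = 3e-4
loss_fn = nn.MSELoss()                                       # Table 2: MSE loss

for epoch in range(100):          # the paper reports training to high accuracy in ~25 min
    for x_batch, y_batch in train_loader:
        optimizer.zero_grad()
        loss = loss_fn(model(x_batch), y_batch)
        loss.backward()
        optimizer.step()
```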

2.3. Inverse Design System

For inverse design, the FPN model can be seen as a black-box function. That is, the trained FPN model is used as a function F(x), defined as:
$$x \xrightarrow{\;F\;} y, \qquad x \in \mathbb{R}^5, \quad y \in \mathbb{R}^{281}.$$
Obviously, F is continuous for x ∈ X, where the input space is 5-dimensional and the output space is 281-dimensional. The partial derivatives ∂y_i/∂x_j exist and can be calculated for every i = 1, 2, …, 281 and j = 1, 2, …, 5; thus, F is first-order differentiable. F(x) is an abstract function, unlike an ordinary sine or polynomial function that can be expressed mathematically in closed form. We only need to know the input and output, while the relationship between them is learned during the machine-learning training process, which is also called the “knowledge” of the AI. We can utilize this “knowledge” for further research without needing to understand its inner workings. The Jacobian matrix of F(x̄) for x̄ ∈ X can easily be calculated numerically by back propagation using a machine-learning toolbox. Since we need to design a specific absorber, we have pre-defined requirements on the absorption spectrum, which can equivalently be expressed on the reflective spectrum, since the transmitted wave is negligible in our design. In absorber design, we typically want the absorption band to be as wide as possible or the absorption to be as strong as possible. Based on such requirements, we can set optimization goals on the reflective spectrum. The optimization goal L can be set as:
$$L = \sum_{i \in S} \left( w_i F(x)_i \right)^2,$$
where S denotes the set of spectral points within the designed absorption frequency range, and F(x)_i represents the i-th element of the output vector y = F(x). Here, w_i is the optimization weight of F(x)_i, which can be used to fine-tune the optimization results. The optimization problem can be stated as:
$$\min_{x} \; \sum_{i \in S} \left( w_i F(x)_i \right)^2 \qquad \mathrm{s.t.} \quad x \in X,$$
where X is the input space of our FPN. Therefore, fulfilling the absorber design requirements turns into solving a constrained optimization of a first-order differentiable continuous function. Selecting a conventional optimization algorithm, such as steepest descent, conjugate gradient, the Lagrange multiplier method, or the penalty function method, makes our inverse design system work. Since the Jacobian matrix of F(x) can be obtained directly from the machine-learning toolbox, the optimization can also be performed effectively within the machine-learning framework. The absorption in the frequency range S reaches the optimized state by minimizing L. The working mechanism is shown in Figure 3, where ∇_x L(x) is computed by the chain rule of calculus:
$$\nabla_x L(x) = \begin{bmatrix} \dfrac{\partial L}{\partial x_1} \\ \dfrac{\partial L}{\partial x_2} \\ \vdots \\ \dfrac{\partial L}{\partial x_5} \end{bmatrix} = \begin{bmatrix} \sum_{i=1}^{281} \dfrac{\partial L}{\partial y_i} \cdot \dfrac{\partial y_i}{\partial x_1} \\ \sum_{i=1}^{281} \dfrac{\partial L}{\partial y_i} \cdot \dfrac{\partial y_i}{\partial x_2} \\ \vdots \\ \sum_{i=1}^{281} \dfrac{\partial L}{\partial y_i} \cdot \dfrac{\partial y_i}{\partial x_5} \end{bmatrix}.$$
To implement the inverse design system, we define a new machine-learning model with nearly the same structure as the forward prediction model. Unlike the FPN, the new model takes no external input: its first layer acts as the model input but is trainable. After the first layer, the network architecture is identical to our pre-trained FPN model. We then fix the weights and biases of the new model, except for the first layer, to exactly the values of the pre-trained FPN model, so that these parts act as the black-box function F(x). In the training process, only the parameters of the first layer are optimized by our optimization method, while the other parameters of the new machine-learning model remain unchanged.
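The following is a minimal sketch of this procedure in the same PyTorch style, where the design vector itself (equivalent to the trainable first layer described above) is optimized while the pre-trained FPN is frozen; the target index set S, the weights w_i, the step count, and the learning rate are placeholders to be set from the design requirements, and the Adagrad optimizer is used as an instance of the adaptive subgradient method cited in Figure 3 [52].

```python
# Sketch of the inverse design step: the pre-trained FPN acts as the black-box F(x),
# and only the normalized design vector x is optimized. S, w_i, steps, lr are placeholders.
import torch

def inverse_design(fpn, target_idx, weights, steps=2000, lr=0.05):
    for p in fpn.parameters():
        p.requires_grad_(False)                    # freeze the pre-trained FPN
    x = torch.rand(1, 5, requires_grad=True)       # random initial seed x0 in [0, 1]^5
    optimizer = torch.optim.Adagrad([x], lr=lr)    # adaptive subgradient method [52]
    for _ in range(steps):
        optimizer.zero_grad()
        y = fpn(x)                                 # predicted reflection spectrum (1, 281)
        loss = ((weights * y[0, target_idx]) ** 2).sum()   # L = sum_{i in S} (w_i F(x)_i)^2
        loss.backward()                            # gradients via back propagation (chain rule)
        optimizer.step()
        with torch.no_grad():
            x.clamp_(0.0, 1.0)                     # keep x inside the sampled input space X
    return x.detach()                              # optimized (normalized) design parameters
```

For instance, if the 281 points uniformly span 6–20 GHz (a 0.05 GHz step), targeting 9–14 GHz would correspond to target_idx = list(range(60, 161)) with weights = torch.ones(101).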
In building the inverse design system, we exploit the differentiability of the machine-learning model and utilize it as a black-box function. CST generates data through complex numerical calculations, while the machine-learning model can mine the implicit knowledge contained in the generated data. Therefore, for this graphene-based metasurface absorber, the principles of the numerical calculation can be represented concisely and accurately by machine learning, which is crucial to establishing a fast and effective inverse design system. The insights gained from machine learning have great potential to extend to other nanomaterial applications.

3. Experiments and Results

3.1. Dataset Collection

To train the forward prediction neural network, we first need a dataset that adequately samples the parameter space of our graphene-based metasurface. Simulations for 7000 combinations of parameters are conducted by Matlab-CST co-simulation. The metasurface is first modeled and the simulation conditions are set in the commercial software CST Microwave Studio. In the simulations, periodic boundary conditions are set along the x and y directions, and Floquet port excitation is applied along the z direction. During data generation, parameter combinations are first uniformly sampled in Matlab. The built-in Visual Basic interface of CST is used from Matlab to change the parameters of interest. In each iteration, CST Microwave Studio fetches one parameter combination from Matlab, runs the corresponding numerical simulation, and passes the calculated spectrum back to Matlab. The parameter values are uniformly sampled from a constrained space, which keeps the topology and physical constraints valid. In this way, 7000 pairs of data containing parameter combinations and spectra are generated and organized into the dataset. The sampling process and the parameter ranges are shown in Figure 4 and Table 1.
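A sketch of the parameter-sampling step is given below; the uniform sampling and the bounds follow Table 1, while the CST call is a hypothetical placeholder for the Matlab-CST co-simulation loop described above (no actual CST interface is shown).

```python
# Sketch of the uniform parameter sampling used to build the dataset; bounds follow Table 1.
# run_cst_simulation is a hypothetical stand-in for the Matlab/CST co-simulation step.
import numpy as np

rng = np.random.default_rng(seed=0)
#                R_g (ohm)  d (mm)  l (mm)  p (mm)  h (mm)
low  = np.array([132.0,     1.0,    5.0,    8.0,    2.0])
high = np.array([300.0,     6.0,    11.0,   14.0,   4.0])

samples = rng.uniform(low, high, size=(7000, 5))    # 7000 uniformly sampled combinations

def run_cst_simulation(params):
    """Hypothetical placeholder for the CST Microwave Studio run driven from Matlab;
    it should return the 281 reflection-coefficient points for one parameter set."""
    raise NotImplementedError

# dataset = [(p, run_cst_simulation(p)) for p in samples]   # parameter/spectrum pairs
```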

3.2. Performance of Forward Prediction

The hyperparameters for training the model are shown in Table 2; they are not heavily optimized but achieve our objectives. The training is performed on the online platform Kaggle with GPU acceleration on an Nvidia Tesla P100. This GPU uses the NVIDIA Pascal architecture, which is optimized to support modern deep-learning applications. Figure 5a shows the training and validation losses during the training process for the cases with and without normalization of the input parameters. It can be seen that the training loss with normalization converges to below 10⁻⁵, one order of magnitude lower than that without normalization. The validation loss with normalization also converges to 3 × 10⁻⁵, much lower than the 1.37 × 10⁻⁴ obtained without normalization. Moreover, both the validation and training losses with normalization drop very quickly in the first 1000 epochs, much faster than those without normalization. This comparison shows that normalizing the input parameters speeds up the convergence of the loss and makes the test error significantly lower. The average percentage error for each spectrum is defined as the difference between the prediction of the FPN model and the ground truth from the CST simulation, divided by the latter. Figure 5b shows the average percentage error with and without normalization. It can be seen that the error for each spectrum point is less than 1% after 1000 epochs of training with normalization, which demonstrates the excellent prediction accuracy of our FPN model.
To better demonstrate the performance of our network architecture, we build an MLP model for comparison. The MLP model has two hidden layers, the same as our FPN, followed by a fully connected output layer producing a 281-dimensional vector. The node counts of the four layers (including input and output) are 5, 100, 700, and 281, giving 268,281 trainable parameters in total, about four times that of our FPN model. We also define a measurable criterion to evaluate the performance of different models more intuitively. The distance D refers to the error of one prediction of the model, defined as:
$$D = \left| 10 \log(\hat{y}) - 10 \log(y) \right|.$$
Figure 5c,d show the prediction of the reflective spectrum by the models with and without transposed convolution layers. It is clear that, in some extreme situations, the distance of our FPN remains quite low while the MLP performs poorly at some frequencies.
The model trains quickly, requiring less than 25 min to achieve very promising forward-prediction accuracy. The forward-prediction performance on examples not used for model training is shown in Figure 5e,f. Our FPN can run thousands of simulations in milliseconds, and its results match those of the CST simulations very well. Owing to its concise and lightweight architecture and highly promising prediction precision, our model can be trained quickly and efficiently. Once small datasets are collected in other frequency regimes with the corresponding geometries, the model could also be made valid in the terahertz or infrared regimes by transfer learning, which is worth further investigation.

3.3. Results of Inverse Design System

The absorptivity A(ω) of the proposed metasurface can be calculated by:

$$A(\omega) = 1 - |r(\omega)|^2 - |t(\omega)|^2 \approx 1 - |r(\omega)|^2.$$

Here, r(ω) and t(ω) are the reflection and transmission coefficients, with t(ω) being negligibly small.
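For reference (a standard decibel-to-linear conversion, not stated explicitly in the paper), a reflectance level R_dB corresponds to an absorptivity of

$$A \approx 1 - 10^{R_{\mathrm{dB}}/10}, \qquad \text{e.g.}\quad -10\ \mathrm{dB} \rightarrow 90\%, \quad -20\ \mathrm{dB} \rightarrow 99\%, \quad -50\ \mathrm{dB} \rightarrow 99.999\%,$$

which is consistent with the absorption levels quoted for the cases below.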
As a first example, we target the frequency around 10 GHz and seek the highest possible absorptivity, as shown in Case 1 of Figure 6. A minimum reflectance reaching −50 dB is obtained at 10 GHz, corresponding to a peak absorption of over 99.99%. As a second example, the target absorption range is 9–14 GHz, as shown in Case 2 of Figure 6. After optimization, a wide-band absorption spectrum is obtained, with more than 99% absorptivity across the entire 9–14 GHz band. We can also set two separate target frequencies as the optimization regions to seek a metasurface absorber with dual-band absorption. As shown in Case 3 of Figure 6, nearly 99.9% absorption is achieved at the dual peaks of 10 GHz and 16 GHz. In Case 4 of Figure 6, we show an ultra-wideband absorption optimization, where over 90% absorption is achieved within the 7.55–18.8 GHz band, covering both the X-band and the Ku-band.
Figure 6 shows the reflective spectra predicted by the FPN and the CST simulation results based on the optimized parameter combinations. The corresponding parameter combinations are listed in Table 3, verifying the effectiveness and performance of our machine-learning-model-based inverse design of versatile metasurface absorbers. These cases are chosen from arbitrarily given requirements, which reveals the universal applicability of our inverse design system.
In most situations, dielectric substrates come in standard thicknesses; substrates with non-standard thicknesses are hard to fabricate or costly. If, for example, we fix the thickness h to 3.5 mm, the design system can still give promising results. In Figure 6, Cases 5–8 show the optimization of the remaining four design parameters to achieve the required absorption performance. Most requirements can still be satisfied well. Since the substrate thickness h = 3.5 mm is relatively large and evidently affects the resonance frequency, the absorption bands in Cases 7 and 8 show a slight red shift compared to those in Cases 3 and 4.
On the other hand, since tunable patterned graphene is more difficult to fabricate than graphene with a fixed sheet resistance, it is also meaningful to examine the performance of our inverse design system with the sheet resistance of graphene fixed. Here, assuming the graphene layer has a fixed sheet resistance of 250 Ω, different absorbers can still be designed well. Cases 9–12 in Figure 6 show the inverse design results obtained by optimizing the remaining four geometrical parameters while fixing the sheet resistance of graphene. The absorption performances in Cases 9–12 are very similar to those of Cases 1–4, with little degradation in absorptivity. All these results indicate that our inverse design system has good design flexibility. The satisfactory performance of our model indicates its potential in other nanomaterial applications: for a new material system, a reasonable parameter-sampling space should first be specified by experienced engineers, and then similar training procedures and inverse-design-system building can be applied, since the model is scalable to other material systems.

4. Conclusions

In this work, we proposed a novel machine-learning-model-based inverse-design system for designing graphene-based metasurface absorbers with versatile absorption performance. Transposed convolution layers were introduced in the forward-prediction architecture to reduce the model size while improving performance. With the key parameters of the metasurface normalized as inputs, the forward-prediction model can quickly predict the reflective spectra of the absorbers with high accuracy compared to numerical simulations. Based on the well-trained machine-learning model, we built an inverse-design system to optimize versatile absorption performance. Given the optimization goal for specified absorption frequencies, the system can find the optimized result within the sampling space in seconds. The insights gathered from this paper could help with the intelligent design of other types of graphene-based metasurfaces or devices.

Author Contributions

Conceptualization, W.Z.; methodology, N.C. and C.H.; software, N.C.; investigation, N.C.; data curation, N.C. and C.H.; writing—original draft preparation, N.C.; writing—review and editing, W.Z. and C.H.; visualization, N.C. and C.H.; supervision, W.Z.; project administration, W.Z.; funding acquisition, W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (62071291).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Yu, N.; Genevet, P.; Kats, M.A.; Aieta, F.; Tetienne, J.P.; Capasso, F.; Gaburro, Z. Light Propagation with Phase Discontinuities: Generalized Laws of Reflection and Refraction. Science 2011, 334, 333–337. [Google Scholar] [CrossRef] [Green Version]
  2. Sun, S.; He, Q.; Xiao, S.; Xu, Q.; Li, X.; Zhou, L. Gradient-index meta-surfaces as a bridge linking propagating waves and surface waves. Nat. Mater. 2012, 11, 426–431. [Google Scholar] [CrossRef]
  3. Schurig, D.; Mock, J.J.; Justice, B.J.; Cummer, S.A.; Pendry, J.B.; Starr, A.F.; Smith, D.R. Metamaterial Electromagnetic Cloak at Microwave Frequencies. Science 2006, 314, 977–980. [Google Scholar] [CrossRef] [Green Version]
  4. Kundtz, N.; Smith, D.R. Extreme-angle broadband metamaterial lens. Nat. Mater. 2010, 9, 129–132. [Google Scholar] [CrossRef]
  5. Zheludev, N.I.; Kivshar, Y.S. From metamaterials to metadevices. Nat. Mater. 2012, 11, 917–924. [Google Scholar] [CrossRef]
  6. Vendik, I.; Vendik, O. Metamaterials and their application in microwaves: A review. Tech. Phys. 2013, 58, 1–24. [Google Scholar] [CrossRef]
  7. Akram, M.R.; Ding, G.; Chen, K.; Feng, Y.; Zhu, W. Ultrathin Single Layer Metasurfaces with Ultra-Wideband Operation for Both Transmission and Reflection. Adv. Mater. 2020, 32, 1907308. [Google Scholar] [CrossRef]
  8. Li, Z.; Qi, J.; Hu, W.; Liu, J.; Zhang, J.; Shao, L.; Zhang, C.; Wang, X.; Jin, R.; Zhu, W. Dispersion-Assisted Dual-Phase Hybrid Meta-Mirror for Dual-Band Independent Amplitude and Phase Controls. IEEE Trans. Antenn. Propag. 2022, 70, 7316–7321. [Google Scholar] [CrossRef]
  9. Zheng, G.; Mühlenbernd, H.; Kenney, M.; Li, G.; Zentgraf, T.; Zhang, S. Metasurface holograms reaching 80% efficiency. Nat. Nanotech. 2015, 10, 308–312. [Google Scholar] [CrossRef]
  10. Akselrod, G.M.; Huang, J.; Hoang, T.B.; Bowen, P.T.; Su, L.; Smith, D.R.; Mikkelsen, M.H. Large-Area Metasurface Perfect Absorbers from Visible to Near-Infrared. Adv. Mater. 2015, 27, 8028–8034. [Google Scholar] [CrossRef]
  11. Yu, Y.; Xiao, F.; He, C.; Jin, R.; Zhu, W. Double-arrow metasurface for dual-band and dual-mode polarization conversion. Opt. Express 2020, 28, 11797–11805. [Google Scholar] [CrossRef] [PubMed]
  12. Li, Z.; Premaratne, M.; Zhu, W. Advanced encryption method realized by secret shared phase encoding scheme using a multi-wavelength metasurface. Nanophotonics 2020, 9, 3687–3696. [Google Scholar] [CrossRef]
  13. Li, Z.; Kong, X.; Zhang, J.; Shao, L.; Zhang, D.; Liu, J.; Wang, X.; Zhu, W.; Qiu, C.W. Cryptography Metasurface for One-Time-Pad Encryption and Massive Data Storage. Laser Photonics Rev. 2022, 16, 2200113. [Google Scholar] [CrossRef]
  14. Zhao, H.; Shuang, Y.; Wei, M.; Cui, T.J.; Hougne, P.d.; Li, L. Metasurface-assisted massive backscatter wireless communication with commodity Wi-Fi signals. Nat. Commun. 2020, 11, 3926. [Google Scholar] [CrossRef]
  15. Li, L.; Shuang, Y.; Ma, Q.; Li, H.; Zhao, H.; Wei, M.; Che, L.; Hao, C.; Qiu, C.W.; Cui, T. Intelligent metasurface imager and recognizer. Light-Sci. Appl. 2019, 8, 97. [Google Scholar] [CrossRef] [Green Version]
  16. Zhou, Z.; Chen, K.; Zhao, J.; Chen, P.; Jiang, T.; Zhu, B.; Feng, Y.; Li, Y. Metasurface Salisbury screen: Achieving ultra-wideband microwave absorption. Opt. Express 2017, 25, 30241–30252. [Google Scholar] [CrossRef]
  17. Zhou, Z.; Chen, K.; Zhu, B.; Zhao, J.; Feng, Y.; Li, Y. Ultra-Wideband Microwave Absorption by Design and Optimization of Metasurface Salisbury Screen. IEEE Access 2018, 6, 26843–26853. [Google Scholar] [CrossRef]
  18. Guo, W.; Liu, Y.; Han, T. Ultra-broadband infrared metasurface absorber. Opt. Express 2016, 24, 20586–20592. [Google Scholar] [CrossRef]
  19. Alaee, R.; Albooyeh, M.; Rockstuhl, C. Theory of metasurface based perfect absorbers. J. Phys. D Appl. Phys. 2017, 50, 503002. [Google Scholar] [CrossRef] [Green Version]
  20. To, N.; Juodkazis, S.; Nishijima, Y. Detailed Experiment-theory comparison of mid-infrared metasurface perfect absorbers. Micromachines 2020, 11, 409. [Google Scholar] [CrossRef]
  21. Lu, W.B.; Wang, J.W.; Zhang, J.; Liu, Z.G.; Chen, H.; Song, W.J.; Jiang, Z.H. Flexible and optically transparent microwave absorber with wide bandwidth based on graphene. Carbon 2019, 152, 70–76. [Google Scholar] [CrossRef]
  22. Jang, T.; Youn, H.; Shin, Y.J.; Guo, L.J. Transparent and Flexible Polarization-Independent Microwave Broadband Absorber. ACS Photonics 2014, 1, 279–284. [Google Scholar] [CrossRef]
  23. Zeng, S.; Sreekanth, K.V.; Shang, J.; Yu, T.; Chen, C.K.; Yin, F.; Baillargeat, D.; Coquet, P.; Ho, H.P.; Kabashin, A.V.; et al. Graphene–Gold Metasurface Architectures for Ultrasensitive Plasmonic Biosensing. Adv. Mater. 2015, 27, 6163–6169. [Google Scholar] [CrossRef]
  24. Shi, S.F.; Zeng, B.; Han, H.L.; Hong, X.; Tsai, H.Z.; Jung, H.S.; Zettl, A.; Crommie, M.F.; Wang, F. Optimizing Broadband Terahertz Modulation with Hybrid Graphene/Metasurface Structures. Nano Lett. 2015, 15, 372–377. [Google Scholar] [CrossRef] [Green Version]
  25. Zhang, J.; Wei, X.; Rukhlenko, I.D.; Chen, H.T.; Zhu, W. Electrically Tunable Metasurface with Independent Frequency and Amplitude Modulations. ACS Photonics 2020, 7, 265–271. [Google Scholar] [CrossRef]
  26. Balci, O.; Polat, E.O.; Kakenov, N.; Kocabas, C. Graphene-enabled electrically switchable radar-absorbing surfaces. Nat. Commun. 2015, 6, 6628. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Zhang, J.; Zhang, H.; Yang, W.; Chen, K.; Wei, X.; Feng, Y.; Jin, R.; Zhu, W. Dynamic Scattering Steering with Graphene-Based Coding Metamirror. Adv. Opt. Mater. 2020, 8, 2000683. [Google Scholar] [CrossRef]
  28. Balci, O.; Kakenov, N.; Karademir, E.; Balci, S.; Cakmakyapan, S.; Polat, E.O.; Caglayan, H.; Özbay, E.; Kocabas, C. Electrically switchable metadevices via graphene. Sci. Adv. 2018, 4, eaao1749. [Google Scholar] [CrossRef] [Green Version]
  29. Grande, M.; Bianco, G.; Vincenti, M.; De Ceglia, D.; Capezzuto, P.; Petruzzelli, V.; Scalora, M.; Bruno, G.; D’Orazio, A. Optically transparent microwave screens based on engineered graphene layers. Opt. Express 2016, 24, 22788–22795. [Google Scholar] [CrossRef] [Green Version]
  30. Zhang, J.; Li, Z.; Shao, L.; Zhu, W. Dynamical absorption manipulation in a graphene-based optically transparent and flexible metasurface. Carbon 2021, 176, 374–382. [Google Scholar] [CrossRef]
  31. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef] [Green Version]
  32. Hirschberg, J.; Manning, C.D. Advances in natural language processing. Science 2015, 349, 261–266. [Google Scholar] [CrossRef] [PubMed]
  33. Hu, X.; Liu, Z.; Yu, X.; Zhao, Y.; Chen, W.; Hu, B.; Du, X.; Li, X.; Helaoui, M.; Wang, W.; et al. Convolutional Neural Network for Behavioral Modeling and Predistortion of Wideband Power Amplifiers. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 3923–3937. [Google Scholar] [CrossRef]
  34. Peurifoy, J.; Shen, Y.; Jing, L.; Yang, Y.; Cano-Renteria, F.; DeLacy, B.G.; Joannopoulos, J.D.; Tegmark, M.; Soljačić, M. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 2018, 4, eaar4206. [Google Scholar] [CrossRef] [Green Version]
  35. Lin, R.; Zhai, Y.; Xiong, C.; Li, X. Inverse design of plasmonic metasurfaces by convolutional neural network. Opt. Lett. 2020, 45, 1362–1365. [Google Scholar] [CrossRef] [PubMed]
  36. Liu, Z.; Zhu, D.; Rodrigues, S.P.; Lee, K.T.; Cai, W. Generative Model for the Inverse Design of Metasurfaces. Nano Lett. 2018, 18, 6570–6576. [Google Scholar] [CrossRef] [Green Version]
  37. Chen, Y.; Zhu, J.; Xie, Y.; Feng, N.; Liu, Q.H. Smart inverse design of graphene-based photonic metamaterials by an adaptive artificial neural network. Nanoscale 2019, 11, 9749–9755. [Google Scholar] [CrossRef]
  38. Liu, D.; Tan, Y.; Khoram, E.; Yu, Z. Training Deep Neural Networks for the Inverse Design of Nanophotonic Structures. ACS Photonics 2018, 5, 1365–1369. [Google Scholar] [CrossRef] [Green Version]
  39. Liu, L.; Xie, L.X.; Huang, W.; Zhang, X.J.; Lu, M.H.; Chen, Y.F. Broadband acoustic absorbing metamaterial via deep learning approach. Appl. Phys. Lett. 2022, 120, 251701. [Google Scholar] [CrossRef]
  40. On, H.I.; Jeong, L.; Jung, M.; Kang, D.J.; Park, J.H.; Lee, H.J. Optimal design of microwave absorber using novel variational autoencoder from a latent space search strategy. Mater. Des. 2021, 212, 110266. [Google Scholar] [CrossRef]
  41. Ma, W.; Cheng, F.; Xu, Y.; Wen, Q.; Liu, Y. Probabilistic Representation and Inverse Design of Metamaterials Based on a Deep Generative Model with Semi-Supervised Learning Strategy. Adv. Mater. 2019, 31, 1901111. [Google Scholar] [CrossRef] [Green Version]
  42. Hanson, G.W. Dyadic Green’s functions and guided surface waves for a surface conductivity model of graphene. J. Appl. Phys. 2008, 103, 064302. [Google Scholar] [CrossRef] [Green Version]
  43. Quader, S.; Zhang, J.; Akram, M.R.; Zhu, W. Graphene-Based High-Efficiency Broadband Tunable Linear-to-Circular Polarization Converter for Terahertz Waves. IEEE J. Sel. Top. Quantum Electron. 2020, 26, 4501008. [Google Scholar] [CrossRef]
  44. Zhang, J.; Wei, X.; Premaratne, M.; Zhu, W. Experimental demonstration of an electrically tunable broadband coherent perfect absorber based on a graphene-electrolyte-graphene sandwich structure. Photon. Res. 2019, 7, 868–874. [Google Scholar] [CrossRef]
  45. Kong, Q.; Cao, Y.; Iqbal, T.; Wang, Y.; Wang, W.; Plumbley, M.D. PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE/ACM Trans. Audio Speech Lang. Process. 2020, 28, 2880–2894. [Google Scholar] [CrossRef]
  46. Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J.; et al. Recent advances in convolutional neural networks. Pattern Recogn. 2018, 77, 354–377. [Google Scholar] [CrossRef] [Green Version]
  47. Zhou, D.X. Theory of deep convolutional neural networks: Downsampling. Neural Netw. 2020, 124, 319–327. [Google Scholar] [CrossRef]
  48. Liao, X.; Gui, L.; Yu, Z.; Zhang, T.; Xu, K. Deep learning for the design of 3D chiral plasmonic metasurfaces. Opt. Mater. Express 2022, 12, 758–771. [Google Scholar] [CrossRef]
  49. Fu, J.; Liu, J.; Li, Y.; Bao, Y.; Yan, W.; Fang, Z.; Lu, H. Contextual deconvolution network for semantic segmentation. Pattern Recogn. 2020, 101, 107152. [Google Scholar] [CrossRef]
  50. Minaee, S.; Boykov, Y.; Porikli, F.; Plaza, A.; Kehtarnavaz, N.; Terzopoulos, D. Image Segmentation Using Deep Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 3523–3542. [Google Scholar] [CrossRef]
  51. Yang, X.; Mei, H.; Zhang, J.; Xu, K.; Yin, B.; Zhang, Q.; Wei, X. DRFN: Deep Recurrent Fusion Network for Single-Image Super-Resolution With Large Factors. IEEE Trans. Multimed. 2019, 21, 328–337. [Google Scholar] [CrossRef] [Green Version]
  52. Duchi, J.; Hazan, E.; Singer, Y. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. J. Mach. Learn. Res. 2011, 12, 2121–2159. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of forward prediction and inverse design of graphene-based microwave absorbers using machine-learning model.
Figure 2. Illustration of the forward prediction machine-learning network. (a) The five design parameters are normalized into a 5-dimensional vector input for the machine-learning network. The output corresponds to 281 sample points of the reflective spectrum in 6–20 GHz. The network consists of two fully connected layers and a transposed convolution block. (b) The working mechanism of the transposed convolution block, considering the batch size in the training process, with three transposed convolution layers. Here, n_b is the batch size. The kernel size of each layer and the feature-map evolution are shown. This block eventually turns the information from the second fully connected layer into a 281-dimensional output which reproduces the reflective spectrum.
Figure 3. Illustration of the inverse design system optimization process. Before training begins, an initial seed x_0 is generated randomly. In iterative step k, the loss L and its gradient with respect to x, g_k = ∇_x L(x), are first computed through the pre-trained model. ĝ_k is derived by the adaptive subgradient method [52] from g_k over all iterations before the k-th to determine the descending direction d_k, which has the same dimension as x. λ is the descending step size of each iteration, a scalar in (0, 1). x is updated with the iterative paradigm. After a fixed number of iterative steps, the optimized result is obtained. Combinations of optimized parameters for any design requirement can be obtained within seconds.
Figure 4. Illustration of dataset generation. S1, S2, S3, S4 are randomly selected sample spectra from dataset.
Figure 5. (a) Training and validation loss during the training process with and without normalization. (b) Average percentage error over the 281 spectrum points during the training process. (c,d) Prediction performance with and without transposed convolution layers in two extreme situations where the parameters are at the boundaries of the sampling space. (e,f) Two examples of the prediction performance of our FPN model.
Figure 6. Versatile absorber designs based on different requirements (the reflectance is shown in logarithmic coordinates). Cases 1–4: inverse design over all five degrees of freedom. Cases 5–8: inverse design with the substrate thickness h fixed at 3.5 mm. Cases 9–12: inverse design with the sheet resistance of graphene R_g fixed at 250 Ω. The colored area is the optimization region. FPN indicates the optimized reflection spectra given by the inverse design system, while CST represents the reflection spectra from CST simulations with the optimized parameter combinations.
Table 1. Design parameter sampling space.

Design Parameter     Start    End
R_g (Ω)              132      300
d (mm)               1        6
l (mm)               5        11
p (mm)               8        14
h (mm)               2        4
Table 2. Hyperparameters in model training.

Hyperparameter          Value
Learning rate           3 × 10⁻⁴
Optimization method     Adam
Learning-rate decay     5 × 10⁻⁶
Loss function           MSELoss
Table 3. Parameter combinations for the cases in Figure 6.

Case       R_g (Ω)    d (mm)    l (mm)    p (mm)    h (mm)
Case 1     136.42     1         6.36      11.63     2.6
Case 2     149.48     5.29      7.23      14        3.57
Case 3     153.8      3.43      6.49      12.26     3.14
Case 4     132        3.73      7         10.62     2.91
Case 5     171.48     4.42      7.58      13.17     3.5
Case 6     154.39     4.85      7.04      14        3.5
Case 7     179.47     2.38      7.35      13.18     3.5
Case 8     132        4.75      7.58      11.38     3.5
Case 9     250        1         6.28      10.43     3.16
Case 10    250        2.3       7.3       14        3.41
Case 11    250        1         6.86      12.5      3.16
Case 12    250        1.11      7.72      10.84     2.83
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
