Abstract
The so-called material distribution methods for topology optimization cast the governing equation as an extended or fictitious domain problem, in which a coefficient field represents the design. In practice, the finite element method is typically used to approximate that kind of governing equations by using a large number of elements to discretize the design domain, and an element-wise constant function approximates the coefficient field in that domain. This paper presents a spectral analysis of the coefficient matrices associated with the linear systems stemming from the finite element discretization of a linearly elastic problem for an arbitrary coefficient field in three spatial dimensions. The given theoretical analysis is used for designing and studying an optimal multigrid method in the sense that the (arithmetic) cost for solving the problem, up to a fixed desired accuracy, is linear in the corresponding matrix size. Few selected numerical examples are presented and discussed in connection with the theoretical findings.
1. Introduction
In our previous paper [], we applied the theory of generalized locally Toeplitz (GLT) sequences to compute and analyze the asymptotic spectral distribution of the sequence of stiffness matrices , with being the finite element (FE) approximation of the considered one spatial dimension topology optimization problem, for a given fineness parameter associated to n. In a later contribution [], we extended the analysis to the two-dimensional setting using so-called multilevel block GLT sequences. In this paper, we further expand the theory to cover the three-dimensional case.
Since the first material distribution method for topology design was introduced in the late 1980s [], topology optimization [,], a well-known computational tool for finding the optimal distribution of material within a given design domain, has been studied extensively. The material distribution topology optimization has contributed to the development of several areas, such as electromagnetic [,,], fluid–structure interaction [,], acoustics [,], additive manufacturing [], and especially (non-)linear elasticity [,,]. For problems in linear elasticity, which motivates the study in this paper, the most common method to solve this type of problem is the so-called density-based or material distribution approach. In this approach, a so-called material indicator function —typically referred to as the density or the physical design—models the presence/absence of material; where material is present, else . However, the binary design problem is computationally intractable. A standard approach to make the problem computationally feasible is employing a combination of relaxation, penalization, and filtering, in which the physical density is defined as by , which is a constant, the penalization operator is g, and the filtering procedure is . In this approach, the finite element method (FEM) is the standard choice for generating numerical solutions of a linearly elastic boundary value problem. In practice, the physical design is typically represented as an element-wise constant function. Moreover, a combination of allowing intermediate values of the design, filtering, and penalization has been crucial for the success of these methods. In this paper, we limit our attention to studying the linear system that stems from the FE discretization of the governing equation and its solution. From a high performance computing perspective, this limitation is natural since the computational effort in solving topology optimization problems is dominated by the cost of solving the discretized governing equations—a fact that is particularly prominent when studying three-dimensional problems. Over the last decades, there has been significant work in improving the efficiency of topology optimization, aiming at solving large-scale design problems [,,,,]. Although our study is directly motivated by the material-distribution or density-based approach to topology optimization, the work is relevant for other design descriptions, such as level sets [,] or moving morphable components [,,,], provided that the physics description uses the so-called ersatz material approach, which in some cases can be justified rigorously [].
The theory of multilevel block GLT sequences [,] is a generalization of the GLT sequences theory [,] that is typically used for computing/analyzing the spectral distribution of matrix sequences arising from, for example, the numerical discretization, such as the FE approximation of partial differential equations (PDEs) with proper boundary conditions. In the considered matrix sequences, the size of the given linear systems increases with n, in which tends to infinity as . In this paper, the entire sequence of linear systems with increasing size arising from the three spatial dimension problems is the primary consideration. Under mild conditions, essentially relying on regularity assumptions on the meshes, we show that the sequence of discretization matrices has an asymptotic spectral distribution. By leveraging the theory of multilevel block GLT sequences, we now extend our previous works [,] further to perform a detailed spectral analysis of the linear systems associated with the FE discretization of the governing equation in the three-dimensional setting. Similar to our previous work [], we also make use of the information obtained from the spectral symbol f to design a fast, multigrid solver in three dimensions for optimizing the (arithmetic) cost to solve the related linear systems up to a fixed desired accuracy, which is proportional to the matrix–vector cost, which is linear in the corresponding matrix size. This solver is also verified and comes up with very satisfying numerical results, in terms of the linear cost and number of iterations, which are bounded independently of the matrix size and mildly depending on the desired accuracy.
In the following sections, we will go into detail on the problem description, spectral analysis, multigrid method, and a brief conclusion. The description of the continuous problem and the resulting coefficient matrices arising from our FE approximation is delivered in Section 2 and Section 3 is devoted to the spectral analysis of the FE matrices from the perspective of the GLT theory. In Section 4, a brief account of multigrid methods with special attention to the block case encountered in the present context is given, and the spectral information is a core component in the development of the multigrid proposal for our specific 3D setting presented in Section 5. Eventually, the conclusions are reported in Section 6, and some relevant model information is explained in Appendix A and Appendix B.
2. Problem Description
We consider a linearly elastic structure that occupies (part of) the hyper-rectangular domain . In particular, we are interested in the setting used in material distribution based topology optimization, where a function, typically denoted as the physical density, , describes the layout of the unloaded structure. We assume that the structure is clamped along the boundary portion . Moreover, we let be a given body load (a volume force) in , be the surface traction acting on the non-clamped boundary of the solid, and u denote the resulting equilibrium displacement, which solves the following problem.
where the set of all kinematically admissible displacements of the structure is
The energy bilinear form a and the load linear form ℓ are defined as
where is the strain tensor of u. The colon “:” denotes the full contraction between the two tensors; when using the standard basis, the full contraction of the two matrices is their Frobenius scalar product. is a constant fourth-order elasticity tensor. In this paper, we study an FE discretization of problem (1), in which the physical density is approximated by an element-wise constant function, which is typical in material distribution based topology optimization. The domain is discretized into n trilinear hexahedral elements and then applying FE approximation, the variational problem (1) is reduced to the linear system
where and are the nodal displacement and load vector, respectively, and is the stiffness matrix of element-wise constant physical density function —the entries of the vector are the element values of ; that is, is the value of in the ith element in the FE mesh. The stiffness matrix is typically assembled by looping over each element so that
where is the element stiffness matrix. We emphasize that the formal expression of the relevant matrices is a key ingredient for applying the multilevel block GLT theory to produce a global spectral description of the matrix sequences under consideration. More precisely, the non-zero blocks of three-dimensional element stiffness matrix can be expressed as
where each block has size and is associated with one node on a so-called reference element, is Young’s modulus, and is Poisson’s ratio. More details are provided in Appendix A for the explicit expressions for the element stiffness matrix and in Appendix B for the stress–strain relation and various bounds. With regard to Equation (2), the lower part of the matrix is not explicitly given because the matrix is globally real symmetric and so is every block , whose expressions are reported in Appendix A, Equation (A1).
3. Spectral Analysis
The current section is devoted to the spectral analysis of the FE coefficient matrices derived in the previous section and is complemented by a selection of numerical tests that confirm the theoretical analysis (more numerical experiments have been performed but here we report only a selection for the sake of brevity). We limit our focus to isotropic materials; for more information about various bounds, see the discussion in Appendix B. From a mathematics perspective, this means that we limit our attention to when the Poisson’s ratio is in the range . In particular, Section 3.1 contains the minimal set of preliminary concepts and tools, while Section 3.2 and Section 3.3 are focused on the specific study in 3D in the constant and variable coefficient cases, respectively.
3.1. Premises
The premises include the formal definition of the eigenvalue and singular value distribution, the notion of multi-indexing, the concepts of multilevel block Toeplitz matrices, multilevel block sampling matrices, and multilevel block GLT matrix sequences.
3.1.1. Singular Value/Eigenvalue Distributions
We first give the formal definitions, and then we briefly discuss their informal and practical meaning.
Definition 1.
Let be two positive integers. Let be a sequence of matrices, with of size with eigenvalues and singular values . Furthermore, let be a measurable function defined on a set D with , and with denoting the Lebesgue measure on .
- We say that has a (asymptotic) singular value distribution described by f, and we write , ifwith being the singular values of f, each of them being a measurable function non-negative almost everywhere.
- We say that has a (asymptotic) spectral (or eigenvalue) distribution described by f, and we write , ifwith being the eigenvalues of f, each of them being a complex-valued measurable function.
If has both a singular value and an eigenvalue distribution described by f, we write . (In practice, in the Toeplitz setting and often in the GLT setting, the parameter r can be read at a matrix level as the size of the elementary blocks which form the global matrix , as it will be clear both in our stiffness matrices in Section 3.2 and Section 3.3, and in the block Toeplitz/block diagonal sampling structures in Section 3.1.2.)
The symbol f contains spectral/singular value information briefly described informally as follows. With reference to relation (4), the eigenvalues of are partitioned into r subsets of the same cardinality, except possibly for a small number of outliers, such that the ith subset is approximately formed by the samples of over a uniform grid in D, . Thus, provided that n is large enough, the symbol f provides a ’compact’ and quite accurate description of the spectrum of the matrices . Similarly, relation (3) has the same meaning when talking of the singular values of and by replacing with , .
3.1.2. Multilevel Block Toeplitz Matrices, Multilevel Block Diagonal Sampling Matrices, and Multilevel Block GLT Sequences
The present section is specifically devoted to matrix-theoretic notations and definitions, which are essential when dealing with multi-dimensional problems and with the related sequences of matrices.
Definition 2.
A multi-index , also called a d-index, is a (row) vector in , whose components are denoted by .
- are the vectors of all zeros, all ones, all twos, … (their size will be clear from the context).
- For any d-index , we set and we write to indicate that .
- If are d-indices, means that for all .
- If are d-indices such that , the multi-index range is the set . The standard lexicographic ordering is assumed uniformly
With regard to the previous definition, in the case , the lexicographic ordering is the following: , , , ,, , , . Notice that, in general, a multi-index range , is used with or with .
Definition 3. (Multilevel) Block Toeplitz Matrices).
Given , a matrix of the form
with vector of all ones, with blocks , , is called a multilevel block Toeplitz matrix, or, more precisely, a d-level r-block Toeplitz matrix. Let be a matrix-valued function in which each entry belongs to . We denote the Fourier coefficients of the generating function ϕ as
where the integrals are computed componentwise, , and . For every , the th Toeplitz matrix associated with ϕ is defined as
or, equivalently, as
where ⊗ denotes the (Kronecker) tensor product of matrices, while is the matrix of order m whose entry equals 1 if and zero otherwise. We call the family of (multilevel block) Toeplitz matrices associated with ϕ, which, in turn, is called the generating function of .
Definition 4. (Multilevel) Block Diagonal Sampling Matrices).
For and , we define the block diagonal sampling matrix as the diagonal matrix
For and , we define the multilevel block diagonal sampling matrix as the block diagonal matrix
with the lexicographical ordering discussed at the beginning of Section 3.1.2.
Definition 5. (Zero-Distributed Sequences of Matrices).
According to Definition 1, a sequence of matrices such that
is referred to as a zero-distributed sequence. Note that, for any , is equivalent to (notice that and denote the zero matrix and the identity matrix, respectively).
Proposition 1 provides an important characterization of zero-distributed sequences together with a useful sufficient condition for detecting such sequences. Throughout this paper, we use the natural convention .
Proposition 1.
[] Let be a sequence of matrices, with of size , and let be the standard spectral matrix norm (the one induced by the Euclidean vector norm).
- is zero distributed if and only if with and as .
- is zero distributed if there exists a such that as .
(Multilevel) Block GLT Matrix Sequences. Now, we give a very concise and operational description of the multilevel block GLT sequences, from which it will be clear that the multilevel block Toeplitz structures, the zero-distributed matrix sequences, and the multilevel block diagonal sampling matrices represent the basic building components. All the material is taken from the books [,] and from the papers [,].
Let be fixed positive integers. A multilevel r-block GLT sequence (or simply a GLT sequence if can be inferred from the context or we do not need/want to specify them) is a special r-block matrix sequence equipped with a measurable function , the so-called symbol. We use the notation to indicate that is a GLT sequence with symbol . The symbol of a GLT sequence is unique in the sense that if and then a.e. in . The main properties of r-block GLT sequences proved in [] are listed below. If A is a matrix, we denote by the Moore–Penrose pseudoinverse of A (recall that whenever A is invertible).
- GLT 1
- If then . If moreover each is Hermitian, then .
- GLT 2
- We have the following:
- if is in ;
- if is Riemann integrable;
- if and only if (zero-distributed sequences coincide exactly with the GLT sequences having GLT symbol equal to a.e.).
- GLT 3
- If and , then
- ;
- for all ;
- ;
- provided that is invertible a.e.
3.2. Constant Coefficient Case
In the current section, we report the derivation and the formal expression of the symbol for the matrix sequences in the constant coefficient setting , according to the standard notion of generating function in the Toeplitz theory (see Section 3.1.2 and the paper by Garoni et al. [] for more details) and according to the notion of symbol reported in Definition 1.
3.2.1. Symbol Definition
Recall that the computational domain is a hyper-rectangular domain in 3D, and the boundary of comprises six sides. Let be the stiffness matrix obtained with a FE approximation with proper boundary conditions, Dirichlet ‘D’ on one of ’s sides and Neumann ‘N’ in the other five sides of (and hence the formal notation ), where we have chosen a uniform meshing with n intervals in the direction, n intervals in the direction, and n intervals in the direction. According to the previously considered ordering of the nodes, the matrix is a three-level block tridiagonal structure of size n with two-level tridiagonal blocks of size , with tridiagonal blocks of size , whose elements are small matrices of size 3. We notice that the size is dictated by all the meshpoints, including those in the boundaries when considering Neumann boundary conditions, while only the internal meshpoints are involved when the Dirichlet boundary conditions are enforced.
In accordance with the 2D case [], if all the boundary conditions are of the Dirichlet type, then we obtain a matrix , which is a three-level block Toeplitz structure with elementary blocks of size 3, having global dimension , since all the nodes on the boundaries are not unknowns. More precisely, we obtain
and, for , we have
while, for , the following block tridiagonal matrices are defined:
By reading the entries of , analogously to the process in the 2D setting as in [], we can compute explicitly the related generating function, following the rules given in Definition 3:
where every , is a matrix because three degrees of freedom are associated to each node of the mesh. Each is a sum of the building blocks of the elementary matrix (2). More precisely, in terms of the block matrices , cf. definition (A1) in Appendix A, we can write
Thus, we have
where
Finally, by expanding the s according to expressions (A2), we deduce the formal expression of the generating functions
The following proposition links in a precise way the involved Toeplitz structures and matrices and .
Proposition 2.
Let be the symbol defined in the previous lines; see (5). Let us consider a uniform meshing in all the three directions with n subintervals. Then we have the following relationships:
where Rn has rank of order , with .
If the more general setting is considered with subintervals in the direction, subintervals in the direction, and intervals in the direction, then the analogous is true:
where Rn has a rank proportional to , with .
Proof.
Since the grid points in a cube are , if we consider Dirichlet boundary conditions in a facet orthogonal, for example, to the direction , then the number of points and equations which are affected by the Neumann boundary conditions on the other 5 facets are , where typically if the standard linear FEs are employed. Hence, the rank correction induced by the matrix Rn is proportional, as claimed to . □
In the following section, we prove that the function is the eigenvalue symbol, in the sense of Definition 1, of the matrix sequences and .
3.2.2. Symbol Spectral Analysis in 3D: Distribution, Extremal Eigenvalues, and Conditioning
We study the spectral distribution, extremal eigenvalues, and conditioning of our matrix sequences in 3D. All the results are based on the symbol and on its analytical features.
Proposition 3.
Let be the symbol defined in Section 3.2.1 According to the notation in Proposition 2, all the matrix sequences , , , , are spectrally distributed as in the sense of Definition 1.
Proof.
For the multilevel block Toeplitz sequences , , and refer to Item GLT 2., part 1, and Item GLT 1., part 2 in Section 3.1.2. For the sequences , first observe that the matrix sequence defined in Proposition 2 is zero distributed, thanks to Proposition 1 and Proposition 2 since rank size, as . Then the claim follows thanks to the *-algebra structure of the GLT sequences and more specifically thanks to Item GLT 3., part 2, Item GLT 2., part 1, and Item GLT 1., part 2. □
Now, we identify a few key analytical features of the spectral symbol which are shared by all the matrix sequences mentioned in Proposition 3. Theorem 1 is crucial for the study of the extremal eigenvalues and of the conditioning of the same matrix sequences and, in Section 5, it is the main ingredient for designing ad hoc multigrid solvers when dealing with the associated large linear systems.
Theorem 1.
Let be the symbol defined in (5). The following statements hold true:
- , ;
- All three eigenvalues of have a zero of order 2 at .
Proof.
Claim 1. The function evaluated at equals
with and , whose row-sum according to (A2).
Claim 2. It is just a direct check. Indeed, it is enough to check that the quantities
have a zero of orders two and six, respectively, by considering the Taylor expansion centered at . □
In accordance to what was proven in the 2D setting, the minimal eigenvalue of the symbol has a unique zero of order two at . In fact, the symbol is positive semi-definite, and all the eigenvalues of the symbol have a unique zero of order two at , that is, there exist positive constants , such that
uniformly in a proper neighborhood of .
Therefore, we can infer important information on the extremal eigenvalues and on the conditioning of the corresponding matrix sequences.
Proposition 4.
Let be the symbol defined in (5). Let us consider a uniform meshing in all the directions with n subintervals. Then we have the following relationships:
and
with Rn as in Proposition 2 and denoting the condition number in spectral norm (the one induced by the Euclidean vector norm).
If the more general setting is considered with subintervals in the direction, subintervals in the direction, and subintervals in the direction, then analogous relations are true:
and
with Rn as in Proposition 2.
3.3. Non-Constant Coefficient Case—3D
It should be observed that the natural extension of the previous analysis refers to the case of a non-constant coefficient . According to a standard assembling procedure in FEs, given the triangulation made by all the basic elements , we write the stiffness matrix as
where is the elementary matrix in (2) (possibly properly cut when nodes on the boundary are involved), but widened to size according to the chosen global ordering of nodes. Here, in the case of the Dirichlet boundary conditions and n subintervals in all the directions such that , while in the case of Dirichlet boundary conditions on one-side Neumann boundary conditions in the remaining ones with n subintervals in all the directions so that .
Clearly, the elementary matrix is positive semi-definite for . More precisely, the not zero eigenvalues are
In addition, every elementary matrix, which has been cut due to nodes on the boundary, is positive semi-definite as well since it is a principal submatrix of in (2).
Thus, on the basis of (6) and by applying the Courant–Fisher theorem, we can claim again
with and minimum and maximum of , respectively (see Figure 1 for an upperbound of the maximal eigenvalues of as a function of ).
Figure 1.
Maxima of eigenvalue surfaces of the symbol vs. .
By combining the above inequalities and Proposition 4, we infer that the extremal eigenvalues and the conditioning have the same asymptotical behavior as in the constant coefficient setting.
The whole eigenvalues distribution is sketched below by referring to the basics of the GLT theory [] reported in Section 3.1.2. Let be a multilevel block diagonal sampling matrix according to the notions introduced in Section 3.1.2 and let be the multilevel block Toeplitz matrix if the Dirichlet boundary conditions are used or in the other case. Then the following facts hold:
- Fact 1
- according to Item GLT 2.
- Fact 2
- according to Proposition 2, Proposition 1, and Item GLT 2.
- Fact 3
- according to Item GLT 2.
- Fact 4
- according to Fact 2, Fact 3, and to the *-algebra structure of GLT sequences that is Item GLT 3.
- Fact 5
- given a simple check shows that , using Proposition 1.
- Fact 6
- as a consequence of Fact5, Fact 1, Fact 4, and of the *-algebra structure of GLT sequences that is Item GLT 3.; moreover since the matrix sequence is made up by Hermitian matrices, by Item GLT 1. it follows that .
We stress that a unique notation has been employed for the sake of notational compactness. However, the main statement, Fact 6, holds for all the matrix sequences reported in Proposition 3 in the case of a variable .
Finally, it is worth noticing that non-square domains can be treated as well, using the reduced GLT theory (see pages 398–399 in [], Section 3.1.4 in [], and the recent analysis []): here, we do not go in the details and this will be the subject of future investigations.
4. Two-Grid and Multigrid Methods
In this section, we concisely report few relevant results concerning the convergence theory of algebraic multigrid methods with special attention of the case of multilevel block Toeplitz structures generated by a matrix-valued symbol f.
We start by taking into consideration the generic linear system with large dimension m, where is a Hermitian positive definite matrix and . Let and let be a full-rank matrix for any s. At last, let us denote by a class of stationary iterative methods for given linear systems of dimension .
In accordance with [], the algebraic two-grid method (TGM) can be easily seen as a stationary iterative method whose generic steps are reported below.
| Pre-smoothing iterations | |
| Exact coarse grid correction (CGC) | |
| Post-smoothing iterations | |
Here, we refer to the dimension by means of its subscript s.
In the first and last steps, a pre-smoothing iteration and a post-smoothing iteration are applied times and times, respectively, taking into account the considered stationary iterative method in the class . Furthermore, the intermediate steps define the exact coarse grid correction operator, which is dependent on the considered projector operator . The resulting iteration matrix of the TGM is then defined as
where and represent the pre-smoothing and post-smoothing iteration matrices, respectively, and is the identity matrix at the sth level.
By employing a recursive procedure, the TGM leads to a multigrid method (MGM). Indeed, the standard V-cycle can be expressed in the following way:
| if then | |
| Exact solution | |
| else | |
| Pre-smoothing iterations | |
| Coarse grid correction | |
| Post-smoothing iterations | |
From a computational viewpoint, to reduce the related costs, it is more efficient that the matrices are computed in the so-called setup phase.
According to the previous setting, the global iteration matrix of the MGM is recursively defined as
Lastly, the W-cycle is just a variation of the previous V-cycle considering two recursive calls in the coarse grid correction as follows:
| if then | |
| Exact solution | |
| else | |
| Pre-smoothing iterations | |
| Coarse grid correction | |
| Post-smoothing iterations | |
In the following remark, we emphasize the relevant computational properties of the considered methods.
Remark 1.
The first part of the current remark concerns the computational cost. In the V-cycle, there is just one recursive call, while in the W-cycle, there are two recursive calls, which in principle, is more expensive. Let us analyze the related costs. Since our matrices are sparse and they remain sparse at the lower levels, the number of arithmetic operations in the V-cycle and W-cycle algorithms is of the type
where and . Now in the one-dimensional setting and , while and thus the W-cycle is asymptotically more expensive. Conversely, in a d-dimensional setting with , we have and, as a consequence of the recursive relations, both and are linear in the matrix-size m. To be precise, we have
so that, in the current three-dimensional setting, the bounds of the V-cycle and W-cycle complexity are very close, i.e.,
Another issue to be considered is the case where the discretization matrices appear in checkerboard fashion (see, for example, [] and the references therein). We emphasize that our analysis, which is based on the Toeplitz generating function, is not directly applicable since we lose the Toeplitz character of the approximation matrices when a checkerboard (called also red–black) ordering is used. However, this is just a matter of a similarity transformation by a permutation matrix. Hence, with a careful work and without increasing the complexity analyzed in the previous lines, the algorithm discussed here and the related theoretical analysis can be adapted to the checkerboard context.
Remark 2.
In the relevant literature (see, for instance, []), the convergence analysis of the TGM splits into checking two separate conditions: the smoothing property and the approximation property. Regarding the latter and regarding scalar structured matrices [,], the TGM optimality is given in terms of choosing the proper conditions that the symbol p of a family of projection operators has to fulfill. Indeed, let with , where f is a non-negative trigonometric polynomial. Let be the unique zero of f. Then the TGM optimality applied to is guaranteed if we choose the symbol p of the family of projection operators such that
where, for , the sets and are the following corner and mirror points
respectively. In the general case of , we have
with , so that the cardinality of and is and , respectively.
Informally, for , it means that the optimality of the two-grid method is obtained by choosing the family of projection operators associated to a symbol p such that does not have zeros and is bounded (if we require the optimality of the V-cycle, then the second condition is a bit stronger); see []. For approximation of an elliptic differential operator of order by local methods (e.g., finite differences, FEs, and isogeometric analysis), the previous conditions mean that p has a unique zero of order at least at whenever f has a unique zero at of order .
In our specific block setting, by interpreting the analysis given in [], all the involved symbols are matrix valued, and the conditions which generalize (7) and are sufficient for the TGM convergence and optimality are the following:
- (A)
- Zero of order 2 at all the mirror points of the eigenvalue functions of the symbol of the projector for our matrix sequences having common symbol (mirror point theory [,]);
- (B)
- Positive definiteness of ;
- (C)
- Commutativity of all for varying in the corner points.
Even if the theoretical extension to the V-cycle and W-cycle convergence and optimality is not given, in the subsequent section, we propose specific choices of the projection operators numerically showing how this leads to two-grid, V-cycle, and W-cycle procedures converging optimally or quasi-optimally with respect to all the relevant parameters (size, dimensionality, and polynomial degree k).
Our choices are in agreement with the mathematical conditions set in items (A), (B), and (C). We remark that the violation of condition (C) is discussed by Donatelli et al. [], in their conclusion section.
5. Multigrid Proposals
Based on the theory in Section 4, we define the prolongation operator as
in the case of matrices , and
in the case of matrices . In the expressions above,
Note that equals the same FEM approximation on the coarse mesh in both cases, independently of the boundary conditions.
Independently of the boundary conditions, we observe that the symbol of the prolongation operator is a matrix-valued function as the symbol of the coefficient matrix sequence of the linear systems to be solved: more precisely we define
Since the matrix-valued function is a scalar function times the identity, it automatically follows that the difficult condition (C) is satisfied. Moreover, taking into account Theorem 1, since the scalar function has a zero of order 2 at , also condition (A) holds, while the positive definiteness of is verified as well, so that also condition (B) is met.
Finally, we choose one iteration of Gauss–Seidel as the pre and post smoother. Given the analysis reported in the previous section, we expect that our multigrid method is convergent in an optimal way, that is, with a convergence speed independent of the matrix size. In Table 1, we find a plain confirmation of our theoretical expectations, which holds not only for a constant , but also in the variable coefficient setting as long as remains positive.
Table 1.
Number of iteration for matrices and of increasing dimension , with the numeric precision , respectively.
6. Conclusions
We provided a quite complete spectral analysis of the (large) coefficient matrices associated with the linear systems stemming from the FE discretization of a linearly elastic problem for an arbitrary element-wise constant coefficient field. Our interest in this problem stems from the fact that in the material distribution method for topology optimization, such a problem is solved at each iteration. The solution for these linear systems typically dominates the computational effort required to solve the topology optimization problems. Based on the spectral information, we proposed a specialized multigrid method, which turned out to be optimal in the sense that the (arithmetic) cost for solving the related linear systems, up to a fixed desired accuracy, is proportional to the computational cost of matrix–vector products, which is linear in the corresponding matrix size and mildly depending on the given accuracy. The method was tested, and the preliminary numerical results are very satisfactory and promising, in terms of having a linear cost and a number of iterations that is bounded by a constant independent of the matrix size and only lightly influenced by the desired accuracy.
Finally, we mention future lines of research:
- The present analysis was performed also with variable coefficients, but with the restriction that the domain is Cartesian: this limitation can be easily overcome, but the computations are not trivial using the notion of reduced GLT sequences of matrices (see [] for a recent very complete work and [,] for examples of applications and for the initial idea).
- The use of higher order FEs is another direction to explore: for more standard problems, we refer to [,] whose analysis can be the starting point for adapting the techniques to the present context.
- In the direction of more general approaches, we remind that the GLT machinery was used also in connection with finite differences, finite volumes, isogeometric analysis of high order and intermediate regularity (see [,] and references therein). Hence, we are convinced that such extensions can be formally obtained, also in the case of the current topology optimization problem.
Author Contributions
Conceptualization, Q.K.N., S.S.-C., C.T.-P. and E.W.; Investigation, Q.K.N., S.S.-C., C.T.-P. and E.W.; Methodology, Q.K.N., S.S.-C., C.T.-P. and E.W.; Writing—original draft, Q.K.N., S.S.-C., C.T.-P. and E.W.; Writing—review and editing, Q.K.N., S.S.-C., C.T.-P. and E.W. All authors have read and agreed to the published version of the manuscript.
Funding
This work is financially supported by the Swedish strategic research program eSSENCE and the Italian Institution for High Mathematics (INdAM).
Data Availability Statement
The data in this paper are results of numerical computations and not experimental measurements. The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
Conflicts of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Appendix A. Explicit Expressions for the Element Stiffness Matrix
Recall that for the studied problem, the three-dimensional element stiffness matrix is
where
We stress that all the blocks mentioned before are real symmetric and hence the lower part is defined accordingly; see Section 3.2.1 for the use of these blocks in the definition of the GLT symbol. We observe that the quantity is a multiplicative term that will be neglected in the spectral analysis since it will be simplified and included in the right-hand vector when solving the related large linear systems). An analytical evaluation of the entries of the stiffness matrix is reported in the following lines:
where
Appendix B. Stress–Strain Relation and Various Bounds
In the three-dimensional setting, there are six independent strain components in total at a point in an element, and they are written as a vector
Similarly, corresponding to the six strain components above, there are also six independent stress components written in vector form as
By the generalized Hooke’s law, the most general linear relation among components of the stress and strain tensor can then be written as
where is a matrix that corresponds to the constant fourth-order elasticity tensor . The relationship between stresses and strains is
where is Poisson’s ratio, is Young’s modulus, and the shear modulus G is
The relationships above can be rewritten in matrix form as
In this case, the bulk modulus K can be expressed as
The Young’s (), shear (G), and bulk (K) moduli need to be positive. Thus, Equations (A5) and (A9) imply that the Poisson’s ratio in three dimensions must satisfy .
Remark A1.
Physically, there are no known isotropic materials with a negative Poisson’s ratio. Therefore, from a practical perspective [], we can limit our study to ν, satisfying the inequalities . Furthermore, in some more delicate circumstances, the Poisson ratio lies in the interval , as proved experimentally in view of the elastic properties of real materials [].
References
- Nguyen, Q.K.; Serra-Capizzano, S.; Wadbro, E. On using a zero lower bound on the physical density in material distribution topology optimization. Comput. Methods Appl. Mech. Eng. 2020, 359, 112669. [Google Scholar] [CrossRef]
- Nguyen, Q.K.; Serra-Capizzano, S.; Tablino-Possio, C.; Wadbro, E. Spectral Analysis of the Finite Element Matrices Approximating 2D Linearly Elastic Structures and Multigrid Proposals. Numer. Linear Algebra Appl. 2022, 29, e2433. [Google Scholar] [CrossRef]
- Bendsøe, M.P.; Kikuchi, N. Generating optimal topologies in structural design using a homogenization method. Comput. Methods Appl. Mech. Eng. 1988, 71, 197–224. [Google Scholar] [CrossRef]
- Allarire, G. Shape Optimization by the Homogenization Method; Springer: New York, NY, USA, 2002. [Google Scholar]
- Bendsøe, M.P.; Sigmund, O. Topology Optimization. Theory, Methods, and Applications; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2003. [Google Scholar]
- Elesin, Y.; Lazarov, B.S.; Jensen, J.S.; Sigmund, O. Time domain topology optimization of 3D nanophotonic devices. Photonics Nanostruct. Appl. 2014, 12, 23–33. [Google Scholar] [CrossRef]
- Erentok, A.; Sigmund, O. Topology optimization of sub-wavelength antennas. IEEE Trans. Antennas Propag. 2011, 59, 58–69. [Google Scholar] [CrossRef]
- Wadbro, E.; Engström, C. Topology and shape optimization of plasmonic nano-antennas. Comput. Methods Appl. Mech. Eng. 2015, 293, 155–169. [Google Scholar] [CrossRef]
- Andreasen, C.; Sigmund, O. Topology optimization of fluid–structure-interaction problems in poroelasticity. Comput. Methods Appl. Mech. Eng. 2013, 258, 55–62. [Google Scholar] [CrossRef]
- Yoon, G.H. Topology optimization for stationary fluid–structure interaction problems using a new monolithic formulation. Int. J. Numer. Methods Eng. 2010, 82, 591–616. [Google Scholar] [CrossRef]
- Christiansen, R.E.; Lazarov, B.S.; Jensen, J.S.; Sigmund, O. Creating geometrically robust designs for highly sensitive problems using topology optimization: Acoustic cavity design. Struct. Multidiscip. Optim. 2015, 52, 737–754. [Google Scholar] [CrossRef]
- Kook, J.; Koo, K.; Hyun, J.; Jensen, J.S.; Wang, S. Acoustical topology optimization for Zwicker’s loudness model—Application to noise barriers. Comput. Methods Appl. Mech. Eng. 2012, 237–240, 130–151. [Google Scholar] [CrossRef]
- Wadbro, E.; Niu, B. Multiscale design for additive manufactured structures with solid coating and periodic infill pattern. Comput. Methods Appl. Mech. Eng. 2019, 357, 112605. [Google Scholar] [CrossRef]
- Clausen, A.; Aage, N.; Sigmund, O. Topology optimization of coated structures and material interface problems. Comput. Methods Appl. Mech. Eng. 2015, 290, 524–541. [Google Scholar] [CrossRef]
- Klarbring, A.; Strömberg, N. Topology optimization of hyperelastic bodies including non-zero prescribed displacements. Struct. Multidiscip. Optim. 2013, 47, 37–48. [Google Scholar] [CrossRef]
- Park, J.; Sutradhar, A. A multi-resolution method for 3D multi-material topology optimization. Comput. Methods Appl. Mech. Eng. 2015, 285, 571–586. [Google Scholar] [CrossRef]
- Aage, N.; Andreassen, E.; Lazarov, B.S.; Sigmund, O. Giga-voxel computational morphogenesis for structural design. Nature 2017, 550, 84–86. [Google Scholar] [CrossRef]
- Amir, O.; Stolpe, M.; Sigmund, O. Efficient use of iterative solvers in nested topology optimization. Struct. Multidiscip. Optim. 2010, 42, 55–72. [Google Scholar] [CrossRef]
- Baandrup, M.; Sigmund, O.; Polk, H.; Aage, N. Closing the gap towards super-long suspension bridges using computational morphogenesis. Nat. Commun. 2020, 11, 2735. [Google Scholar] [CrossRef]
- Borrvall, T.; Petersson, J. Large-scale topology optimization in 3D using parallel computing. Comput. Methods Appl. Mech. Eng. 2001, 190, 6201–6229. [Google Scholar] [CrossRef]
- Wadbro, E.; Berggren, M. Megapixel topology optimization using a graphics processing unit. SIAM Rev. 2009, 51, 707–721. [Google Scholar] [CrossRef]
- Allaire, G.; Jouve, F.; Toader, A.M. Structural optimization using sensitivity analysis and a level-set method. J. Comput. Phys. 2004, 194, 363–393. [Google Scholar] [CrossRef]
- Dijk, N.P.; Maute, K.; Langelaar, M.; van Keulen, F. Level-set methods for structural topology optimization: A review. Struct. Multidiscip. Optim. 2013, 48, 437–472. [Google Scholar] [CrossRef]
- Du, Z.; Cui, T.; Liu, C.; Zhang, W.; Guo, Y.; Guo, X. An efficient and easy-to-extend Matlab code of the Moving Morphable Component (MMC) method for three-dimensional topology optimization. Struct. Multidiscip. Optim. 2022, 65, 158. [Google Scholar] [CrossRef]
- Guo, X.; Zhang, W.; Zhong, W. Doing topology optimization explicitly and geometrically—A new moving morphable components based framework. J. Appl. Mech. 2014, 81, 081009. [Google Scholar] [CrossRef]
- Niu, B.; Wadbro, E. On equal-width length-scale control in topology optimization. Struct. Multidiscip. Optim. 2019, 59, 1321–1334. [Google Scholar] [CrossRef]
- Zhang, W.; Yuan, J.; Zhang, J.; Guo, X. A new topology optimization approach based on Moving Morphable Components (MMC) and the ersatz material model. Struct. Multidiscip. Optim. 2016, 53, 1243–1260. [Google Scholar] [CrossRef]
- Barbarino, G.; Garoni, C.; Serra-Capizzano, S. Block Generalized Locally Toeplitz Sequences: Theory and applications in the multidimensional case. Electron. Trans. Numer. Anal. 2020, 53, 113–216. [Google Scholar] [CrossRef]
- Garoni, C.; Mazza, M.; Serra-Capizzano, S. Block Generalized Locally Toeplitz Sequences: From the Theory to the Applications. Axioms 2018, 7, 49. [Google Scholar] [CrossRef]
- Garoni, C.; Serra-Capizzano, S. Generalized Locally Toeplitz Sequences: Theory and Applications; Springer: Cham, Switzerland, 2017; Volume I. [Google Scholar]
- Garoni, C.; Serra-Capizzano, S. Generalized Multilevel Locally Toeplitz Sequences: Theory and Applications; Springer: Cham, Switzerland, 2018; Volume II. [Google Scholar]
- Barbarino, G.; Garoni, C.; Serra-Capizzano, S. Block generalized locally Toeplitz sequences: Theory and applications in the unidimensional case. Electron. Trans. Numer. Anal. 2020, 53, 28–112. [Google Scholar] [CrossRef]
- Serra-Capizzano, S. Generalized locally Toeplitz sequences: Spectral analysis and applications to discretized partial differential equations. Linear Algebra Appl. 2003, 366, 371–402. [Google Scholar] [CrossRef]
- Serra-Capizzano, S. The GLT class as a generalized Fourier analysis and applications. Linear Algebra Appl. 2006, 419, 180–233. [Google Scholar] [CrossRef]
- Barbarino, G. A systematic approach to reduced GLT. BIT Numer. Math. 2022, 62, 681–743. [Google Scholar] [CrossRef]
- Hackbusch, W. Multigrid Methods and Applications; Springer: Berlin/Heidelberg, Germany, 1985. [Google Scholar]
- Saye, R.I. Efficient multigrid solution of elliptic interface problems using viscosity-upwinded local discontinuous Galerkin methods. Commun. Appl. Math. Comput. Sci. 2019, 14, 247–283. [Google Scholar] [CrossRef]
- Aricò, A.; Donatelli, M.; Serra-Capizzano, S. V-cycle optimal convergence for certain (multilevel) structured linear systems. SIAM J. Matrix Anal. Appl. 2004, 26, 186–214. [Google Scholar] [CrossRef][Green Version]
- Fiorentino, G.; Serra, S. Multigrid methods for Toeplitz matrices. Calcolo 1991, 28, 283–305. [Google Scholar] [CrossRef]
- Donatelli, M.; Ferrari, P.; Furci, I.; Serra-Capizzano, S.; Sesana, D. Multigrid methods for block-Toeplitz linear systems: Convergence analysis and applications. Numer. Linear Algebra Appl. 2021, 28, e2356. [Google Scholar] [CrossRef]
- Garoni, C.; Serra-Capizzano, S.; Sesana, D. Spectral analysis and spectral symbol of d-variate Qp Lagrangian FEM stiffness matrices. SIAM J. Matrix Anal. Appl. 2015, 36, 1100–1128. [Google Scholar] [CrossRef]
- Rahla, R.; Serra-Capizzano, S.; Tablino-Possio, C. Spectral analysis of Pk finite element matrices in the case of Friedrichs-Keller triangulations via generalized locally Toeplitz technology. Numer. Linear Algebra Appl. 2020, 27, e2302. [Google Scholar] [CrossRef]
- Love, A.E.H.; Goldstine, H.H. A Treatise on the Mathematical Theory of Elasticity; Dover Publications: New York, NY, USA, 1944. [Google Scholar]
- Mott, P.H.; Roland, C.M. Limits to Poisson’s ratio in isotropic materials. Phys. Rev. B 2009, 80, 132104. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).