On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design

Marth, Edmund; Bramerdorfer, Gerd

doi:10.3390/stats3030026

Open AccessArticle

On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design

by

Edmund Marth

^*

and

Gerd Bramerdorfer

Department of Electrical Drives and Power Electronics, Johannes Kepler University Linz, 4040 Linz, Austria

^*

Author to whom correspondence should be addressed.

Stats 2020, 3(3), 412-426; https://doi.org/10.3390/stats3030026

Submission received: 11 July 2020 / Revised: 8 August 2020 / Accepted: 15 September 2020 / Published: 22 September 2020

(This article belongs to the Special Issue Applied Statistics in Engineering)

Download

Browse Figures

Versions Notes

Abstract

In the field of electrical machine design, excellent performance for multiple objectives, like efficiency or torque density, can be reached by using contemporary optimization techniques. Unfortunately, highly optimized designs are prone to be rather sensitive regarding uncertainties in the design parameters. This paper introduces an approach to rate the sensitivity of designs with a large number of tolerance-affected parameters using cumulative distribution functions (CDFs) based on finite element analysis results. The accuracy of the CDFs is estimated using the Dvoretzky–Kiefer–Wolfowitz inequality, as well as the bootstrapping method. The advantage of the presented technique is that computational time can be kept low, even for complex problems. As a demanding test case, the effect of imperfect permanent magnets on the cogging torque of a Vernier machine with 192 tolerance-affected parameters is investigated. Results reveal that for this problem, a reliable statement about the robustness can already be made with 1000 finite element calculations.

Keywords:

bootstrapping; cogging torque; cumulative distribution function; electric machine; quantile; reliability; robust design; tolerances

1. Introduction

With the rise of modern optimization techniques in combination with continuously increasing computational power, it is becoming more and more likely to find outstanding solutions to certain design tasks, even if such optima are hidden in the narrowest design-space gaps [1,2]. In the field of electrical machine design, usual optimization targets are performance measures, like efficiency, cost, or torque ripple [3]. A common problem of highly optimized designs is that a small deviation of the design parameters from the ideal values may lead to a significant deterioration of the performance—the design is thus sensitive to parameter tolerances. To obtain designs which are not prone to a significant performance degradation in the presence of inevitable tolerances, some measure describing this resistance has to be incorporated into the (multi-objective) optimization process. A design which has a certain resistance against tolerances is usually called a robust design, and an optimization which takes into account the effect of tolerances is called a robust optimization. In this paper, a method is presented on how to evaluate the robustness of a design that has a large number of tolerance-affected parameters. The application of the method within the scope of an optimization will be dealt with in future work.

In the field of electrical machine design, intensive research is being carried out on the topic of robust design. A huge number of publications deal with the effect of tolerances, especially considering the cogging torque change caused by permanent magnet uncertainties, such as in [4,5,6,7]. The different influences of size, positioning, or magnetization strength and direction are often investigated by applying two or three discrete tolerance levels and searching for worst-case combinations. To reduce the inherent computational burden associated with finite element analyses, symmetry conditions with semi-analytical methods [5] or surrogate models [7,8] are often utilized. Besides searching for worst-case configurations based on discrete tolerance levels, stochastic approaches, like the six sigma quality method [9,10,11], are also investigated to predict the behavior of systems subject to tolerances. Using stochastic measures like the standard deviation

σ

might be preferable compared to the worst-case method [12]. For example, when designing an airplane propulsion, the worst-case measure might definitely be the proper method, since each single failure can have tremendous consequences; whereas for a standard industrial electric machine, this leads to an over-designed, and thus over-priced system which is not competitive.

An overview of different robustness criteria applied in the field of electrical machine design was recently presented in [13], where a quantile measure based on the evaluation of cumulative distribution functions (CDFs) was also introduced for a problem with four tolerance-affected design parameters. The capability of CDFs for large-scale tolerance analysis of problems showing a huge number of parameters and the feasibility of predicting their confidence bounds is presented in this paper by focusing on electric machine applications. The chosen test problem will be presented in Section 2, followed by an introduction to the cumulative distribution function and a theoretical method for confidence estimation in Section 3. In Section 4, the calculation of the sample data is described and the evolution of the CDF with increasing number of samples is illustrated. For a data-based estimation of the CDFs‘ confidence, the bootstrap method [14,15,16] was utilized (Section 5). Finally, we discuss benefits and limitations of the presented method and give a short outlook and conclusion.

2. Test Case Definition

To show the capabilities of using CDFs as a robustness measure for large-scale tolerance analysis, an outer rotor Vernier motor is used as a striking example. Such motors modulate the flux created by a low pole count winding system to an air gap field with major higher harmonic components. As a consequence, an appropriate high number of magnets has to be arranged on the outer rotor to interact with the corresponding air gap field. Such motors can be designed to have outstanding performance in low speed and high torque applications, but also suffer from a significantly nonlinear relationship between the dimensions and the magnetic flux distribution [17]. Thus, reliable surrogate models with sufficient accuracy are hard to obtain. The Vernier motor used for this investigation is shown in Figure 1, and some basic parameters are given in Table 1.

It might be mentioned that this Vernier motor has not a claim to be optimal in any sense—that is, it was not optimized for any special purpose, but should only serve as an illustrative and demanding real-world test case. The goal of this investigation was to predict the impact of permanent magnet uncertainties on the cogging torque. The uncertainties concern the

magnet width ( $Δ b_{m}$ ),
magnet height ( $Δ h_{m}$ ),
radial position ( $Δ x_{r}$ ), and
circumferential position ( $Δ x_{φ}$ ).

The tolerance-affected parameters are also shown in Figure 1. The width and height vary due to the manufacturing process of the permanent magnets, and their tolerances have maximum values of

δ_{b_{m}} = \pm 0.05 mm

and

δ_{h_{m}} = \pm 0.05 mm

, respectively. Radial and circumferential positions can vary due to the assembling process of the permanent magnets. If, for example, the magnets are glued, the adhesive gap thickness can vary, leading to a radial position which deviates from the expected value. For this investigation, a radial maximum displacement tolerance of

δ_{x_{r}} = - 0.01 mm

is assumed. Note that for the radial displacement, only a negative tolerance limit is given, what corresponds to a movement towards the center. The fourth parameter is the circumferential position of the magnets. They are buried into slots with a width of

b_{s}

, as illustrated in the highlighted area of Figure 1. The maximum circumferential displacement of a certain magnet i is a function of its actual tolerance-affected width, that is,

δ_{x_{φ}}^{i} = \pm \frac{b_{s} - b_{m Δ}^{i}}{2}

, where

b_{m Δ}^{i} = b_{m} + Δ b_{m}^{i}

.

In the test machine, the slots have a width of

b_{s} = b_{m} + δ_{b_{m}}

. Thus, a magnet i prone to the maximum tolerance level, that is,

b_{m Δ}^{i} = b_{m} + δ_{b_{m}}

, cannot move any more (

Δ x_{φ}^{i} = 0

). A magnet with a minimum width of

b_{m Δ}^{i} = b_{m} - δ_{b_{m}}

can move up to

Δ x_{φ}^{i} = \pm δ_{b_{m}}

. All parameters subject to tolerances are assumed to have uniform distribution and are summarized in Table 2.

It might be worth emphasizing that due to the stochastic nature of the tolerances, all 48 magnets are likely to have individual values for a given design. Therefore, their specific values will be modeled independently. Assuming a discretization of three steps for each of the four parameters, this leads to

3^{4} = 81

possible combinations for a single magnet. Taking into account that every magnet features its independent tolerance values, a total of

{(3^{4})}^{48} = 3^{192} > 4 \times 10^{91}

different variations are possible. If the simulation of one design would need one microsecond, the evaluation of all variants would need more than

10^{78}

years. Even if a proper analytical (surrogate) model would be available to calculate cogging torque, evaluating all combinations would never be an option. Hence, alternative approaches need to be developed.

3. Introduction to Cumulative Distribution Functions

From a set of n tolerance-affected sample values

X_{1}, \dots, X_{n}

, that is, n samples of a certain target evaluated under the influence of tolerances, the empirical cumulative distribution function (ECDF)

{\hat{F}}_{x} (x)

is defined as [18]

{\hat{F}}_{x} (x) = \frac{1}{n} \sum_{i = 1}^{n} I (X_{i}, x), I (X_{i}, x) = \{\begin{matrix} 1 if X_{i} \leq x \\ 0 if X_{i} > x . \end{matrix}

(1)

In words,

{\hat{F}}_{x} (x)

is the weighted count of samples

X_{i}

for which

X_{i}

is below or equal to the threshold, x. A set of exemplary random samples

X_{i}

, and the respective ECDF

{\hat{F}}_{x} (x)

are plotted in Figure 2.

From the cumulative distribution function, it is easy to determine how many of the samples have a value below or equal to a certain limit, or what the parameter limit

q_{x}

is, where a certain percentage p of the samples has a smaller or equal value. This threshold

q_{x}^{p}

is called the p-quantile. In Figure 2, the 90% quantile is shown, meaning that 90% of the samples have a value smaller or equal to

q_{x}^{0.9} = 2.73

.

Given the n samples, a continuous probability density function

f (x)

can also be fitted, using kernel density estimation (KDE). KDE is a popular approach to estimate the probability density function of random datasets, and due to its non-parametric nature, it is also capable of fitting complicated distributions [18,19,20]. In Figure 2, this KDE estimate

f (x)

is shown as a blue line.

From

f (x)

, the CDF

F_{x} (x)

can be determined as

F_{x} (x) = \int_{- \infty}^{x} f (ξ) d ξ .

(2)

The 90%-quantile in Figure 2 is shown based on this continuous CDF. As an absolute measure for the robustness of a design, any quantile value can be used. Another indicator is the steepness of

F_{x}

—that is, the steeper the CDF is, the more robust the system is.

It is clear that the ECDF

{\hat{F}}_{x}

, and consequently, also the fitted CDF

F_{x}

, can only be an estimate of the real CDF with

n \to \infty

. Thus, the remaining but essential question is: How good is this estimate for a given number of n? From the Dvoretzky–Kiefer–Wolfowitz inequality, a

(1 - α)

-confidence band for

F_{x}

can be constructed as follows [18]:

P (L (x) \leq F (x) \leq U (x) for all x) \geq 1 - α,

(3)

where

\begin{matrix} L (x) & = max {{\hat{F}}_{n} (x) - ϵ_{n}, 0} \\ U (x) & = min {{\hat{F}}_{n} (x) + ϵ_{n}, 1} \end{matrix}

(4)

and

ϵ_{n} = \sqrt{\frac{1}{2 n} log (\frac{2}{α})} .

(5)

Equation (3) implies that the probability

P

of the real CDF

F_{x}

lying in between some lower

L (x)

and upper

U (x)

boundary is greater than or equal to

1 - α

, where the definitions of the lower and upper bound are given by (4) and (5). For example, if we want 95% confidence (

α = 0.05

) of the estimation

{\hat{F}}_{x}

shown in Figure 2 with

n = 34

samples, from (5) an uncertainty of

ϵ_{n} = 0.233

, which is equivalent to

\pm 23.3 %

, can be calculated. Thus, the real CDF has a probability of 95% laying in the confidence slot of

\hat{F_{x}} \pm 23.3 %

. This slot, bounded by the light-grey dashed lines

L (x)

and

U (x)

, is also displayed in Figure 2. The method based on (3) to (5) will be referred to as the DKW method throughout the paper.

Let us assume that a 95% confidence interval with a slot of

\pm 1 %

is desired. An approximation of the required sample size can be calculated by rearranging (5), leading to n = 18,445 samples for this example. Note that no information about the actual problem is necessary using the DKW method. However, from (5), it can also be observed that the confidence slot and thus

{\hat{F}}_{x}

converges with

\frac{1}{\sqrt{n}}

.

If we are interested in the confidence range in parameter space x rather than in the probability space

p = F_{x} (x)

, the (E)CDF can be inverted to get

F_{x}^{- 1} (p)

. Since

F_{x}^{- 1} (p)

is basically the definition of the p-quantile, the inverse is also called a quantile function.

4. Creation of Sample Data and Evolution of the CDF

To create the test data for this investigation, a parametric finite element model was set up using the SyMSpace framework [21]. For the finite element calculations, FEMM software was used [22], to which SyMSpace features an interface. Since the circumferential positioning

Δ x_{φ}^{i}

of each magnet i is dependent on its actual width, which is also subject to tolerances, for this parameter the uniform distribution is mapped to a normalized range of

Δ ξ_{x_{φ}}^{i} = [- 1, 1]

, where

Δ ξ_{x_{φ}}^{i} = \frac{Δ x_{φ}^{i}}{δ_{x_{φ}}} .

(6)

Considering (6) and Figure 1, if

Δ ξ_{x_{φ}}^{i} = 1

, the affected magnet is moved in clockwise direction, such that it is touching the edge of the slot. For

Δ ξ_{x_{φ}}^{i} = - 1

, the magnet is against the slot edge in counterclockwise direction. In Figure 3 the normalized tolerance values of all magnets for an exemplary sample are visualized, where

Δ ξ_{x_{r}}^{i} = \frac{Δ x_{r}^{i}}{δ_{x_{r}}}, Δ ξ_{b_{m}}^{i} = \frac{Δ b_{m}^{i}}{δ_{b_{m}}}, Δ ξ_{h_{m}}^{i} = \frac{Δ h_{m}^{i}}{δ_{h_{m}}} .

(7)

To speed up calculations done by the finite element program, usually sectors with some periodic boundary conditions are defined and the rotor is rotated by one electrical period. Since these investigations consider tolerances leading to asymmetric designs, that is, every magnet is exposed to different tolerance values, the full cross-section has to be modeled in the finite element program and the rotor has to be rotated by the angular width of one main stator tooth (

360 ° / 9 = 40 °

in this case) to allow for analysis of all possible harmonics of the cogging torque. In our case, such a calculation takes approximately 15 min. From the obtained cogging torque

T_{c} (φ)

as a function of the rotor angle

φ

, the peak-to-peak value

T_{c, p p}

is evaluated, which also serves as a target variable for the following investigations.

After evaluating

n = 20

randomly defined motor cross-sections (a motor subject to a random set of tolerances will also be referred to as a sample throughout the paper), the CDF

F_{T}^{20}

is fitted from the

T_{c, p p}

values, as displayed in Figure 4. The right superscript denotes the number of n samples upon which the CDF is based, that is,

F_{T}^{n}

.

The continuous CDF

F_{T}^{20}

, which is fitted by kernel density estimation using a Gauss kernel, as well as its deviation to the empiric CDF

{\hat{F}}_{T}^{20}

with respect to probability

ε_{p}

(lower axis), and its deviation to

{\hat{F}}_{T}^{20}

with respect to the quantile value (cogging torque)

ε_{q}

(right axis) are also shown in Figure 4. From the appended axes, it can be observed that the deviation from the fitted to the empiric CDF is below 10% and

5 mNm

.

To have a single measure for the deviation between two (E)CDFs,

F_{x, 1} (x)

and

F_{x, 2} (x)

, the absolute area between them,

A_{d} (F_{x, 1}, F_{x, 2})

, can be evaluated:

A_{d} (F_{x, 1}, F_{x, 2}) = \int_{min (x)}^{max (x)} |F_{x, 1} (x) - F_{x, 2} (x)| d x .

(8)

Since the (E)CDF is a normalized quantity,

A_{d}

has the unit of the parameter which the distribution describes (

Nm

in this case).

A_{d}

, generally defined as the deviation between two arbitrary CDF functions, will serve as a parameter predicting the convergence of the approach presented in the following.

Now, from Figure 4 and also from Figure 2, it can be seen that the ECDF can be reasonably fitted by kernel density estimation, but it is not clear if a CDF, based on a low number of samples, reflects the statistical behavior to be expected when manufacturing this motor in a mass production process. Thus, the idea is to observe the evolution of the CDF when more and more samples are calculated. Starting from the CDF based on 20 samples, a selection of CDFs based on an increased number of samples is shown in Figure 5.

In total, 2000 samples were evaluated. The CDF fitted from this full dataset will serve as a reference CDF

F_{T}^{2 k}

. In Figure 5, and also in all the following investigations, only the KDE-fitted CDFs are considered.

Now, from Figure 5, it can be observed that

F_{T}

converges very fast. The appended axes below and on the right side show the deviation of

F_{T}^{n}

to

F_{T}^{2 k}

. From this Figure, it seems that

F_{T}^{500}

already converged quite close to a “final” CDF. Thus, it looks like a statistical statement on the robustness of cogging torque against magnet tolerances can be derived within 500 simulations for a problem with an incredible number of possible combinations.

An explanation for this desirable behavior is the insensitivity of the KDE-fitted CDF to fluctuations of the underlying data. On the one hand, the CDF is based on a set of sample values, where each single sample contributes with a value of

1 / n

to the functional, and on the other hand, the fitting process has a smoothing effect.

However, thinking on the iterative procedure of creating new samples and updating

F_{T}

, the question of how to define convergence remains, since in Figure 5 the convergence was identified by referring to a CDF based on much more samples (

F_{T}^{2 k}

) than actually available. We cannot rely on a comparison to this “best available” CDF, but have to compare to the CDFs from previous update steps and track the evolution of the CDFs.

To evaluate this behavior, every 10 samples a CDF was calculated, that is,

F_{T}^{10}, F_{T}^{20}, F_{T}^{30}, \dots F_{T}^{2 k}

, and the deviation area between successive CDFs,

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

, was evaluated. This course, and also the deviation with respect to

F_{T}^{2 k}

,

A_{d} (F_{T}^{n}, F_{T}^{2 k})

, is shown in Figure 6.

Looking at the deviation to

F_{T}^{2 k}

, which gives an absolute measure of the convergence, it turns out that a significant drop appears within the first 100 samples, then there is some fluctuation and from about 500 samples,

A_{d} (F_{T}^{n}, F_{T}^{2 k})

is stable and shows a steady convergence towards zero. The relative deviation,

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

, converges even faster to very low values, as can be seen from the lower axis of Figure 6. Thus, from

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

, one could interpret a faster convergence of

F_{T}

, but since it is a relative measure, it could have an integrating character. This means that even if

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

is low,

F_{T}

could slowly but steadily move in one direction, and no information is available about the “real” limit,

F_{T}^{\infty}

.

Another point which was not discussed so far is also shown in Figure 6—the effect of random sampling. Since the samples are defined randomly, or more precisely, the tolerance values of each parameter for every magnet are selected randomly considering its distribution, it is obvious that quite different CDFs can emerge, especially at the beginning. To verify this, the evolution of

A_{d} (F_{T}^{n}, F_{T}^{2 k})

and

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

was evaluated with the available samples in reversed order—that is, the sample number 2000 was used as the first one, sample number 1999 was used as the second one, and so on. The reversed order was designated with a negative sign of the sample index n in the corresponding deviation function, as drawn in Figure 6 with dotted lines. When comparing the deviations based on the standard ordered samples with the ones based on the reversed ordered samples, it is worth emphasizing that, up to sample index 1000, they relied on completely different samples. Having this in mind, the comparison shows interesting insights—the first one comes by observing

A_{d} (F_{T}^{- n}, F_{T}^{- 2 k})

, which has an (expected) deviation to

A_{d} (F_{T}^{n}, F_{T}^{2 k})

in the lower samples range, but shows good agreement with

A_{d} (F_{T}^{n}, F_{T}^{2 k})

already from

n = 400

onwards. The second one is that the relative evolution shows pretty much the same behavior, regardless of whether the samples are in standard or in reversed order.

To summarize this section:

The method seems promising, especially since $A_{d} (F_{T}^{n}, F_{T}^{2 k})$ and $A_{d} (F_{T}^{- n}, F_{T}^{- 2 k})$ converge to the same value even for different sets of samples (cf. range $400 < n < 1000$ in Figure 6).
From the relative deviation area $A_{d} (F_{T}^{n}, F_{T}^{n - 10})$ , no reference is available about the remaining deviation to $F_{T}^{\infty}$ .
The relative deviation seems to be quite robust concerning the random sampling, since its evolution has a high correlation for different sets of samples ( $n < 1000$ in Figure 6).

5. Evaluation of Data-Based CDF Confidence Values

For an iterative approach,

A_{d} (F_{T}^{n}, F_{T}^{x})

with

x ≫ n

cannot be evaluated, since no “best” reference

F_{T}^{x}

(

F_{T}^{2 k}

in the previous section) is available. From the relative evolution of

F_{T}

, or more precisely, of

A_{d} (F_{T}^{n}, F_{T}^{n - 10})

, no statement can further be drawn about the absolute deviation, due to the (possible) integrating character.

From the presented theory of Section 3, especially (5), one could determine which sample count n is required for guaranteeing a specified confidence interval. However, the underlying problem to be investigated is not taken into account, as (5) was developed for deriving confidence intervals for any empiric cumulative distribution function without loss of generality. Such general (not problem-specific) derivations usually implicate very conservative measures, such as confidence interval bounds. Besides, they often imply assumptions, like unbiased data sets, which can hardly be ensured for practical applications. While it can be more easily guaranteed for the tolerance-affected input variables, ensuring an unbiased performance measure is practically impossible, as the input–output relation is not known in general.

To overcome the lack of problem-specific information, the bootstrap approach is used to estimating the confidence interval [14,15,16,20]. This common method in statistical inference utilizes the sample data itself and not only the number of samples for confidence boundary estimation.

The confidence value is created using the following approach:

Create $n_{b}$ batches (bootstrap samples), where every batch contains n samples which are drawn from the original dataset with replacement. The original dataset also features n samples.
Create a CDF for every bootstrapped batch of samples $F_{T}^{b}$ and calculate the deviation to the CDF based on the original dataset, $A_{d} (F_{T}^{b}, F_{T}^{n})$ .
Investigate the distribution of the deviations by creating a CDF based on the deviation values $F_{A_{d}}$ .
Evaluate a confidence value based on $F_{A_{d}}$ .

Let us go through the steps using an example: Assume a dataset with

n = 100

samples. From these 100 samples,

n_{b} = 200

batches of samples are created, where each batch also contains

n = 100

samples. The samples of one batch are randomly selected out of the full dataset with replacement. From this, it follows that a specific sample from the original dataset can occur in a batch multiple times. The CDF based on all available samples,

F_{T}^{100}

, is plotted in Figure 7a together with the

n_{b} = 200

different CDFs calculated from the 200 batches,

F_{T}^{b}

.

Since every batch consists of a different set of samples, the corresponding CDFs look different. They form a tube around

F_{T}^{100}

, which defines some kind of confidence for the case where 100 samples are available. In the next step, we want to have a look at the deviation of every batch CDF to the original data CDF,

A_{d} (F_{T}^{b}, F_{T}^{100})

. The probability density function

f_{A_{d}}

of these deviations, as well as the corresponding cumulative distribution function

F_{A_{d}}

are shown in Figure 7b.

From

F_{A_{d}}

, we can now easily read which share of the batches leads to a deviation area below a certain limit, or the other way round: what is the maximum deviation area that a certain amount of batches does not exceed? In our example, 90% of the batches lead to a deviation area of

A_{d} \leq 0.0082 Nm

, as also shown in Figure 7b. This parameter will be used to rate the confidence of the batches and is designated by

C_{T}^{0.9}

. Although

C_{T}^{0.9}

gives the 90%-quantile of

A_{d} (F_{T}^{b}, F_{T}^{100})

, the index T is used. On the one hand, this was chosen because it should serve as a confidence value for pretending the distribution of the cogging torque

T_{c, p p}

. On the other hand, this choice can be argued by the following interpretation of

C_{T}

: If any CDF

F_{x} (x)

is horizontally shifted by any value

δ_{x}

, the area

A_{d}

between the original and the shifted CDF just corresponds to the shift value, since

F_{x}

is always normalized between zero and one—thus,

A_{d} = δ_{x}

. So, if in our example

F_{T}^{100}

is shifted by

\pm C_{T}^{0.9}

on the parameter axis, as depicted in Figure 7a, it can be interpreted as a worst-case slot, where 90% of all batch CDFs are within.

An aspect to consider when using this confidence interval is related to the worst-case scenario—especially for large-scale problems, it is very unlikely that the worst-case performance can be identified through the relatively small share of initially evaluated samples. Additionally, the CDF turns very flat when approaching a quantile level close to 1, following significant errors for the corresponding quantile values (here, cogging torque levels), even if the CDF exhibits only small uncertainties on the quantile level (that is, probability value). To be on the safe side, it is recommended to use this method only up to a quantile level of approximately 0.95.

In Figure 7a, the 90% limit slots calculated with the DKW method are additionally shown. Not to be confused with the

C_{T}^{0.9}

limit, the

L - U

slot defines the slot, wherein the real CDF (

F_{T}^{\infty}

) is with a probability of 90%. Similar to the evolution of

F_{T}

, now the question arises about how many batches are necessary until

F_{A_{d}}

converges. Since it is again about convergence of a cumulative distribution function, Equations (3)–(5) can be applied, which basically means the same convergence behavior for

F_{A_{d}}

as for

F_{T}

. For 5000 batches,

ϵ_{n} = 1.9 %

can be evaluated from (5). To show convergence, the deviation of

F_{A_{d}}^{n_{b}}

to

F_{A_{d}}^{5 k}

,

A_{d} (F_{A_{d}}^{n_{b}}, F_{A_{d}}^{5 k})

was evaluated as a function of the number of batches

n_{b}

and presented in Figure 8.

It can be seen that convergence is fast, and that the prediction continuously gets better with an increasing number of samples. It is further important to emphasize that the creation of

F_{A_{d}}

is very fast, because no finite element calculations are needed. One additional aspect concerning convergence is that taking the shape of the whole CDF into account is probably too strict as a criterion. Actually, we are interested in the convergence of particular quantiles

C_{T}^{p}

, rather than the convergence of the whole distribution function. In Figure 9,

C_{T}^{0.9}

is shown over the number of batches and for different values of total samples n.

It can be observed that

C_{T}^{0.9}

is fluctuating within the first 500 batches and has almost reached convergence after 1000 batches. What can further be seen even better than in Figure 8 is that

C_{T}^{0.9}

drops significantly with an increasing number of total samples.

The evolution of different confidence values over the number of available samples is shown in Figure 10, together with the absolute and relative deviation area (cf. Figure 6).

What can be observed from Figure 10 is that

C_{T}

seems to be a good measure for the absolute deviation, although it is only based on data available at every sample step.

In Figure 11 the evolution of different quantile values of the cogging torque

T_{c, p p}^{p}

are plotted together with the worst-case 90% confidence band calculated with the batch method presented within this section.

Additionally, the confidence bounds based on the quantile functions of L and U of the DKW method are illustrated by dashed lines. Especially for high quantile levels, p, the DKW method leads to much more conservative predictions of the confidence slot.

6. Discussion

What we have shown so far is a method to statistically predict the effect of manufacturing tolerances apparent in electric machine mass production, where a large number of tolerance-affected parameters are taken into account. If the performance variation of such an investigation is not satisfying, it might be interesting to investigate which of the tolerance-affected parameters has the highest impact. Unfortunately, this information cannot be deduced from the resulting CDF,

F_{T}

. Luckily, the method gives reasonable results with low computational effort, measured against the complexity of the problem. For example, the 2000 samples of the test case were simulated within approximately 8 h on an available HTCondor [23] cluster with around 150 workstation cores (where, of course, not all of them were used simultaneously). So, to identify which tolerance parameter has a high influence, and also which one can maybe neglected, the presented analysis can be performed for every parameter in a one-factor-at-a-time manner. For the test case, four different runs (every parameter of Table 2) would be necessary.

It should also be mentioned that for problems with a low number of tolerance-affected parameters, this method might not be the first choice. Sampling of tolerance-affected space using design of experiments, creating surrogate models upon these samples, and evaluating the influence of tolerances using the surrogate models will be more effective. The blurry and problem-dependent decision boundary for using state-of-the-art methods or the one presented here will be a topic of further investigation.

So far, most of the presented work on tolerance analysis has been about symmetric changes, such as the same change of the height of all utilized permanent magnets. This is done because the more realistic individual changes require much more computational effort and the modeling and evaluation complexity are significantly increased. The approach presented here allows for analyzing realistic scenarios that are much more likely to be present in the mass manufacturing of electric machines, while still maintaining reasonable computational cost.

By contrast, available counterparts from the literature featuring a considerable number of individual parameter changes are, for example, either based on (i) analytical models [24] or (ii) on theoretical investigations about worst-case scenarios [12]. Such approaches constitute solutions with much less required computational effort. However, they are generally not flexible, considering their application on many different machine types and corresponding designs. Besides, they are often limited to, for example, a particular performance measure, such as the worst-case one [12].

With the approach derived here, those limitations do not apply. In order to further improve the approach presented here, the authors are considering the development of a hybrid version, that is, an analysis facilitated through making use of the cumulative distribution function, which is both based on finite element-based analysis of selected samples, and consequent surrogate modeling and evaluation of further configurations. This would allow for further minimization of the computational burden, while utmost flexibility for analyzing diverse scenarios is sustained. Consequently, such further improved approaches are more likely to be applied for online robustness evaluation of designs to be evaluated within electric machine design optimization problems.

7. Conclusions and Outlook

This investigation has shown that a cumulative distribution function is suitable to statistically predict the effect of parameter uncertainties on some target functions with comparably low computational effort. More specifically, it is especially useful if there is a large number of uncertain parameters which would be hard to analyze by applying other available methods from the literature. The considered test case was about investigating the effect of manufacturing and assembling tolerances of Vernier machine permanent magnets on cogging torque. A data-based method has been presented for solving such problems. This further allows for predicting the confidence of the cumulative distribution function, which was applied to describe the statistical behavior of the cogging torque. The confidence evaluation utilizing the bootstrap method was compared with a mathematical confidence definition based on the Dvoretzky–Kiefer–Wolfowitz inequality. As was observed, the Dvoretzky–Kiefer–Wolfowitz definition follows much more conservative estimations. Future work will include the following activities:

Testing the presented approach on different problems to show its reliability and applicability.
Incorporating the method into a robust optimization framework.
Trying to gain separability of the tolerance parameters’ influence on robustness—that is, to find a method to evaluate from CDF data which parameters have a crucial impact on the target, and further derive which ones have little effect and thus can be neglected.

Author Contributions

Conceptualization, E.M. and G.B.; methodology, E.M. and G.B.; investigation, E.M.; writing—original draft preparation, E.M.; writing—review and editing, E.M. and G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported by the COMET-K2 “Center for Symbiotic Mechatronics” of the Linz Center of Mechatronics (LCM) funded by the Austrian federal government and the federal state of Upper Austria.

Acknowledgments

The authors thank the whole SyMSpace team for their excellent support and their patience.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CDF	cumulative distribution function
DKW	Dvoretzky–Kiefer–Wolfowitz method
ECDF	empiric cumulative distribution function
KDE	kernel density estimation

References

Bramerdorfer, G.; Tapia, J.A.; Pyrhonen, J.J.; Cavagnino, A. Modern Electrical Machine Design Optimization: Techniques, Trends, and Best Practices. IEEE Trans. Ind. Electron. 2018, 65, 7672–7684. [Google Scholar] [CrossRef]
Duan, Y.; Ionel, D.M. A Review of Recent Developments in Electrical Machine Design Optimization Methods With a Permanent-Magnet Synchronous Motor Benchmark Study. IEEE Trans. Ind. Appl. 2013, 49, 1268–1275. [Google Scholar] [CrossRef]
Fatemi, A.; Ionel, D.M.; Popescu, M.; Chong, Y.C.; Demerdash, N.A.O. Design Optimization of a High Torque Density Spoke-Type PM Motor for a Formula E Race Drive Cycle. IEEE Trans. Ind. Appl. 2018, 54, 4343–4354. [Google Scholar] [CrossRef]
Gašparin, L.; Černigoj, A.; Markič, S.; Fišer, R. Additional cogging torque components in permanent-magnet motors due to manufacturing imperfections. IEEE Trans. Magn. 2009, 45, 1210–1213. [Google Scholar] [CrossRef]
Coenen, I.; Van Der Giet, M.; Hameyer, K. Manufacturing tolerances: Estimation and prediction of cogging torque influenced by magnetization faults. IEEE Trans. Magn. 2012, 48, 1932–1936. [Google Scholar] [CrossRef]
Ge, X.; Zhu, Z.Q. Sensitivity of Manufacturing Tolerances on Cogging Torque in Interior Permanent Magnet Machines With Different Slot/Pole Number Combinations. IEEE Trans. Ind. Appl. 2017, 53, 3557–3567. [Google Scholar] [CrossRef]
Bramerdorfer, G. Quantifying the Impact of Tolerance-Affected Parameters on the Performance of Permanent Magnet Synchronous Machines. IEEE Trans. Energy Convers. 2020, 8969. [Google Scholar] [CrossRef]
Kim, S.; Lee, S.G.; Kim, J.M.; Lee, T.H.; Lim, M.S. Uncertainty identification method using kriging surrogate model and Akaike information criterion for industrial electromagnetic device. IET Sci. Meas. Technol. 2020, 14, 250–258. [Google Scholar] [CrossRef]
Koch, P.N.; Yang, R.J.; Gu, L. Design for six sigma through robust optimization. Struct. Multidiscip. Optim. 2004, 26, 235–248. [Google Scholar] [CrossRef]
Lei, G.; Guo, Y.G.; Zhu, J.G.; Wang, T.S.; Chen, X.M.; Shao, K.R. System Level Six Sigma Robust Optimization of a Drive System With PM Transverse Flux Machine. IEEE Trans. Magn. 2012, 48, 923–926. [Google Scholar] [CrossRef]
Lei, G.; Wang, T.; Zhu, J.; Guo, Y.; Wang, S. System-Level Design Optimization Method for Electrical Drive Systems-Robust Approach. IEEE Trans. Ind. Electron. 2015, 62, 4702–4713. [Google Scholar] [CrossRef]
Yang, Y.; Bianchi, N.; Zhang, C.; Zhu, X.; Liu, H.; Zhang, S. A method for evaluating the worst-case cogging torque under manufacturing uncertainties. IEEE Trans. Energy Convers. 2020, 8969. [Google Scholar] [CrossRef]
Bramerdorfer, G. Robustness Criteria for Concurrent Evaluation of the Impact of Tolerances in Multiobjective Electric Machine Design Optimization. China Electrotech. Soc. Trans. Electr. Mach. Syst. 2020, 4, 4–12. [Google Scholar] [CrossRef]
Efron, B. Bootstrap Methods: Another Look at the Jackknife. Ann. Stat. 1979, 7, 1–26. [Google Scholar] [CrossRef]
Boos, D.D. Introduction to the Bootstrap World. Stat. Sci. 2003, 18, 168–174. [Google Scholar] [CrossRef]
Efron, B.; Narasimhan, B. The Automatic Construction of Bootstrap Confidence Intervals. J. Comput. Graph. Stat. 2020, 1–32. [Google Scholar] [CrossRef]
Toba, A.; Lipo, T.A. Generic torque-maximizing design methodology of surface permanent-magnet vernier machine. IEEE Trans. Ind. Appl. 2000, 36, 1539–1546. [Google Scholar] [CrossRef]
Wasserman, L. All of Statistics: A Concise Course in Statistical Inference Brief Contents; Springer: Berlin/Heidelberg, Germany, 2004; p. 442. [Google Scholar] [CrossRef]
Rhein, B.; Clees, T.; Ruschitzka, M. Robustness measures and numerical approximation of the cumulative density function of response surfaces. Commun. Stat. Simul. Comput. 2014, 43, 1–17. [Google Scholar] [CrossRef]
Chen, Y.C. A tutorial on kernel density estimation and recent advances. Biostat. Epidemiol. 2017, 1, 161–187. [Google Scholar] [CrossRef]
Silber, S.; Koppelstätter, W.; Weidenholzer, G.; Segon, G.; Bramerdorfer, G. Reducing Development Time of Electric Machines with SyMSpace. In Proceedings of the 2018 8th International Electric Drives Production Conference (EDPC), Schweinfurt, Germany, 4–5 December 2018. [Google Scholar] [CrossRef]
Meeker, D. Available online: www.femm.info (accessed on 10 July 2020).
Thain, D.; Tannenbaum, T.; Livny, M. Distributed computing in practice: The Condor experience. Concurr. Comput. Pract. Exp. 2005, 17, 323–356. [Google Scholar] [CrossRef]
Gerber, S.; Wang, R. Statistical analysis of cogging torque considering various manufacturing imperfections. In Proceedings of the 2016 XXII International Conference on Electrical Machines (ICEM), Lausanne, Switzerland, 4–7 September 2016; pp. 2066–2072. [Google Scholar] [CrossRef]

Sample Availability: Original data is available from the authors.

Figure 1. Outer rotor Vernier machine considered for the case study. Major parameters are given in Table 1.

Figure 2. Explanation of the (E)CDF based on

n = 34

random samples (black lines): empiric cumulative distribution function

{\hat{F}}_{x} (x)

; probability density function

f_{x} (x)

fitted by kernel density estimation; cumulative distribution function

F_{x} (x)

from

f_{x} (x)

(cf. (2)); lower

L (x)

and upper

U (x)

confidence bound derived from the Dvoretzky–Kiefer–Wolfowitz inequality (cf. (4)).

Figure 2. Explanation of the (E)CDF based on

n = 34

random samples (black lines): empiric cumulative distribution function

{\hat{F}}_{x} (x)

; probability density function

f_{x} (x)

fitted by kernel density estimation; cumulative distribution function

F_{x} (x)

from

f_{x} (x)

(cf. (2)); lower

L (x)

and upper

U (x)

confidence bound derived from the Dvoretzky–Kiefer–Wolfowitz inequality (cf. (4)).

Figure 3. Normalized tolerance values of all 48 permanent magnets for one exemplary sample of the Vernier machine.

Figure 4. Cumulative distribution functions created using the first 20 samples. Appended axes show the deviation of the empiric CDF

{\hat{F}}_{T}^{20}

to the fitted CDF

F_{T}^{20}

for the probability

ε_{p}

, as well as for the quantile values

ε_{q}

; deviation area

A_{d} = 7.95 \cdot 10^{- 3} Nm

.

Figure 4. Cumulative distribution functions created using the first 20 samples. Appended axes show the deviation of the empiric CDF

{\hat{F}}_{T}^{20}

to the fitted CDF

F_{T}^{20}

for the probability

ε_{p}

, as well as for the quantile values

ε_{q}

; deviation area

A_{d} = 7.95 \cdot 10^{- 3} Nm

.

Figure 5. Evolution of

F_{T}^{n}

for an increasing number of samples. The deviations

ε_{p}

and

ε_{q}

are with respect to the reference CDF

F_{T}^{2 k}

.

Figure 5. Evolution of

F_{T}^{n}

for an increasing number of samples. The deviations

ε_{p}

and

ε_{q}

are with respect to the reference CDF

F_{T}^{2 k}

.

Figure 6. Evolution of

A_{d}

when

F_{T}^{n}

is compared to previous CDF

F_{T}^{(n - 10)}

and to the final one

F_{T}^{2 k}

. Dotted lines show the same evaluations, but with the samples used in reversed order, designated by a negative sample index. Inset shows a zoom of the main figure.

Figure 6. Evolution of

A_{d}

when

F_{T}^{n}

is compared to previous CDF

F_{T}^{(n - 10)}

and to the final one

F_{T}^{2 k}

. Dotted lines show the same evaluations, but with the samples used in reversed order, designated by a negative sample index. Inset shows a zoom of the main figure.

Figure 7. Derivation of the confidence value: (a) CDF of the original set of samples

F_{T}^{100}

;

n_{b} = 200

CDFs from batches,

F_{T}^{b}

; interpretation of the 90% confidence value

C_{T}^{0.9}

; upper and lower bound 90% confidence band based on Equations (3)–(5),

L^{0.9}, U^{0.9}

and (b) probability density function

f_{A_{d}}

and CDF

F_{A_{d}}

of the deviations of batch CDFs to original data CDF.

Figure 7. Derivation of the confidence value: (a) CDF of the original set of samples

F_{T}^{100}

;

n_{b} = 200

CDFs from batches,

F_{T}^{b}

; interpretation of the 90% confidence value

C_{T}^{0.9}

; upper and lower bound 90% confidence band based on Equations (3)–(5),

L^{0.9}, U^{0.9}

and (b) probability density function

f_{A_{d}}

and CDF

F_{A_{d}}

of the deviations of batch CDFs to original data CDF.

Figure 8. Evolution of deviation of

F_{A_{d}}^{n_{b}}

to

F_{A_{d}}^{5 k}

,

A_{d} (F_{A_{d}}^{n_{b}}, F_{A_{d}}^{5 k})

over number of batches

n_{b}

for different numbers of total samples, n (big picture and zoom).

Figure 8. Evolution of deviation of

F_{A_{d}}^{n_{b}}

to

F_{A_{d}}^{5 k}

,

A_{d} (F_{A_{d}}^{n_{b}}, F_{A_{d}}^{5 k})

over number of batches

n_{b}

for different numbers of total samples, n (big picture and zoom).

Figure 9. Evolution of the 90% confidence value

C_{T}^{0.9}

over number of batches for different numbers of samples,

N_{s}

.

Figure 9. Evolution of the 90% confidence value

C_{T}^{0.9}

over number of batches for different numbers of samples,

N_{s}

.

Figure 10. Confidence values over the number of available samples together with an evolution of absolute and relative deviation areas of

F_{T}

(Figure 6).

Figure 10. Confidence values over the number of available samples together with an evolution of absolute and relative deviation areas of

F_{T}

(Figure 6).

Figure 11. Different cogging torque quantile values together with the 90% confidence values as a function of the actually available samples n; dashed lines correspond to the

{(L^{0.9})}^{- 1}, {(U^{0.9})}^{- 1}

confidence bounds.

Figure 11. Different cogging torque quantile values together with the 90% confidence values as a function of the actually available samples n; dashed lines correspond to the

{(L^{0.9})}^{- 1}, {(U^{0.9})}^{- 1}

confidence bounds.

Table 1. Parameters of the Vernier motor shown in Figure 1.

Parameter	Value	Description
p	24	number of pole pairs of the outer rotor
n	27	number of micro slots of the stator
$d_{o}$	$80 mm$	outer diameter of the rotor
$l_{f e}$	$20 mm$	stack length of the motor
$l_{a i r}$	$0.5 mm$	air gap length
$b_{m}$	$3.74 mm$	nominal width of magnets
$h_{m}$	$1.73 mm$	nominal height of magnets

Table 2. Tolerance definition.

Parameter	Symbol $Δ X$	Range of $Δ X$	Tolerance Limit $δ_{X}$	Distribution
magnet width	$Δ b_{m}$	$[- δ_{b_{m}}, + δ_{b_{m}}]$	$0.05 mm$	uniform
magnet height	$Δ h_{m}$	$[- δ_{h_{m}}, + δ_{h_{m}}]$	$0.05 mm$	uniform
radial position	$Δ x_{r}$	$[- δ_{x_{r}}, 0]$	$0.01 mm$	uniform
circumferential position	$Δ x_{φ}$	$[- δ_{x_{φ}}, + δ_{x_{φ}}]$	$\frac{b_{s} - b_{m Δ}}{2}$	uniform *

* on a normalized range (see Section 4).

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Marth, E.; Bramerdorfer, G. On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design. Stats 2020, 3, 412-426. https://doi.org/10.3390/stats3030026

AMA Style

Marth E, Bramerdorfer G. On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design. Stats. 2020; 3(3):412-426. https://doi.org/10.3390/stats3030026

Chicago/Turabian Style

Marth, Edmund, and Gerd Bramerdorfer. 2020. "On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design" Stats 3, no. 3: 412-426. https://doi.org/10.3390/stats3030026

APA Style

Marth, E., & Bramerdorfer, G. (2020). On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design. Stats, 3(3), 412-426. https://doi.org/10.3390/stats3030026

Article Menu

On the Use of the Cumulative Distribution Function for Large-Scale Tolerance Analyses Applied to Electric Machine Design

Abstract

1. Introduction

2. Test Case Definition

3. Introduction to Cumulative Distribution Functions

4. Creation of Sample Data and Evolution of the CDF

5. Evaluation of Data-Based CDF Confidence Values

6. Discussion

7. Conclusions and Outlook

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI