Mechanical Properties Study of Miniature Steel Specimens Based on the Small Punch Test and Simulation Methods

The small punch test (SPT) can be very convenient to obtain mechanical properties due to its unique advantages from small-volume samples, and has gained wide popularity and appreciation among researchers. In this paper, the SPT test and finite element (FE) simulations were performed for three alloys, and the yield stresses (σYS) and ultimate tensile strengths (σUTS) from the uniaxial tensile test (UTT) were correlated with the yield force (Fy) and maximum force (Fm) of the small punch test (SPT) before and after compliance calibration. Finally, the effect of specimen size on the SPT curves was discussed. The results showed that the deviation between SPT test and FE simulation was due to the loading system stiffness, which was confirmed by the loading system compliance calibration test. The SPT curves before and after calibration have less influence on the empirical correlation results for σUTS, while the correlation results for σYS depend on the method used to determine Fy in the SPT curve. Finally, the simulation results indicated that the effect of specimen size on the force–displacement curve in the SPT is slight. This work also provides a reference for subsequent researchers to conduct empirical correlation studies using different specimen sizes.


Introduction
In the field of nuclear and petrochemical industries, a large number of facilities are affected by high-temperature environments, neutron irradiation, corrosion, and other harsh environments. After a period of service, the material properties of the equipment will deteriorate. How to obtain real-time mechanical properties of materials without damaging the structural integrity of in-service components has become the focus of many researchers.
As one of the small sample testing techniques (SSTT), the small punch test (SPT) has attracted extensive attention in the field of nuclear and petrochemical industry due to its unique advantages in obtaining material mechanical properties from small volume samples [1][2][3][4][5][6][7][8][9][10]. The SPT method, which originated in the early 1980s, was originally developed to test the changes in material properties caused by tempering embrittlement or irradiation embrittlement of in-service nuclear materials [1]. Now it has gradually developed to evaluate the mechanical property parameters of materials such as tensile properties [2][3][4][5], brittle-ductile transition temperature [6], fracture toughness [7], fatigue property [8][9][10], and creep properties [11][12][13]. In addition, the SPT method is also used to investigate creep crack propagation by some researchers [14][15][16]. The SPT test method is also used in the field of biomechanics [17]. At the same time, because of its advantages in the characteristics of "micro-damage" and "tests", SPT is particularly useful for some parts that cannot be tested by standardized tests (such as welded joints [18], thermal barrier coatings [19] and functionally gradient materials [20]).
However, despite the many advantages of SPT mentioned above, it faces many problems and challenges nowadays. One of these important issues is how to ensure the accuracy, reliability and comparability of SPT data from different laboratories. Although SPT is widely developed in the last decades, the standardization of this method is still in progress [21][22][23][24][25]. An important factor affecting the results that need to be considered is the accuracy of the correlation results between SPT and the uniaxial tensile test (UTT), which is highly dependent on the sensitivity of the test equipment [26][27][28]. This result makes it difficult to transfer the procedures related to the evaluation of mechanical parameters (σ YS and σ UTS ) by different researchers [29,30].
To solve this problem, a number of researchers have extensively discussed the different experimental parameters that affect the reliability of SPT results. Lucas et al. [31] first studied the effects of specimen thickness, hole diameter of the lower die, and punch size on force-displacement curves with different materials by experiments. Campitelli et al. [32] studied the effects of specimen thickness and friction coefficient on the force-displacement curves of AISI 316L-and F82H-MOD-tempered martensitic steels by numerical simulation. Xu et al. [33] used the FE method to discuss the influence of the punch diameter and hardness, hole diameter of lower die and chamfer radius, distance of punch from the center and specimen thickness on SPT results. Zhou et al. [34] studied the effects of friction coefficient, specimen thickness, stamping rate and a lower die aperture on force-displacement curves of SUS304 stainless steel by the GTN model; Andrés et al. [35] studied the influence of different clamping conditions on the upper die on the SPT and the small punch creep test (SPC). Peng et al. [36] systematically studied the influence of small deviations of various test parameters on the results of SPT, from the aspects of specimen geometric deviation, material mechanical properties, damage parameters, pre-tightening condition of the upper die and the friction coefficient between the punch or die and the specimen. And finally, the sensitivity of each test parameter in the five stages of the force-displacement curve was summarized.
However, in addition to the factors mentioned above, there is an important factor whose influence is often ignored, and this is the way in which the value of the displacement used for representing the force-displacement response is determined [37,38]. Different researchers have used different SPT displacement measurement systems, resulting in differences in loading system compliance, which in turn may cause their measured displacement values to shift in the direction of displacement between the SPT test and the FE simulation [39]. This deviation may be more obvious for some non-contact displacement sensors or COD-type extensometers that measure the displacement through the punch in the experimental equipment [40]. Moreno et al. [39] performed a thorough analysis in order to clarify this matter and the results showed that the different force-displacement curves in SPT can be obtained by the above different measurement methods. Hähner et al. [40] also pointed out that the initial stage curves of force-displacement in SPT are affected by using the non-contact displacement transducers. The results of Ávila et al. [27] showed that this factor leads to a deviation between SPT tests and FE simulations, and attributed this uncertainty to the large effect of elastic displacement associated with the low stiffness of the device setup. Although the LVDT sensor is recommended to obtain relatively accurate displacement results through direct contact with the bottom of the specimen in the current standard [21], this displacement measurement system is not used by all researchers. Therefore, it is a matter of discussion whether the direct use of the SPT curve before compliance calibration to correlate it with the material properties produces significantly different results from the empirical correlation of the SPT curve after calibration. In addition, the diameter of a standard small specimen is usually 8 mm or 10 mm, and the specimen's shape is either round or square, which depends on different countries and regions. As in European Union, the round specimen of Φ 8 mm × 0.5 mm is adopted. In China, round specimens with Φ 10 mm × 0.5 mm are usually used. In contrast, a 10 mm × 10 mm × 0.5 mm or 8 mm × 8 mm × 0.5 mm square specimen can be conveniently sampled from the sharp impact specimen compared with a round specimen [33]. Therefore, it is necessary to investigate the effect of different size tests on the SPT results.
In this paper, the mechanical properties of 316L, 347L stainless steels, and a new high-entropy alloy, Co 32 Cr 28 Ni 32.94 Al 4.06 Ti 3, were investigated by SPT and UTT. The effect of loading system compliance on the empirical correlation results between SPT and UTT was systematically investigated. Finally, the effect of specimen sizes on the SPT curves was investigated by the (3D) FE model. The objective of this paper was to evaluate the mechanical properties of steels for pressure vessels by the SPT method and to discuss the results of the empirical correlation between the characteristic parameters (F y and F m ) on the force-displacement curves before and after the compliance calibration and the material properties (σ YS and σ UTS ) of the standardized tests. These findings will surely contribute to the future standardization of SPT and provide a reference for subsequent researchers to conduct empirical correlation studies using different specimen sizes.

Materials
The materials used in this paper were 316L, 347L stainless steels and Co 32 Table 2. The optical microscope images of the metallographic microstructure of the above materials are shown in Figure 1.

Uniaxial Tensile Test
In order to obtain the mechanical properties of the materials, uniaxial tensile tests were carried out at room temperature using AG-X plus a universal electronic testing machine (SHIMADZU corp., Kyoto, Japan), as shown in Figure 2. A 52 mm × 14 mm × 2.5 mm plate-shaped tensile test specimen was used (Figure 3), and tensile tests were carried out at a strain rate of 1 × 10 −3 mm/s. Finally, the yield stress (σYS), ultimate tensile strength

Uniaxial Tensile Test
In order to obtain the mechanical properties of the materials, uniaxial tensile tests were carried out at room temperature using AG-X plus a universal electronic testing machine (SHIMADZU corp., Kyoto, Japan), as shown in Figure 2. A 52 mm × 14 mm × 2.5 mm plate-shaped tensile test specimen was used (Figure 3), and tensile tests were carried out at a strain rate of 1 × 10 −3 mm/s. Finally, the yield stress (σ YS ), ultimate tensile strength (σ UTS ), and uniform elongation (ε) of three materials are presented in Table 3.

Small Punch Test
SPT specimens with a diameter of 10 mm and a thickness of 0.8 mm were cut from the plate-shaped tensile specimens after the experiment. Then, the specimens were polished to 0.55 mm on both sides by 600, 1200, and 1500 grit size papers, and finally, the

Small Punch Test
SPT specimens with a diameter of 10 mm and a thickness of 0.8 mm were cut from the plate-shaped tensile specimens after the experiment. Then, the specimens were polished to 0.55 mm on both sides by 600, 1200, and 1500 grit size papers, and finally, the

Small Punch Test
SPT specimens with a diameter of 10 mm and a thickness of 0.8 mm were cut from the plate-shaped tensile specimens after the experiment. Then, the specimens were polished to 0.55 mm on both sides by 600, 1200, and 1500 grit size papers, and finally, the specimens were finely polished to a mirror brightness of 0.5 mm by diamond lapping paste to meet the standard requirements for the surface roughness of the specimens. Figure 4 shows the macroscopic morphology of the small punch specimen before and after the experiment.

Small Punch Test
SPT specimens with a diameter of 10 mm and a thickness of 0.8 mm were cut from the plate-shaped tensile specimens after the experiment. Then, the specimens were polished to 0.55 mm on both sides by 600, 1200, and 1500 grit size papers, and finally, the specimens were finely polished to a mirror brightness of 0.5 mm by diamond lapping paste to meet the standard requirements for the surface roughness of the specimens. Figure 4 shows the macroscopic morphology of the small punch specimen before and after the experiment.  In this paper, the small punch test apparatus, modified based on the small punch creep test machine, was used to conduct SPT. Its force sensor and displacement sensor are coupled to the loading brake, and the force and displacement changes are recorded by loading the punch. Once the force is received on the force sensor, the displacement sensor begins to record the punch displacement data. This displacement sensor can be used to obtain relatively accurate displacement data of the punch on the top. The schematic diagram of the apparatus is shown in Figure 5. The apparatus for clamping the specimen consists of an upper die, and a circular lower die with a hole diameter of 4 mm. The specimen is placed horizontally and perpendicular to the direction of the force. During the test, the loading is applied to the specimen by means of a punch and a ball with a diameter of 2.5 mm. The test was conducted in a quasi-static condition with a displacement rate of 0.5 mm/min.  In this paper, the small punch test apparatus, modified based on the small punch creep test machine, was used to conduct SPT. Its force sensor and displacement sensor are coupled to the loading brake, and the force and displacement changes are recorded by loading the punch. Once the force is received on the force sensor, the displacement sensor begins to record the punch displacement data. This displacement sensor can be used to obtain relatively accurate displacement data of the punch on the top. The schematic diagram of the apparatus is shown in Figure 5. The apparatus for clamping the specimen consists of an upper die, and a circular lower die with a hole diameter of 4 mm. The specimen is placed horizontally and perpendicular to the direction of the force. During the test, the loading is applied to the specimen by means of a punch and a ball with a diameter of 2.5 mm. The test was conducted in a quasi-static condition with a displacement rate of 0.5 mm/min.

FE Model and Numerical Simulation
The FE model used in this paper is shown in Figure 6. Both a two-dimensional (2D) axisymmetric model and quarter-symmetric three-dimensional (3D) model were used to simulate the SPT by ABAQUS large commercial FE software [42]. A 2D model with 2000 axisymmetric 4-node square elements, homogenized mesh, and reduced integration with

FE Model and Numerical Simulation
The FE model used in this paper is shown in Figure 6. Both a two-dimensional (2D) axisymmetric model and quarter-symmetric three-dimensional (3D) model were used to simulate the SPT by ABAQUS large commercial FE software [42]. A 2D model with 2000 axisymmetric 4-node square elements, homogenized mesh, and reduced integration with hourglass control (CAX4R) was mainly used to investigate the effect of equipment compliance on experimental data, while a 3D model with about 64,000 8-node linear brick elements, homogenized mesh, and reduced integration with hourglass control (C3D8R) was used to study the effect of specimen size on experimental data. Most researchers widely use these two models in SPT simulations [5,43]. They are suitable for analyzing cases involving large stress and strain gradients, as well as for studying complex contact problems. The agreement between the 2D and 3D model results was confirmed in the article [29,30] by Altstadt et al. Isotropic elastic-plastic materials are subject to the von Mises yield criterion and the corresponding J 2 flow theory [44,45], which takes the form as follows, where σ 1 , σ 2 , and σ 3 are the three principal stresses respectively. The material properties' inputs in ABAQUS were imported from real stress-strain data evaluated by uniaxial tensile tests of related materials, as shown in Table 3. The constitutive equation can be expressed in the following form, where the σ p (ε p ) equation is derived from the uniaxial tensile behavior [32]. Generally, the FE model constructed consists of four parts, namely the (1) punch, (2) upper die, (3) lower die, and (4) specimen. Unlike the test, the FE model perfectly simulates the test setup under ideal conditions. The upper and lower dies and punch are defined as rigid bodies, and the specimen is defined as a deformable body. The upper and lower dies were completely fixed to simulate the clamping condition during the experiment, while the displacement constraint was applied to the punch until the specimen fails. The friction coefficient between the punch and specimen was set to 0.2, which is a typical friction coefficient for steel-steel contact in the absence of lubrication [46]. The equipment dimensions for the SPT test and FE simulation were kept consistent. The relevant parameters of specimens and apparatus for different models are summarized in Table 4. The mesh refinement was performed to ensure the accuracy of simulation results. The mesh sizes of square and round specimens in the 3D model were consistent to avoid the impact of mesh sensitivity on simulation results [47,48]. where σ1, σ2, and σ3 are the three principal stresses respectively. The material properties' inputs in ABAQUS were imported from real stress-strain data evaluated by uniaxial tensile tests of related materials, as shown in Table 3. The constitutive equation can be expressed in the following form, where the σp(εp) equation is derived from the uniaxial tensile behavior [32]. Generally, the FE model constructed consists of four parts, namely the (1) punch, (2) upper die, (3) lower die, and (4) specimen. Unlike the test, the FE model perfectly simulates the test setup under ideal conditions. The upper and lower dies and punch are defined as rigid bodies, and the specimen is defined as a deformable body. The upper and lower dies were completely fixed to simulate the clamping condition during the experiment, while the displacement constraint was applied to the punch until the specimen fails. The friction coefficient between the punch and specimen was set to 0.2, which is a typical friction coefficient for steel-steel contact in the absence of lubrication [46]. The equipment dimensions for the SPT test and FE simulation were kept consistent. The relevant parameters of specimens and apparatus for different models are summarized in Table 4. The mesh refinement was performed to ensure the accuracy of simulation results. The mesh sizes of square and round specimens in the 3D model were consistent to avoid the impact of mesh sensitivity on simulation results [47,48]. (a)

SPT Experiment Results
The typical force-displacement responses of the three materials are shown in Figure 7a. Three test groups were analyzed for each material, and the results show a high degree of coincidence of the force-displacement curves. The force-displacement data of the SPT experiment curve was recorded until the force exceeded the maximum force point and after the specimen ruptured.

SPT Experiment Results
The typical force-displacement responses of the three materials are shown in Figure  7a. Three test groups were analyzed for each material, and the results show a high degree of coincidence of the force-displacement curves. The force-displacement data of the SPT experiment curve was recorded until the force exceeded the maximum force point and after the specimen ruptured. As shown in Figure 7b, The force-displacement curve can be divided into five different regions: elastic bending (region I), plastic bending (region II), membrane-stretching (region III), plastic instability (region IV), and unstable fracture (region V). However, the boundaries between these five regions are not clearly defined, and the delineation of the different regions is highly artificial. The force in the initial elastic stage (region I) and plas- As shown in Figure 7b, The force-displacement curve can be divided into five different regions: elastic bending (region I), plastic bending (region II), membrane-stretching (region III), plastic instability (region IV), and unstable fracture (region V). However, the boundaries between these five regions are not clearly defined, and the delineation of the different regions is highly artificial. The force in the initial elastic stage (region I) and plastic stage (region II) increases almost linearly, and an obvious inflection point can be seen between the two regions. This inflection point is called the elastic-plastic transition point. The ordinate value corresponding to this point is defined as the elastic-plastic transition force or yield force (F P ) (corresponding to point A in Figure 7b) by most researchers, which is usually used to correlate with the yield strength (R P ) and regarded as the dividing point between region I and region II. The plastic bending region (region II) ends when the slope of the force-displacement curve increases significantly. In region III, it can be seen that the slope of the force-displacement curve gradually increases. The reason is that after the plastic bending stage, the effect of strain hardening in the material overcomes the reduction of the specimen thickness caused by the punch and enables it to bear the force at an increased rate. This situation is similar to the strain-strengthening stage of the uniaxial tensile test, and the difference is that the stress condition is a biaxial stress state. Therefore, the dividing point between region II and region III (corresponding to point B in Figure 7b) is regarded as the specimen entering the membrane-stretching stage from bending deformation.
The dividing point (corresponding to point C in Figure 7b) between membrane stretching (region III) and plastic instability (region IV) can be regarded as a balance point between the material force bearing and deformation resisting. When the value of force exceeds the value of Point C, due to the decrease in the local thickness of the specimen, the microvoids in the material continuously nucleate, grow, and coalesce, thus inducing microcracks on the specimen surface. On the macro level, the specimen will become softened, and the slope of the force-displacement curve gradually decreases until the maximum force (corresponding to point D in Figure 7b) is reached and the stage of plastic instability (region IV) ends. Figure 8 shows the SEM images of the three materials observed after the specimens fractured (region V). It can be observed that a large number of dimples formed in the material, and a large number of microvoids coalesced. Then, the force of the specimens begins to drop sharply, which is called an unstable fracture region (region V). is regarded as the specimen entering the membrane-stretching stage from bending deformation.
The dividing point (corresponding to point C in Figure 7b) between membrane stretching (region III) and plastic instability (region IV) can be regarded as a balance point between the material force bearing and deformation resisting. When the value of force exceeds the value of Point C, due to the decrease in the local thickness of the specimen, the microvoids in the material continuously nucleate, grow, and coalesce, thus inducing microcracks on the specimen surface. On the macro level, the specimen will become softened, and the slope of the force-displacement curve gradually decreases until the maximum force (corresponding to point D in Figure 7b) is reached and the stage of plastic instability (region IV) ends. Figure 8 shows the SEM images of the three materials observed after the specimens fractured (region V). It can be observed that a large number of dimples formed in the material, and a large number of microvoids coalesced. Then, the force of the specimens begins to drop sharply, which is called an unstable fracture region (region V).    Figure 9a shows the experiment and simulation results of the three materials used in this study. When compared with the experiment results, the dividing points in the forcedisplacement curve of simulation are more pronounced. The general trends of the experiment and simulation responses are approximately similar in the initial four regions but different in the slopes. This deviation is more pronounced at the initial stage of the forcedisplacement curve, as shown in Figure 9b. The reason can be explained as that, contrary to the experiment, the specimen dies and punch are defined as rigid bodies in the simulation process, but in fact are elastic deformers. In the practical data recording, different  Figure 9a shows the experiment and simulation results of the three materials used in this study. When compared with the experiment results, the dividing points in the force-displacement curve of simulation are more pronounced. The general trends of the experiment and simulation responses are approximately similar in the initial four regions but different in the slopes. This deviation is more pronounced at the initial stage of the force-displacement curve, as shown in Figure 9b. The reason can be explained as that, contrary to the experiment, the specimen dies and punch are defined as rigid bodies in the simulation process, but in fact are elastic deformers. In the practical data recording, different displacement measurement methods lead to the displacement sensor at the top of the punch recording additional displacement due to the elastic deformation of the test frame [39] during the loading process, so that the displacement data recorded by the displacement sensor at the top of the punch (corresponding to SPT test) are slightly higher than the displacement data on the upper face of the specimen (corresponding to SPT simulation). The above situation leads to deviations of the force-displacement curves between the test and simulation. This deviation is also reported in literature [28,32]. displacement measurement methods lead to the displacement sensor at the top of the punch recording additional displacement due to the elastic deformation of the test frame [39] during the loading process, so that the displacement data recorded by the displacement sensor at the top of the punch (corresponding to SPT test) are slightly higher than the displacement data on the upper face of the specimen (corresponding to SPT simulation). The above situation leads to deviations of the force-displacement curves between the test and simulation. This deviation is also reported in literature [28,32].

SPT Simulation Results
(a) (b) Figure 9. Comparison of force-displacement curve between the experiment and FE simulation. (a) at all stages; (b) at the initial stage.

Calibration of Loading System Compliance
In the previous section, we analyzed and explained the reasons for the deviations between the experimental and simulation results. However, the reasonableness of the analytical results for the curves in Figure 9 needs to be further investigated by applying the compliance calibration of the loading system to the experimentally obtained force-displacement curves. It is worth noting that regarding the compliance calibration of the loading system, different compliance calibration methods are used by researchers [27,28,32,39,40,46,49]. The displacement (δupper) on the upper face of the specimen can be derived from the displacements (δext) through an appropriate correction of the respective elastic compliances involved, referred to as Cext [28,32,39,40]. Some researchers [27,43,46] believe that a linear calibration of the force-displacement (δext) response can be achieved with the compliance calibration curve obtained by this method. However, other research-

Calibration of Loading System Compliance
In the previous section, we analyzed and explained the reasons for the deviations between the experimental and simulation results. However, the reasonableness of the analytical results for the curves in Figure 9 needs to be further investigated by applying the compliance calibration of the loading system to the experimentally obtained force-displacement curves. It is worth noting that regarding the compliance calibration of the loading system, different compliance calibration methods are used by researchers [27,28,32,39,40,46,49]. The displacement (δ upper ) on the upper face of the specimen can be derived from the displacements (δ ext ) through an appropriate correction of the respective elastic compliances involved, referred to as C ext [28,32,39,40]. Some researchers [27,43,46] believe that a linear calibration of the force-displacement (δ ext ) response can be achieved with the compliance calibration curve obtained by this method. However, other researchers have argued that the effective contact area with the punch, the applied stress, and the strain field obtained during the SPT are neither constant nor uniform and continuously vary during the test. At the same time, as the punch displacement increases, the deformation of the specimen and the effective contact area of the material increases, and the specimen stiffness gradually increases [27]. Thus, the nonlinear calibration for the force-displacement (δ ext ) response should be considered in the actual compliance calibration to obtain the best match between the test and simulation curves. Meanwhile, Ávila et al. [27] also pointed out that if only the initial stiffness is used to correct the original curve, a large overcorrection may be obtained for high displacement values.
Therefore, a nonlinear compliance calibration curve was used in this paper. The compliance calibration of the loading system was performed using a cylindrical tungsten carbide specimen with a diameter of 10 mm and a thickness of 5 mm instead of the small circular piece in the test. The calibration method is similar to the reference [43]. In the first loading step, the maximum force was obtained, The subsequent force should not be exceeded in the SPT, and some unloading-loading cycles were performed until a steady state of the force-displacement (δ ext ) curve was reached. The last loading step of this calibration test was recorded, and a fifth-order polynomial regression was established from this data as a calibration equation. This curve was used to calibrate the δ ext obtained from the SPT test, which resulted in a new displacement (δ upper ) equal to the displacement of the upper surface of the specimen. It should be noted that the high sensitivity of the calibration process on the test apparatus makes the loading-unloading process unstable. Therefore, in addition to the compliance calibration curves obtained in this study, the correction results for AISI 304L by Moreno et al. [39] and the three calibrations for JRQ by Lucon et al. [49] are also shown in Figure 10. loading step, the maximum force was obtained, The subsequent force should not be exceeded in the SPT, and some unloading-loading cycles were performed until a steady state of the force-displacement (δext) curve was reached. The last loading step of this calibration test was recorded, and a fifth-order polynomial regression was established from this data as a calibration equation. This curve was used to calibrate the δext obtained from the SPT test, which resulted in a new displacement (δupper) equal to the displacement of the upper surface of the specimen. It should be noted that the high sensitivity of the calibration process on the test apparatus makes the loading-unloading process unstable. Therefore, in addition to the compliance calibration curves obtained in this study, the correction results for AISI 304L by Moreno et al. [39] and the three calibrations for JRQ by Lucon et al. [49] are also shown in Figure 10. As can be seen from the results in Figure 10, the final calibration curves are different due to the displacement measurement systems, test apparatus, and punch materials used by the researchers. It can also be found that the results in the reference [47] show that there are still slight deviations in the three compliance calibration results for the same material with the same test apparatus. Therefore, the compliance calibrations in Appendix A of EN 10,371 [21] are given as a recommended range.
The compliance curve in Figure 10 can be expressed in the following form: where δΝ WC is the elastic deformation of the test apparatus and punch, F is the corresponding force. The force-displacement curve in Figure 7 can also be expressed in the following As can be seen from the results in Figure 10, the final calibration curves are different due to the displacement measurement systems, test apparatus, and punch materials used by the researchers. It can also be found that the results in the reference [47] show that there are still slight deviations in the three compliance calibration results for the same material with the same test apparatus. Therefore, the compliance calibrations in Appendix A of EN 10,371 [21] are given as a recommended range. The compliance curve in Figure 10 can be expressed in the following form: where δ N WC is the elastic deformation of the test apparatus and punch, F is the corresponding force. The force-displacement curve in Figure 7 can also be expressed in the following form: where δ exp includes the elastic bending deformation due to specimen deflection plastic indentation of the punch and elastic deformation of the test frame. Thus, the actual displacement of the specimen can be expressed as: where δ upper is the displacement of the upper face of the specimen; the corrected results for the above three materials by Equation (5) are shown in Figure 11. As can be seen, the corrected curves obtained using the above calibration method generally agree well with the simulated curves, with only slight deviations at the initial stages of the curves. Therefore, the deviation between the experiment and simulation results can be attributed to the influence of loading system compliance.

Correlation between SPT Curve and Tensile Properties before and after Compliance Calibration
The above results confirm that the deviation between experimental and FE results in the displacement is caused by the compliance of the testing machine configuration and the displacement measurement method. It is important to note that due to the high sensitivity of the SPT, the disturbance generated in the process of correcting the compliance of the loading system may lead to the failure of the calibration process, and extra care should

Correlation between SPT Curve and Tensile Properties before and after Compliance Calibration
The above results confirm that the deviation between experimental and FE results in the displacement is caused by the compliance of the testing machine configuration and the displacement measurement method. It is important to note that due to the high sensitivity of the SPT, the disturbance generated in the process of correcting the compliance of the loading system may lead to the failure of the calibration process, and extra care should be taken in this process. In fact, due to the different displacement measurement methods and materials in the SPT equipment, the results of the compliance calibration are very different. One of the best methods to avoid this problem is to use LVDT contact displacement measurements directly. This method can directly obtain accurate displacement data at the bottom of the specimen and prevent the possible failure of compliance calibration. However, due to the conditions in different laboratories, direct contact displacement measurements sometimes cannot be used. Thus, the effect of testing machine configuration compliance on the empirical correlation results is worth discussing.

Correlation with Ultimate Tensile Strength (σ UTS )
Researchers widely use the maximum force (F m ) as the characteristic force associated with the ultimate tensile strength (σ UTS ). The primary empirical correlation forms are as follows [50]: where h 0 is the specimen's initial thickness, F m is the maximum force used to determine the force characteristic value of σ UTS , and u m is the value of displacement corresponding to the force characteristic F m . α 1 , α 2 , α 1 ', and α 2 ' are the correlation coefficients related to the material. The results before and after compliance calibration for the three materials are presented in Table 5. Equations (6) and (7), respectively, were used to empirically correlate the maximum force (F m ) and the ultimate tensile strength (σ UTS ) of the three materials before and after compliance calibration. Equations (8) and (9) are the empirical correlation equations before compliance calibration and Equations (10) and (11) are the after-compliance calibration equations. The relation between the SPT characteristic force (F m ) and UTT strength property (σ UTS ) can then be expressed as follows: Before compliance calibration: After compliance calibration: Comparing Equation (8) with Equation (10), it can be seen that the correlation coefficients in the empirical correlation equations obtained from Equation (6) are less affected before and after the compliance calibration and differ only in the correlation coefficient α 2 . However, it can be seen from the results of the compared Equation (9) with Equation (11) that the correlation coefficients are slightly different before and after compliance calibration. The reason for the above situation is that empirical correlation equation, Equation (6), for each material is only concerned with F m , which hardly changes before and after the calibration, and thus the difference is not significant. For the same reason, the empirical correlation equation, Equation (7), for each material is not only relevant to F m but to u m , and u m changed significantly before and after the calibration, so the data obtained after the compliance calibration of Equation (7) are different from before. Suppose the uncorrected curve is used to evaluate the material ultimate tensile strength (σ UTS ); an overestimation of results may be obtained (i.e., the larger value of σ UTS will be obtained by the uncorrected curve compared with the corrected curve with the same F m /(h 0 ·u m )), and therefore unsafe results will be obtained.

Correlation with Yield Stress (σ YS )
The empirical correlation equation of yield stress is usually expressed in the following form, where β 1 and β 2 are the empirical correlation constants, h 0 is the initial thickness of SPT specimen, and F y is the elastic-plastic transition force in the SPT curve. The key to determining yield stress (σ YS ) lies in how to determine the corresponding characteristic force, i.e., elastic-plastic transition force (F y ). Different methods for determining F y are proposed, as described in references [28,50]. In all the current assessments, F y_Mao [51], F y_CEN [22], F y_t/10 [52], and F y_t/100 [6] methods are adopted by most researchers, as shown in Figure 12.
Comparing Equation (8) with Equation (10), it can be seen that the correlation coefficients in the empirical correlation equations obtained from Equation (6) are less affected before and after the compliance calibration and differ only in the correlation coefficient α2. However, it can be seen from the results of the compared Equation (9) with Equation (11) that the correlation coefficients are slightly different before and after compliance calibration. The reason for the above situation is that empirical correlation equation, Equation (6), for each material is only concerned with Fm, which hardly changes before and after the calibration, and thus the difference is not significant. For the same reason, the empirical correlation equation, Equation (7), for each material is not only relevant to Fm but to um, and um changed significantly before and after the calibration, so the data obtained after the compliance calibration of Equation (7) are different from before. Suppose the uncorrected curve is used to evaluate the material ultimate tensile strength (σUTS); an overestimation of results may be obtained (i.e., the larger value of σUTS will be obtained by the uncorrected curve compared with the corrected curve with the same Fm/(h0·um)), and therefore unsafe results will be obtained.

Correlation with Yield Stress (σYS)
The empirical correlation equation of yield stress is usually expressed in the following form, where β1 and β2 are the empirical correlation constants, h0 is the initial thickness of SPT specimen, and Fy is the elastic-plastic transition force in the SPT curve. The key to determining yield stress (σYS) lies in how to determine the corresponding characteristic force, i.e., elastic-plastic transition force (Fy). Different methods for determining Fy are proposed, as described in references [28,50]. In all the current assessments, Fy_Mao [51], Fy_CEN [22], Fy_t/10 [52], and Fy_t/100 [6] methods are adopted by most researchers, as shown in Figure 12.  The Mao method [51] minimizes the error between the linear function and the SPT initial stage curve by establishing two linear procedures, and the resulting intersection point is F y_ Mao, and the intersection point projected vertically onto the SPT curve is F y_CEN . The t/10 and t/100 methods were performed by drawing a parallel line tangent to the elastic region I of the SPT curve. The line was translated along the deflection axis to the points whose values are t/10 and t/100, respectively. The intersection value between the parallel line tangent and SPT curve was determined as the characteristic force F y . The maximum slope of the tangent line corresponding to the slope at the inflection point is suggested in Hähner et al. [40] as the slope of the elastic region I of the SPT curve before compliance calibration, and this method was used in this paper to determine the yield stress (σ YS ) before compliance calibration.
The characteristic forces (F y ) associated with the yield stress (σ YS ) before and after the compliance calibration are listed in Table 6. The relation between the SPT characteristic force (F y ) and UTT material property (σ YS ) can then be expressed as follows: where Equations (13)-(16) as well as Equations (17)- (20) correspond to the four methods for determining F y , respectively. A comparison of the empirical correlation coefficients in the equations above intuitively shows that the differences before and after the compliance correction are obvious for different methods of determining the yield force (F y ). The empirical correlation equations before and after compliance calibration determined using F y_Mao , F y_CEN , and F y_t/10 methods showed small deviations, and the empirical correlation coefficients β 1 and β 2 do not differ significantly. In contrast, the empirical correlation equations determined by the F y_t/100 method showed larger deviations in the empirical correlation coefficients β 1 and β 2 before and after compliance calibration, which indicates that the F y_Mao , F y_CEN , and F y_t/10 methods are less affected by the compliance calibration and are more applicable to the characteristic forces of the above-mentioned study materials. It can also be seen that the regression coefficient R 2 of the empirical correlation equation for yield strength (σ YS ) is relatively small compared to the empirical correlation equation for ultimate tensile strength (σ UTS ), indicating that its empirical correlation equation has a large scattering, which may be caused by the small number of data points. Table 6 illustrates the values of the empirical correlation constant β 1 from the literature of different researchers. It should be noted that for the empirical correlation equation of yield strength, the coefficient β 2 may not necessarily exist, depending on the differences in the mechanical properties of the material and the influence of the test apparatus. Table 7 combined with Equations (17) and (18) shows that the values of β 1 determined by the Mao and CEN methods are close to the values in the literature [5,32,[41][42][43][44][45][46][47][48][49][50][51][52][53]. Therefore, by combining the two perspectives above, F y_Mao and F y_CEN may be the more desirable characteristic forces for the three materials of interest in this study.  Figure 13 compares the different methods of determining the yield force (F y ) for the three materials studied in this paper. It can be seen that the difference in the methods of determining the yield force (F y ) before and after compliance correction is slight. This result indicates that the determination of the yield stress (σ YS ) is not affected by the loading system stiffness. It can also be seen that if the empirical correlation coefficient in Equation (8) is constant, the t/10 method obtains the highest yield strength of the material, while the other three methods obtain the yield strength of the material relatively close. However, this result was demonstrated only for the three materials studied in this paper, and its applicability to other materials remains further confirmed.

Effect of Different Specimen Sizes on SPT Curve
In order to avoid the influence of some neglected factors on the SPT results during the experiments, the subsequent studies of different specimen sizes were based on FE analysis. The different specimen sizes generally used by most researchers are presented in Table 8. The data of 316L stainless steel mentioned above were used for FE simulations The mesh sizes of square specimens and round specimens in the 3D model were identica to avoid the impact of mesh sensitivity on simulation results.

Material
Specimen Diameter (mm) Specimen Shape 316L 8 10 Round specimen Square specimen Figure 14 shows the results of the SPT curves for all cases in Table 7. It can be seen that the FE simulation results for round and square specimens with the same diamete exhibit almost identical SPT curves. This result shows that the SPT curves were not influ enced by the specimen's shape when the specimen periphery was in a fully clamped con dition. Moreover, for specimens with diameters of 8 mm and 10 mm, respectively, there is a high degree of coincidence in the first four regions of the SPT curves. Still, the two curves begin to diverge near the maximum force, and the general downward trend in force remains consistent beyond the maximum force. In general, the effect of specimens diameter on the force-displacement curve is not apparent, and a slight difference induced by the specimen diameter can be found beyond the maximum force point.

Effect of Different Specimen Sizes on SPT Curve
In order to avoid the influence of some neglected factors on the SPT results during the experiments, the subsequent studies of different specimen sizes were based on FE analysis. The different specimen sizes generally used by most researchers are presented in Table 8. The data of 316L stainless steel mentioned above were used for FE simulations. The mesh sizes of square specimens and round specimens in the 3D model were identical to avoid the impact of mesh sensitivity on simulation results.  Figure 14 shows the results of the SPT curves for all cases in Table 7. It can be seen that the FE simulation results for round and square specimens with the same diameter exhibit almost identical SPT curves. This result shows that the SPT curves were not influenced by the specimen's shape when the specimen periphery was in a fully clamped condition. Moreover, for specimens with diameters of 8 mm and 10 mm, respectively, there is a high degree of coincidence in the first four regions of the SPT curves. Still, the two curves begin to diverge near the maximum force, and the general downward trend in force remains consistent beyond the maximum force. In general, the effect of specimens' diameter on the force-displacement curve is not apparent, and a slight difference induced by the specimen diameter can be found beyond the maximum force point. Figure 15 shows the equivalent stress distribution of four different types of specimens of 316L stainless steel when the punch displacement was 1.7 mm. It can be seen from the figure that although the overall stress distribution of square specimens and round specimens is different, the stress distribution of the contact part with the punch at the center of the specimen is roughly the same. The difference between round or square specimens with different diameters is only evident in the non-contact area with the punch.  Table 7. Figure 15 shows the equivalent stress distribution of four different types of s mens of 316L stainless steel when the punch displacement was 1.7 mm. It can be seen the figure that although the overall stress distribution of square specimens and ro specimens is different, the stress distribution of the contact part with the punch a center of the specimen is roughly the same. The difference between round or square s imens with different diameters is only evident in the non-contact area with the punc (a) (b) Figure 14. Force-displacement curve of SPT test for specimens in Table 7. Figure 14. Force-displacement curve of SPT test for specimens in Table 7. Figure 15 shows the equivalent stress distribution of four different types of specimens of 316L stainless steel when the punch displacement was 1.7 mm. It can be seen from the figure that although the overall stress distribution of square specimens and round specimens is different, the stress distribution of the contact part with the punch at the center of the specimen is roughly the same. The difference between round or square specimens with different diameters is only evident in the non-contact area with the punch.    Figure 16 shows the equivalent plastic strain distribution on the bottom surface of the specimen at the maximum force in the path from the center of the specimen to the edge. Combined with the contour plots of the equivalent plastic strain distribution for specimens of different diameters and shapes in Figure 15, it can be more intuitively seen that the different specimen sizes have almost the same equivalent plastic strain distributions for the SPT deformation process. Figure 16 shows the equivalent plastic strain distribution on the bottom surface o the specimen at the maximum force in the path from the center of the specimen to th edge. Combined with the contour plots of the equivalent plastic strain distribution fo specimens of different diameters and shapes in Figure 15, it can be more intuitively see that the different specimen sizes have almost the same equivalent plastic strain distribu tions for the SPT deformation process. Based on the above results, it can be seen that different specimen sizes, i.e., roun and square specimens with different diameters kept clamped around, have little effect o the SPT curve results. This implies that the results obtained are theoretically transferabl and comparable when SPT tests are performed by different researchers using square an circular specimens, respectively. However, this conclusion must be carried out under th condition that the periphery of the specimen is wholly clamped in the experimental re sults.

Conclusions
In this paper, SPT experiments and simulations were carried out on 316L, 347L stain less steels, and a new high-entropy alloy Co32Cr28Ni32.94Al4.06Ti3, and the effect of specime sizes on the SPT curve were investigated by simulation. The main conclusions are as fo lows: (1) The discrepancy of the force-displacement of SPT between the test and the simu lation results is mainly due to the loading system stiffness.
(2) The empirical correlation results for the ultimate tensile strength (σUTS) by Equa tion (6) and for the yield strength (σYS) by Equation (12) have little difference before an after the loading system compliance calibration, but the correlation results by Equatio (7) have a greater difference because of the changes in um before and after the loadin system compliance calibration.
(3) Simulation results on different specimen shapes and sizes show that the effect o specimen shapes and sizes on the SPT results can be ignored. This conclusion was con firmed by the 316L stainless steel simulation results. Therefore, the experimental result obtained based on the above specimen size provide a reference for different researcher even if their specimen shapes and sizes differ.
This investigation shows that insufficient stiffness of a loading system can lead to a inaccurate material properties by the small punch test, but the results after correction ar Based on the above results, it can be seen that different specimen sizes, i.e., round and square specimens with different diameters kept clamped around, have little effect on the SPT curve results. This implies that the results obtained are theoretically transferable and comparable when SPT tests are performed by different researchers using square and circular specimens, respectively. However, this conclusion must be carried out under the condition that the periphery of the specimen is wholly clamped in the experimental results.

Conclusions
In this paper, SPT experiments and simulations were carried out on 316L, 347L stainless steels, and a new high-entropy alloy Co 32 Cr 28 Ni 32.94 Al 4.06 Ti 3 , and the effect of specimen sizes on the SPT curve were investigated by simulation. The main conclusions are as follows: (1) The discrepancy of the force-displacement of SPT between the test and the simulation results is mainly due to the loading system stiffness.
(2) The empirical correlation results for the ultimate tensile strength (σ UTS ) by Equation (6) and for the yield strength (σ YS ) by Equation (12) have little difference before and after the loading system compliance calibration, but the correlation results by Equation (7) have a greater difference because of the changes in u m before and after the loading system compliance calibration.
(3) Simulation results on different specimen shapes and sizes show that the effect of specimen shapes and sizes on the SPT results can be ignored. This conclusion was confirmed by the 316L stainless steel simulation results. Therefore, the experimental results obtained based on the above specimen size provide a reference for different researchers even if their specimen shapes and sizes differ.
This investigation shows that insufficient stiffness of a loading system can lead to an inaccurate material properties by the small punch test, but the results after correction are still trustworthy. It also provides a reference for subsequent researchers to conduct empirical correlation studies using different specimen sizes.