1. Introduction
Even now, when the forms of structures are ever changing, the number of concrete structures is increasing. As cracks occur in concrete structures and performance degradation appears, disasters resulting from collapse continue to occur, and the number of buildings that require repairs is also increasing. Cracks in concrete structures are often caused by problems in design and construction, sudden application of loads exceeding the design load, and repetitive loads applied over a long period of time [
1]. If loads are applied to structures in which micro-cracks have occurred, cracks grow from the micro-cracks, and surface cracks during construction often lead to structural cracks [
2]. The current safety inspection of structures poses disadvantages in that a lot of time and manpower are required due to the reliance on visual inspection by experts. Research on automatic crack recognition technology to combat these disadvantages using an image processing technique has been receiving attention. The crack image recognition technique can be used to monitor structures in a faster measurement cycle by reducing the required manpower and time, thereby enabling a quick response when serious defects occur in the structures [
3].
Abdel-Qader et al. [
3] extracted cracks from bridge images using fast Haar transform (FHT), fast Fourier transform, Sobel, and Canny, and provide a comparison of four crack detection techniques. Yamaguchi and Hashimoto [
4] claimed that computation time is important in applying an image-based system to a practical application and contributed to the enhancement of image processing speed by proposing a percolation-based algorithm to calculate the circularity of an object during image processing and skip circular objects. Zhang et al. [
5] proposed a method to extract crack images from noisy concrete surface images by using a distance histogram-based shape descriptor. Crack measurement using images has been studied mainly with respect to a variety of structures. Sinha [
6] presented a method to extract cracks from buried concrete pipes, and Ho [
7] presented a method for measuring the cable surface cracks of cable-stayed bridges. Wu [
8] proposed a method for the accurate measurement of cracks in concrete road pavement. Li et al. and Zou et al. proposed the FoSA and CrackTree methods for crack detection from pavement images, respectively [
9,
10].
As such, crack measurement using images has been widely studied. However, in the previous studies, the cracks were extracted in a controlled environment. With nuclear power plants or hydroelectric dams, it is difficult to shoot close-up images, so the image shooting distance becomes large in these cases. In addition, changes in illumination occur according to the weather and time in an outdoor environment, and this effect should be considered when using crack recognition techniques. Meanwhile, if the shooting distance increases, the spatial resolution is reduced, and thus the contrast of the crack is lowered. In this case, the change of illuminance causes the contrast of the crack and concrete background to change. In the outdoor environment, the shooting distance and illuminance cannot be controlled, and these changes affect the crack extraction process using thresholding and boundary detection used in the previous studies, thereby having a significant effect on the crack extraction performance.
Jahanshahi and Masri [
11] suggested a technique that enables crack detection and quantification at any focal length, shooting distance, and resolution by adding the use of depth perception to the crack image recognition technique in an outdoor environment. Jahanshahi et al. [
12] analyzed printed crack images that had crack widths ranging from 0.4 to 2.0 mm by changing the shooting distance from 725 mm to 1760 mm, and determined that as the number of pixels that represent the cracks decreases, the accuracy of the crack width measurement is reduced, and a reduction in shooting distance, increase in focal length, and enhancement of resolution are needed to increase the accuracy of the crack measurement. Li et al. [
13] propose a technique that can measure cracks in a bridge from a long distance. As the measurement distance increased, the crack measurement error became larger, while the light source and ISO did not have a significant effect. Wada and Kono [
14] measured cracks in a hydroelectric dam with a size of 0.2 mm at a distance of 120 m using an 80–400 mm telephoto lens and a 12.6 Megapixel sensor. Jahanshahi [
11,
12] and Li et al. [
13] concluded from their analysis that an increase in shooting distance leads to a reduction of the spatial resolution of the image, and the decrease in the number of pixels that represent the cracks lowers the accuracy of the crack measurement. Li et al. [
13] analyzed the effect of illumination on the accuracy of crack measurement by varying the ISO values and the presence of a light source. In previous studies, the crack measurement error of the crack measurement algorithm developed was verified by varying the shooting distance and the presence of a light source with the acknowledgment that the light source and shooting distance have an effect on the accuracy of the crack measurement [
11,
12,
13]. However, though the measurements showed that changes in the light source and shooting distance caused crack measurement error, the effects on image recognition were not quantitatively analyzed.
In this study, a quantitative analysis of the effects of image acquisition conditions on the surface crack recognition in the specimen image of the concrete structure was conducted through an outdoor experiment. A concrete specimen with openings at regular intervals was fabricated, and the specimen was made to be similar to the bar target so as to facilitate modulation transfer function (MTF) analysis. Images of the concrete crack specimen were taken while increasing the shooting distance according to changes in the illumination occurring during the daytime. Through the inspection of cracks using the images taken during the daytime, the range of crack widths that can be recognized depending on the shooting distance and changing illumination were determined through experimental methods. For the image analysis, image evaluation methods such as edge response analysis and modulation transfer function (MTF) were used, and the effects on human visual perception were objectively analyzed using contrast sensitivity and Munsell value.
  2. Crack Image Analysis Techniques
The ability to recognize cracks in an acquired image is related to the shooting distance and spatial resolution of the camera. The spatial resolution represents the ability of the image acquisition system that can express fine details in the image. The spatial resolution can be calculated using the resolution of the sensor and the focal length of the lens, through which the measurable crack width can be estimated. The spatial resolution is the spatial extent that one pixel of the image occupies in the ground, and the theoretical spatial resolution can be obtained using the geometric relationship between the variables of the camera as shown in Equation (1):
      
xGSD represents the distance of the ground that one pixel of the image has, ximage the size of pixel, Dw the shooting distance, and f the focal length. Ss in Equation (2) is the sensor size, and SR the resolution of the sensor.
However, in the case where the camera is applied to a real environment, a difference arises between the theoretically estimated spatial resolution and the actual resolution. For that reason, research to verify the spatial resolution of the camera in the actual environment has been conducted. Mraz et al. presented a method for verifying the performance of a digital imaging system used in a highway application [
15]. They pointed out that the specification provided by the camera manufacturer has a low level of reliability, and the quality of the image cannot be accurately predicted when other optical systems such as lenses are connected to the camera, and also proposed a method to analyze the factors affecting the image quality by using an optical theory.
  2.1. Crack Edge Response Analysis
Crack edge response analysis is a method for evaluating the sharpness of the boundary between the dark and bright areas in the acquired image. The image obtained by taking a picture of the actual object with a distinct boundary as shown in 
Figure 1a has blurred edge due to the blur phenomenon, as shown in 
Figure 1b [
16].
The edge spread function (ESF) is the response of the system to an ideal edge [
17]. The point spread function (PSF), which is the first-order partial differential function of ESF, is commonly used to indicate the quality of the imaging system. PSF represents the diffusion degree of the points in the image and is used to evaluate the sharpness of the image. 
Figure 2b shows PSF and full width of half maximum (FWHM). In a PSF function, the sharpness is evaluated based on the FWHM size that corresponds to the width at half of the maximum amplitude [
16]; the better the sharpness, the smaller the FWHM.
  2.2. Modulation Transfer Function Method
Modulation Transfer Function (MTF) is one of the techniques for evaluating the spatial resolution. The methods used to obtain MTF include an edge method and a sine wave method. The edge method is used to obtain LSF by differentiating the ESF extracted from the boundary between the bright and dark sides and calculating MTF through a Fourier transform.
In order to convert ESF into MTF, the LSF function should be calculated first, and LSF can be obtained by differentiating ESF as shown in Equation (3) [
18]:
        
The transform of LSF is obtained via the Fourier transform, and the optical transfer function (OTF) is calculated as shown in Equation (4):
        
 is the frequency, where OTF (Optical transfer function) contains both the phase and amplitude information of the signal, which is converted to DFT as shown in Equation (5):
        
The DFT of Equation (4) can be represented by Euler’s formula as in Equation (6), where 
A and 
B are shown in Equations (7) and (8), and MTF can be obtained as the absolute value of OTF as shown in Equation (9):
        
 where 
N is the total number of data points and 
n is an integer.
Another MTF measuring method involves taking a picture of the bar target with a camera; the MTF value is calculated as the relative ratio of modulation (
Mi) of the image and modulation (
Mo) of the object, as shown in Equation (10) [
15]:
        
First, the modulation value for the target region should be calculated. For reflecting targets, M
o is defined as shown in Equation (11) [
15]:
        
 and 
 are the maximum and minimum reflectance at a given uniformly illuminated background [
15]. 
 is defined as in Equation (12):
        
are the maximum and minimum intensity values of the image [
15]. The actual image taken by the camera has lower resolution and contrast than the object, and the MTF value is a measure of the transformation of the spatial frequency entered in the process of creating the image.
  2.3. Contrast Sensitivity Analysis
The contrast in the image that can be perceived by humans varies depending on the intensity of the luminance, and the width of the PSF cannot represent human perception. In order to quantify the ability to distinguish the contrast by considering the human color perception capability, studies on the contrast sensitivity have been conducted. The contrast sensitivity defines the threshold between visible and invisible. The contrast is the value obtained by quantifying the relative luminance of the background and the object, and is represented using Weber contrast (C
W) and Michelson contrast (C
M) methods [
19]:
        
 where 
 and 
 are the maximum luminance and minimum luminance, and 
 is the luminance of the background.
The ability to distinguish the structure through the contrast in the image can also be analyzed through a Munsell value. The Munsell value is known as the lightness scale with perceptually uniform intervals. With respect to the relationship between luminance and Munsell value, as the luminance becomes higher, it is difficult to distinguish the difference in the luminance that can be perceived as shown in 
Figure 3 [
20].
  4. Analysis of Experimental Results
Table 3 shows a summary of the crack specimen images acquired by the experiment. It illustrates a comparison of the crack specimen images acquired by changing the shooting distance from 5 to 100 m at 13 lx and 52,000 lx with the same specimens.
 A comparison of the images acquired at 13 lx and 52,000 lx reveals that the image acquired at 13 lx exhibits the whole dark contrast, whereas the image acquired at 52,000 lx shows a bright contrast. The images acquired from the distance of 5 m show the contrast of the cracks that are darker than background, which makes it easy to identify the cracks due to the clear contrast. It can be confirmed that as the image acquisition distance increases, the contrast of the cracks becomes blurred and it is difficult to recognize the cracks with small widths located in the left part of the crack specimen in the image acquired from a distance of 100 m. The degree to which the contrast of the crack is blurred as the shooting distance increases is slightly different depending on the illumination of 13 lx and 52,000 lx, and thus there is also a difference in the crack width that can be recognized. In order to analyze the difference in the crack widths that remain visible in each image, the brightness of the image was represented using an intensity profile with a range of 0–255. 
Figure 8 shows the intensity profile that represents the contrast of the crack specimen image shown in 
Table 3.
As shown in the upper left corner of 
Figure 8a, the intensity profile displays the contrast of the dotted area (A–B) in a graph. The place where the contrast becomes lower represents the crack area in the image, and the background exhibits a relatively high value. A comparison of the images acquired at 13 lx and 52,000 lx can confirm that as the contrast of the background has a range of about 150 and 240 at 13 lx and 52,000 lx in the image taken from the distance of 5 m, the difference in brightness appears large, whereas the contrast of the crack shows a range of 7 and 19 at 13 lx and 52,000 lx, indicating that there is no big difference. An arrow of 
Figure 8a shows the intensity difference between the crack and the background numerically. At 13 lx, the intensity difference was about 140, and there was an intensity difference of 220 at 52,000 lx, showing that the intensity difference between the crack and the background is larger at 52,000 lx than at 13 lx. However, the intensity profile showed a change in the image of 
Figure 8b, which was acquired by increasing the image acquisition distance to 25 m. The contrast of the crack was shown to rise sharply in the part where the crack width is small on the left area of the graph. As the contrast of the 2 mm crack exhibits 105 and 218 at 13 lx and 52,000 lx, respectively, a rise in the contrast of 98 and 200 can be confirmed. In particular, the contrast rose sharply at 52,000 lx, and the intensity difference from the background due to the rise in the contrast was sharply reduced to 26. The reduction in the intensity difference can be explained as the cause for the 2 mm crack appearing blurred in the image taken from the shooting distance of 25 m in 
Figure 8c. As the image acquisition distance increased, the contrast of the crack rose sharply, and thus the intensity difference decreased. Therefore, it can be confirmed that it becomes difficult to recognize the crack in the crack specimen image.
  4.1. Crack Edge Response Analysis Results
Figure 9 shows a photo in which the crack area is magnified in the crack specimen image. The part shown in red represents the boundary line of the crack and background to extract the edge spread function (ESF). On the basis of the extracted ESF, a line spread function (LSF) was obtained using Equation (3), and then the full width at half maximum of LSF was obtained to analyze the sharpness.
 In 
Figure 10, FWHM of LSF at the boundary line of the crack specimen image acquired at 13 lx and 52,000 lx was calculated and illustrated in the graph.
FWHM showed smaller value at 13 lx than at 52,000 lx, and the overall sharpness was found to be better. The average FWHM at 13 lx and 52,000 lx was 1.79 pixels and 2.29 pixels, showing a difference of about 0.5 pixels, and the maximum FWHM difference was 0.87 pixels. Except for the case of the 10 mm crack, the difference in illumination turned out to generate a difference of 8%–36% in the sharpness of the crack boundary line.
  4.2. MTF Analysis Results
MTF analysis was performed using the edge method and the sine wave method, which are IEC standards. Since the form of the crack specimen is similar to the bar target, two types of analyses could be performed. The edge method that can analyze MTF in a single boundary line without the bar target was used in the short distance image, and the sine wave method was used in the long distance image where the resolution becomes low, and the number of samplings is reduced. First, the edge method was used in the analysis, and the spatial resolution according to the illumination was evaluated by measuring the values of MTF 10% that represents visibility and MTF 50% that represents sharpness in the measured MTF curve.
Figure 11 shows the response of the modulation transfer function (MTF) acquired at the 12 mm crack boundary line from the 5m image acquisition distance using an edge method.
 It was found that the MTF of the crack image acquired at 52,000 lx was more sharply reduced than that of the image acquired at 13 lx. MTF10 of 13 lx and 52,000 lx were 0.430 lp/mm and 0.295 lp/mm, respectively, and the visibility was about 1.6 times higher at 13 lx. MTF50 was 0.167 lp/mm and 0.93 lp/mm, respectively at 13 lx and 52,000 lx, and a sharpness 1.7 times higher was shown at 13 lx.
In 
Figure 12, MTF10 and MTF50 according to the crack widths of the images acquired at the 5 m image acquisition distance were obtained, respectively, and illustrated in the graph.
A comparison of MTF10 of the images acquired at 13 lx and 52,000 lx reveals that a higher value of the average of 0.080 lp/mm was shown at 13 lx than at 52,000 lx, and better visibility was found at 13 lx. In addition, sharpness showed a tendency to be 0.038 lp/mm higher on average at 13 lx than at 52,000 lx. Sharpness and visibility were increased proportionally with the increases in the crack widths, which is considered to be one of the causes of easier recognition of the area with a large crack width in the crack specimen image.
Figure 13 shows a graph of MTF analyzed using the sine wave method.
 If the image acquisition distance increases, MTF decreases from the crack with a small width due to a reduction in spatial resolution. It can be confirmed that as the image acquisition distance increases, the MTF of the image acquired at 52,000 lx shown in 
Figure 13b is more rapidly reduced than that of the image acquired at 13 lx shown in 
Figure 13a. Mraz calculated the minimum recognizable crack width by using MTF10, which corresponds to the spatial resolution of the camera [
15]. Therefore, this study analyzed the effects of illumination on the recognizable crack width according to the image acquisition. distance by assuming the crack width that corresponds to MTF10 in the MTF curve obtained experimentally in 
Figure 13 as the minimum recognizable crack width.
Figure 14 shows the minimum recognizable crack width calculated using MTF10. As the image acquisition distance increased, the minimum recognizable crack width increased, and a difference in the minimum recognizable crack width occurred depending on the illumination. The minimum crack widths at the maximum image acquisition distance of 100 m were 7.4 mm and 9.3 mm, respectively, at 13 lx and 52,000 lx, showing a 1.9 mm difference. The average minimum recognizable crack width difference depending on the illumination was 1.24 mm, and it was confirmed that a crack of a smaller width could be recognized at 13 lx. The shooting distance for crack recognition can be changed by the camera angle, and thus the ground spatial resolution (GSD) unit (mm/pixel) was added to the horizontal axis in 
Figure 14. The relationship between spatial resolution and camera direction is shown in 
Appendix C.
   4.3. Contrast Sensitivity Analysis Results
The minimum recognizable crack width obtained using MTF is a value calculated in consideration of the resolution of a camera and has a difference from the sharpness that is visually felt by humans. 
Figure 15 illustrates the difference between the minimum contrast of the crack and the maximum contrast of the background as the contrast in the acquired crack image.
In the crack image acquired at the 25 m image acquisition distance of 
Figure 15a, the contrast appeared larger at 52,000 lx than at 13 lx in the case of the crack width of more than 8 mm, and the opposite results were obtained with a crack width of less than 6 mm. These results are the same as in the case where the intensity profile of the crack image acquired at 52,000 lx of 
Figure 8b rapidly rises as the crack width increases. In the crack image acquired at the 100 m image acquisition distance of 
Figure 15b, there was almost no difference between the contrast at 13 lx and at 52,000 lx.
However, it can be confirmed that in the image acquired at the image acquisition distance of 100 m in 
Table 3, there is a difference that the crack shows at 13 lx and 52,000 lx. In order to analyze this difference, which is visually felt, the Weber contrast and Michelson contrast were calculated. 
Figure 16 and 
Figure 17 illustrate the Weber contrast (C
W) and Michelson contrast (C
M), calculated using Equations (13) and (14).
In the crack image acquired at the 25 m image acquisition distance of 
Figure 16a and 
Figure 17a, Weber contrast and Michelson contrast were higher at 13 lx than at 52,000 lx; even a crack of a small width has a great effect on the minimum recognizable crack width—Weber contrast and Michelson contrast were higher at 13 lx. In the crack image acquired at the image acquisition distance of 100 m, there was almost no difference between the contrast at 13 lx and 52,000 lx, as shown in 
Figure 15; however, as 
Figure 16b and 
Figure 17b show, the Weber contrast and Michelson contrast showed higher values at 13 lx than at 52,000 lx in the same way as in the above result. This suggests that there are differences in the degree to which the crack looks blurred in the crack image acquired at the image acquisition distance of 100 m, as shown in 
Table 3.
Figure 18 illustrates the contrast, Weber contrast, and Michelson contrast changes depending on the image acquisition distance and crack width in three-dimensional graphs.
 When the image acquisition distance increased, the contrast was reduced from the crack with a small crack width. 
Figure 18b,c shows that the Weber contrast and Michelson contrast exhibited overall higher values at 13 lx than at 52,000 lx.
The Weber fraction for distinguishing the contrast is known to be about 0.1–0.2 [
21,
22]. In this study, the crack width where Weber contrast becomes 0.1 in 
Figure 18b was assumed to be the minimum recognizable crack width, and then the minimum recognizable crack width was estimated depending on the image acquisition distance. 
Figure 19 shows two cases of the minimum crack widths depending on the image acquisition distance at 13 lx and 52,000 lx.
The average of the minimum recognizable crack width of the crack image acquired at 13 lx and 52,000 lx was 2.53 mm and 4.40 mm, showing a difference of 1.87 mm. The minimum crack width of the crack image acquired at the maximum crack image acquisition distance of 100 m was 5.81 mm and 9.34 mm, respectively, at 13 lx and 52,000 lx, showing a difference of 3.44 mm. Thus, it can be confirmed that a crack of a smaller width can be recognized in the crack image acquired at 13 lx. In addition, the Munsell value was used to analyze the effects of illumination on crack recognition in terms of visual perception. 
Figure 20 illustrates the intensity profile (
Figure 8d) of the crack images acquired at the image acquisition distance of 100 m, along with the Munsell value.
The Munsell value shows perceptual characteristics, illustrating that as the brightness is higher, the change of brightness that can be perceived decreases. The difference of Munsell value between the background and the crack image taken at 52,000 lx becomes smaller than that of the image taken at 13 lx. As for the Munsell value of the crack image acquired at 52,000 lx, the background is about 10, and the crack wider than 16 mm is 9, and therefore the brightness can be distinguished by the value difference of 1. However, since the cracks smaller than 14 mm show a value difference of less than 1 from the background, it is difficult to distinguish the difference of brightness between the crack and the background. On the other hand, as the value of the crack image acquired at 13 lx is 8 in the background, showing a value difference of more than 1 from the cracks wider than 8 mm, the brightness difference can be distinguished up to the crack of a smaller width. As a result, the analysis found that the crack widths that show a value difference of more than 1 from the background were 8 mm and 14 mm at 13 lx and 52,000 lx, and therefore the minimum crack width that can be recognized at 13 lx was smaller.
  5. Conclusions
This study objectively evaluated the effects of crack image acquisition conditions for concrete structures on the recognition of the cracks in crack images through outdoor experiments and closely examined the ranges of crack width that can be recognized depending on the illumination. Through the experiments, a specimen with a crack that has a certain width was produced similarly to the bar target. Image analysis techniques such as MTF and contrast sensitivity were used, and the crack widths that can be recognized in terms of visual perception were analyzed. MTF analysis results showed that MTF10 and MTF50 were found to be smallest in the crack images taken at 52,000 lx, which is the maximum illumination during the daytime, and as the shooting distance increased, the recognizable crack widths became relatively large, which makes it more difficult to recognize the cracks. The analysis on the sharpness in the boundary line of the cracks found that the sharpness of the crack image taken at 13 lx was higher than that of the crack taken at 52,000 lx, but the difference was not significant. The changes of the contrast sensitivity depending on the shooting conditions were investigated based on the analysis on the Weber contrast and Michelson contrast of the crack images. The analysis results confirmed that the overall Weber contrast and Michelson contrast were lower in the crack images taken at 52,000 lx, compared to the crack images taken at 13 lx, and the crack image taken at 52,000 lx in the daytime environment has a larger effect on the sharpness reduction than the crack image taken at 13 lx in the low-illumination environment.
If the illuminance is high or low, the contrast of the crack and concrete background increases or decreases. In this study, the maximum illuminance of 52,000 lx and the minimum illuminance of 12 lx were selected as experimental values in order to analyze the effects of the contrast changes on the crack recognition. If illuminance is higher or lower than that used in the experiment, the contrast increases or decreases. Even in this case, the effects can be estimated with reference to this paper.
The contrast and sharpness of the crack are among the most important variables even when cracks are extracted using visual perception as well as image processing techniques, such as thresholding or boundary extraction. Therefore, it is expected that the analysis results of MTF and contrast sensitivity can be used as quantitative indicators that represent the possibility of detecting cracks.