1. Introduction
Small-scale mining (SSM) has devastating impacts on the natural environment when not regulated properly, and many small-scale mining operations are operated illegally. SSM is a low-cost, labor-intensive method of mining [
1] in areas where gold is easily accessible, such as on the river banks where alluvial gold deposits can be found. Land degradation is a consequence of such mining activities and remains an important global issue, as the global demand for precious minerals will continue to increase [
2]. Around two-thirds of the total supply of these minerals comes from countries in South America, South Asia, and Sub-Saharan Africa [
3].
In Ghana, “galamsey” is a term commonly used to describe illegal mining activity in Ghana. Mantey et al. [
4] and Owusu-Nimo et al. [
5] suggest that galamsey operations are an illegal or unregulated form of SSM and processing of gold that lies at or below soil and water surfaces in Ghana. Galamsey operations have historically only been associated with simple tools and manual labor [
6,
7] but the use of mechanized equipment like excavators has recently also come into play [
4,
8], most probably due to the influx of foreign nationals who changed the operational dynamics of galamsey operations [
4]. The precious minerals are gathered discreetly and sold in contravention of state laws [
4,
9]. Galamseyers also do not pay tax, many mines are in delicate or prohibited areas, and often, human safety is put at risk [
5,
10,
11,
12].
Galamsey operations tend to leave behind many wastelands in the form of pits flooded with water, deforested lands, and polluted water bodies [
5,
13] that are hazardous to the local people, livestock and wildlife. Accessing SSM sites is difficult due to their remote locations and is extremely dangerous [
5,
14]. Remote sensing provides a way to monitor such remote locations on a regular basis, but the use of optical imagery is restricted because of the tropical climate of Southern Ghana. Snapir et al. [
15], who attempted to map the expanding areas of galamsey, had difficulty in obtaining cloud-free imagery to evaluate change detection. They only found one cloud-free Landsat-8 image and were unable to find suitable Landsat images for multi-temporal compositing for their study in Ghana. The effective monitoring of such large-scale unregulated mining activity requires a continuous supply of image data that is available year-round, as well as night and day.
Synthetic Aperture Radar (SAR) imagery presents a solution due to the active nature of SAR sensors, which emit microwaves that can penetrate through clouds and also function during the nighttime [
16,
17,
18]. Except for one recently published study by [
19], SAR has not yet been applied to the mapping and monitoring of SSM in Ghana. SAR can provide valuable insights into the specific locations of SSM and the growth rate of galamsey over time [
20]. Questions such as, “In which seasons are SSM most active?” have not yet been answered. Having an uninterrupted supply of usable, easily obtainable imagery, as provided by Sentinel-1, could provide governments with strategic information to improve regulation efforts to address the issues associated with SSM [
4,
5]. Globally, SAR has been used extensively for different applications requiring an active sensing system that is not affected by cloud cover or the time of day, such as the monitoring of ships, wetlands, floods and deforestation [
21,
22,
23]. SAR is also used for the monitoring of crops such as sugarcane [
24], or the mapping of the geomorphology of coastlines [
25] using its scattering properties. Further examples of the application of SAR can be seen in [
26,
27,
28].
Remote sensing has been widely used for the application of land cover classification [
29] as a method for mapping and monitoring land use [
30]. Land cover mapping is essential for the estimation of land cover change because land cover change is directly related to human and natural health and growth [
30,
31,
32]. Typical applications of land cover classification are in the estimation of deforestation, for analyzing the extent of flood events, and determining areas for biodiversity conservation [
30,
33,
34,
35]. Additionally, it can be used for monitoring illegal logging and mining activities in remote locations [
5,
14,
36].
Land cover mapping using remote sensing is driven by the need for knowledge of the spatial distribution of land use and land cover (LULC) [
27]. Remote sensing data has become easily accessible, and the advancement of computers and algorithms has caused immense growth in the field of land cover classification [
37]. Classified images are then used to assist in policymaking and decision-making on the effects of urbanization and industrial development, illegal activities, protecting important natural areas, human settlement planning, food security and more [
31,
38,
39,
40], especially in remote areas where data is scarce. Classification is, therefore, a potential approach to mapping SSM in remote areas.
SSM sites around rivers typically comprise extensive, intermixed areas of bare soil and water pools resulting from the mining process [
14]. A typical pool of water after the mining process has begun is about 20 m in diameter, which challenges the spatial resolution limits of Sentinel-1. However, using multi-date imagery often produces more accurate results than single-date imagery [
14]. The application of multi-temporal filtering of SAR for monitoring SSM has not yet been tested. SAR flood monitoring also provides insight into mapping the water pools of the alluvial SSM sites, by using change detection (e.g., [
41,
42]). Since around the year 2000, machine learning classification algorithms such as random forests have been tested and found to give reliable classification results on SAR imagery [
41,
43,
44,
45]. Different machine learning classifiers have not yet been compared for the classification of SSM with SAR imagery. Classifying SAR imagery is more challenging than classifying optical imagery because ground features are harder to distinguish with the human eye than those that can be seen in optical imagery. Knowing which classifier produces the most accurate and reliable results is important for an effective monitoring system.
Most of the literature with regards to illegal mining or small-scale mining globally is about policymaking, such as the studies by [
1,
6,
46,
47,
48,
49,
50,
51,
52], or the environmental impacts or health assessments that involve manual sampling methods, such as the studies by [
53,
54,
55,
56,
57,
58]. Only a few examples exist of where remote sensing is used to map or monitor SSM and these include [
14,
15,
36,
59], who used optical imagery.
Almeida-Filho and Shimabukuro [
60] have published the only study so far using the backscatter values of SAR to map the degeneration caused by SSM. The aim of their study was to investigate the possibility of using SAR to detect degradation areas caused by independent gold miners, “garimpeiros”, in the Amazon because of the regular cloud cover in the region. Almeida-Filho and Shimabukuro [
60] used three 18-m resolution, L-band, HH polarization Japanese Earth Resource Satellite-1 (JERS-1) images from the years 1993 (dry season), 1994 (rainy season), and 1996 (rainy season). Speckle filtering was applied with a 7 × 7 window kernel and the images were resampled to 30 m resolution to match the resolution of Landsat TM, acquired in 1994, which was used as the reference image. Their study area was the Tepequém plateau, situated in northern Brazil, which consisted of savannah grass that was surrounded by tropical rain forest. They found that the low grassland vegetation produced subtle tonal contrasts in the SAR imagery compared to the high contrast produced by deforestation in studies performed in the forested areas, e.g., [
61,
62,
63]. The similar backscatter responses between the eroded areas and the savannah grass made identification of the degraded areas from gold-mining activities unidentifiable when using a single-date JERS-1 SAR image. Their change detection normalized difference index (NDI) technique showed that the land cover change was detectable with the low tonal contrast of the grasslands. Due to the lack of a reliable classifier for SAR imagery, the authors of [
60] were unable to produce a thematic map of the degradation areas found in their 1993–1994 NDI image.
From the study by Almeida-Filho and Shimabukuro [
60], it is clear that SAR has the potential to detect SSM. Their challenge of detecting the degradation areas in single-date imagery can be tested over forested areas where gold mining takes place, such as in Ghana. The recent publication of research by [
19] proved that SSM in Ghana can be detected with SAR. They used Sentinel-1 time series data and compared the mean, minimum, and maximum backscatter difference images of both polarizations. They found that the minimum backscatter images were the most sensitive to detecting changes caused by mining-induced land cover changes in Ghana and that a threshold value of +1.65 dB was suitable to classify this change. It is, however, uncertain what the impact of multi-temporal filtering of a time series of Sentinel-1 imagery would have on SSM detection accuracies. The use of a wider variety of machine learning algorithms, especially applied to different SAR wavelengths and features, has not been sufficiently investigated.
In this study, we evaluated how different SAR sensors compare regarding the mapping of SSM when applying classification. We tested how single-product speckle-filtered SAR performed, compared with multi-temporal filtered SAR, when applying classification, and which classification algorithm is best suited for the mapping of SSM with SAR imagery.
4. Discussion
By comparing the classification results of the S1 sensor with those from K5, the difference in wavelength is evident. The X-band K5 imagery yielded significantly lower accuracies than the C-band S1 imagery. This may be due to the shorter wavelength of K5, which is less sensitive to differences in roughness between the land cover classes. To test this, an additional classification was performed with just the VH bands of S1 and S1-MT. The results showed that the overall accuracy and kappa coefficient results for Sentinel-1 VH are less accurate than using both polarizations, but they are still more accurate than the K5 results. There is also a bigger difference in overall accuracy and kappa coefficient for the multi-class classification scheme than the binary-water classification scheme. This further supports the notion that the C-band is better suited for this application than the X-band, even at a lower resolution.
The radiometric calibration of K5 may also influence the weaker results. Comparing multi-temporal filtering with single-date filtering showed that S1-MT performed more accurately, with margins of less than 3% in producer’s accuracy for the multi-class classification and margins of less than 2% in producer’s accuracy for the binary-water classification. Multi-temporal filtering reduces the amount of speckle, but the difference in UA and PA between S1 and S1-MT is negligible. This suggests that the time, data, and computation cost of performing multi-temporal filtering might not be worthwhile for this application.
Confusion exists within the multi-class classification scheme, especially between water and bare ground. This is evident by the high omission and commission errors produced by this classification scheme. With the S1 imagery, the water class and the bare ground class are mixed at the ASM sites around the river, due to the use of lower resolution than the K5 imagery. Therefore, the binary-water classification scheme obtained significantly higher accuracies due to the elimination of confusion. This is confirmed by the low omission and commission errors of the binary-water classification scheme. The implication is that while small-scale mining operations cannot be directly detected using this approach, the scattered water pools associated with these operations can be detected to a fair degree of accuracy.
The same trend as in the SAR dataset comparison occurs in the machine learning comparison, where the binary-water classification scheme outperforms the multi-class classification scheme. The difference between the highest-performing dataset and lowest-performing dataset for the multi-class classification scheme is around 21%, whereas the difference for the binary-water classification scheme is around 10%. The difference between the highest performing classifier for the multi-class classification scheme is less than 11%, and the difference for the binary-water classification scheme is less than 7%. The high variance in the multi-class classification scheme results is due to the misclassification of classes that took place during this classification process.
For the machine learning classifier comparison, the random forest classifier was significantly more accurate than the other classifiers, with a probability of 0.87 at a 95% confidence level. Random forest is the most robust of the machine learning algorithms and deals very well with high dimensionality and complex data. C-SVM and SVM also achieved high overall accuracies, where C-SVM outperformed SVM. This is likely because the data structure was of a higher dimensionality, where the C-SVM kernel could find a better fit for the hyperplane to group the data. The decision trees classifier gave the least accurate as well as the least reliable results. This is because of how the trees are split in the algorithm and not iterated through, unlike in the random forest method. KNN is the simplest of the algorithms and performed slightly better than decision trees. KNN is not robust, demonstrating high dimensionality and complex data.
In Ghana, the mapping of illegal mining using remote sensing has only been attempted in one study [
15], where they used multi-temporal optical imagery. They were able to detect the galamsey sites with change detection in Ghana but recommended the use of SAR imagery because of the interference of cloud cover. Bangira et al. [
45] compared different machine learning classifiers where they mapped different types of water bodies; the SVM classifier outperformed the RF classifier, with an average overall accuracy of 91.7% compared to 79.5%. Their study incorporated SAR imagery and optical imagery, as well as indices, into the training of the classifiers. The results from the classification conducted in this study showed that RF outperformed SVM. A likely reason that SVM performed better than RF in Bangira et al.’s [
45] study is that the indices and optical imagery gave more information on the water body types, meaning that it was simpler for the SVM classifier to classify the data. RF is prone to overclassifying, and this may have been the case in their study. Bangira et al. [
45] also assessed the Otsu threshold methods and made the point that for water body mapping, using thresholds is simpler and yields accurate enough results for classification. From studies where floods were mapped with SAR imagery (e.g., [
41,
89,
90,
91]), threshold methods were used, therefore suggesting that mapping SSM with thresholds is a simpler and more effective option.
The lower accuracies obtained from S1 using only the VH polarization, when compared to using both VH and VV, indicate that the combination of these polarizations increases the discriminatory power of the machine learning algorithms. Since the VH-only S1 classification outperformed the VH-only K5 classification, it can also be argued that the difference in accuracy obtained between these two sensors is largely due to the difference in wavelength and not due to the increased dimensionality offered by S1′s dual polarization. In a forested environment such as this, the C-band is therefore preferable to the X-band for mapping SSM.