1. Introduction
Environmental conservation is a highly relevant topic in the global environmental context, especially given ongoing climate change. In this sense, the conservation of the Amazon holds special significance, given its vast area of over 5 million square kilometers [1].
Due to the large quantity and diversity of natural resources, the Amazon is often targeted by illegal exploratory activities, with illegal mining on indigenous lands being one of the most prominent [2]. According to [3], 91% of Brazilian mining is located in the Legal Amazon, which can lead to humanitarian problems due to the violent actions of such criminals against indigenous tribes. According to [4], if the Amazon were considered a country, it would have had the fourth highest homicide rate in the world in 2017, demonstrating how ingrained criminality is in this region.
However, for such criminal activities to be carried out, consistent supply lines must be established [5], given the high forest density. In this context, clandestine aviation acts as a logistical ally [6], transporting the materials necessary to sustain these activities. In 2021, there were 456 illegal airstrips within 5 km of mining sites [3], demonstrating the connection between these two factors. Additionally, in 2021 there were 804 airstrips within environmental conservation areas [3], showing the link between this activity and the deforestation of this biome.
Thus, it becomes necessary to map airstrips in the Amazon quickly and accurately. Such a mapping already exists, carried out by [3]. However, some weaknesses in this mapping hinder its use by the competent authorities. Firstly, it was conducted through visual inspection of satellite images, which, given the vastness of the Amazon biome, complicates reproducibility and makes the task slow to repeat. Moreover, images from 2021 were used, making the results less relevant to the present day.
Some works aim to automate the search for such targets. One of the seminal works on identifying these targets was conducted by Alves et al. [7]. In their approach, images from Synthetic Aperture Radar (SAR) sensors were used, and the Circularity Ratio (CR) [8] was calculated to assess the shape of the targets, selecting those that were more elongated and straight.
Additionally, ref. [9] conducted a study to perform segmentation without using Artificial Neural Networks (ANNs). The proposed methodology focused on decomposing the RGB bands into three matrices and then generating a fourth grayscale matrix from them. After that, attributes with greater variability were searched for and grouped using clustering techniques.
In this context, Convolutional Neural Networks (CNNs) have shown very promising results in target detection tasks since the work of [10], with new architectures frequently being released that present superior results. For example, the You Only Look Once (YOLO) architecture, initially conceived by [11], has shown strong results in object detection tasks. Over the years, several versions of this architecture have been created, with new tasks being added. The authors of [12] presented the 8th version of this network, which added the capability to perform image classification and segmentation. The latest version of YOLO is its 11th version [13], which presents improvements over previous versions but only performs object detection tasks.
A brief review of the evolution of research related to CNNs shows how each study contributed to advancements in this field. Beginning with [14], their discoveries about the functional organization of the visual cortex, including receptive fields, orientation columns, and binocular interaction, provided a crucial theoretical foundation that inspired the structure and function of CNNs.
According to [15], “the neocognitron, introduced by Fukushima, is a self-organizing neural network that, after the self-organization process, acquires a hierarchical structure similar to the visual system model proposed by Hubel and Wiesel, allowing visual pattern recognition independent of its position”. This position-invariance feature is fundamental to developing modern CNNs, which use hierarchical layers to extract and recognize complex image features.
Rumelhart et al. [16] introduced the backpropagation technique, which efficiently adjusts weights in CNNs by propagating the error backward through the layers and using the error gradient to minimize the difference between the network’s actual output vector and the desired output vector. Each subsequent layer in a CNN combines features extracted from the previous layers, enabling the detection of increasingly complex features in images.
LeCun et al. [17] contributed by applying the backpropagation algorithm to handwritten digit recognition, such as the MNIST dataset used for determining ZIP codes on mail, demonstrating CNNs’ ability to handle distortions and positional variations.
The findings of [18] further established that standard feedforward neural networks are universal approximators, capable of approximating any measurable function to the desired accuracy. This fact implies that shortcomings in applications can often be attributed to inadequate learning, insufficient hidden units, or stochastic relationships between input and target.
In recent years, ref. [19] noted that “machine learning techniques, especially when applied to neural networks, have played an increasingly important role in pattern recognition system design”. The advances in these techniques have enabled the creation of more accurate and efficient systems capable of handling a wide range of pattern recognition tasks across various fields.
As noted by [20], CNNs have been widely applied in fields such as object detection, fault diagnosis, and image recognition. Since 2010, these applications have emerged as some of the most active research areas in computer vision.
In the literature, several papers discuss target detection algorithms, specifically focusing on the YOLO model. Studies such as [11,21], and more specifically [22,23], develop research with the YOLOv3 model, while [24,25] address YOLOv5. Ref. [26] provides an overview of YOLO models from YOLOv1 to YOLOv8, making it an excellent resource for object detection studies. Additionally, works such as [27] address the limitations of traditional algorithms due to background noise and limited information, proposing improvements such as attention mechanisms in the architecture to optimize the detection of small targets.
Using such techniques to identify targets, ref. [28] performed target classification through transfer learning [29]. In their work, a CNN was used to extract image features, and classification algorithms then used these data to determine whether the images contained airstrips.
Ref. [30] presented the first architecture of the so-called Vision Transformer (ViT), specializing the work of [10] for computer vision. In the following years, several improvements were made, such as the Swin Transformer of [31], which enhanced the architecture further. Later, ref. [32] presented the Global Context Vision Transformers (GCViT), which showed very promising results in image classification.
The literature review reveals that studies on airstrip detection in the Amazon often address isolated aspects of the problem rather than providing a comprehensive solution. A notable exception is the work of [33], which employs various techniques to identify these targets. However, several areas in their proposed solution still require improvement. This study contributes by enhancing the algorithm of [33], reducing the number of false positives while maintaining high recall, addressing limitations in the original approach to better meet the specific challenges of this detection task. Additionally, a key focus is improving the algorithm’s inference time, enabling faster target identification and more efficient monitoring.
This work is organized as follows: Section 1 presents the introduction, Section 2 covers the materials and methods, Section 3 discusses the results, and Section 4 provides the concluding remarks.
2. Materials and Methods
In this section, we present the entire methodology related to the work. First, we will discuss data acquisition and processing to generate the datasets for training the CNNs. We will then present the proposed algorithm, the metrics created to evaluate performance, and the experiment conducted. It is worth noting that all experiments were carried out on a computer with an AMD Ryzen 9 7900X CPU, 256 GB of RAM, and an Nvidia RTX 4090 GPU.
2.1. Data Acquisition
To generate the dataset for training Artificial Intelligence (AI) techniques, we used the mapping conducted by [3], as shown in Figure 1. This mapping provides the spatial locations of 2869 airstrips as of 2021. To capture the targets in their entirety, we created 2 × 2 km square cutouts. This size was chosen because the illegal airstrips we aim to find are relatively small, so a 2 km square captures most targets to their full extent. In addition, these cutouts make the targets more evident relative to the size of the analyzed image: satellite scenes typically cover large areas, meaning the targets would appear very small compared to the total image size. Furthermore, the available hardware was unable to process entire satellite scenes.
Using the airstrip locations, we selected the “Image © 2023 Planet Labs PB” dataset from the Planet satellite constellation [34], which provides images with a spatial resolution of 4.7 m, enabling clear visualization of the targets. Additionally, these images consist of four spectral bands, three in the visible region (RGB) and one in the near-infrared (NIR) region, each with 16-bit radiometric resolution. Using the previously obtained squares, we cropped the targets from scenes acquired in June 2021 to ensure temporal alignment between the mapping and the dataset. After this, images in which the target was difficult to identify were excluded, resulting in a final set of 1989 Planet constellation images of airstrips. These exclusions were made because, in some images, heavy cloud cover obstructed visibility, making it impossible to observe the target.
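For illustration, the cropping step can be written as a small rasterio routine that reads a 2 × 2 km window around a mapped airstrip; the library choice, the file layout, and the assumption of a metric CRS are ours, not details given in the original pipeline.

```python
import rasterio
from rasterio.windows import from_bounds

HALF_SIDE_M = 1000  # half of the 2 km square, in meters

def crop_around(scene_path: str, x: float, y: float):
    """Read a 2 x 2 km cutout centered on (x, y), given in the scene's CRS.

    Assumes a projected (metric) CRS such as UTM; geographic coordinates
    would need to be reprojected first.
    """
    with rasterio.open(scene_path) as src:
        window = from_bounds(
            x - HALF_SIDE_M, y - HALF_SIDE_M,
            x + HALF_SIDE_M, y + HALF_SIDE_M,
            transform=src.transform,
        )
        return src.read(window=window)  # array of shape (bands, rows, cols)
```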
Furthermore, the algorithm must detect the target using different satellite imaging sensors, as the most recent data may only be available from a specific constellation, depending on the sensor’s temporal resolution. Therefore, to achieve such generalization, the same process was applied to images from the Sentinel-2 satellite [35,36], selecting the same four bands as those from the Planet constellation. This database contains images with a 10 m spatial resolution, which still allows for clear identification despite offering a lower target definition. As before, the cutouts were taken from images from June 2021. After the visual verification stage, we obtained 2468 Sentinel-2 images of airstrips.
Figure 2 shows an example of the same airstrip for both sensors.
To generate the dataset of images that do not contain airstrips, we conducted an empirical sampling of the Amazon. The reason for this is that, due to the high forest density of the biome, a random sampling would produce images with very similar shapes, mostly representing forest areas. Therefore, images were empirically selected to contain shapes resembling airstrips, such as rivers, cities, and highways.
Figure 3 shows two samples contained in this set.
2.2. Data Processing
When examining the images, we noticed a mismatch in the radiometric resolutions of the sensors: Planet images have 16-bit resolution, while Sentinel-2 images have 12-bit resolution. Therefore, we rescaled all cutouts to 8 bits so that all images share the same per-pixel value scale. This rescaling was performed because the YOLO-series CNNs are implemented using the Ultralytics framework [12], which only supports images with 8-bit resolution and a maximum of three bands. Furthermore, we considered only the visible bands: as Figure 2 shows, the target is well represented in these bands, and this was also the best-performing configuration reported in the original study [33], which this work aims to improve.
For the Planet images, we standardized the image dimensions by cropping each image to 352 × 352 pixels. This value was chosen because the smallest dimension among the raw images was 366 pixels, and the dimensions needed to be multiples of 32 to be used in the CNNs; 352 is the largest multiple of 32 that does not exceed 366, satisfying both criteria.
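A minimal sketch of these two operations is given below; the linear stretch from the sensor's full bit depth and the centered crop are our assumptions, as the exact rescaling function and crop anchor are not specified.

```python
import numpy as np

def to_uint8(img: np.ndarray, bit_depth: int) -> np.ndarray:
    """Linearly rescale a 12-bit (Sentinel-2) or 16-bit (Planet) image to 8 bits."""
    max_val = 2 ** bit_depth - 1
    return np.round(img.astype(np.float32) / max_val * 255.0).astype(np.uint8)

def center_crop(img: np.ndarray, size: int = 352) -> np.ndarray:
    """Crop a (rows, cols, bands) image to size x size around its center."""
    r0 = (img.shape[0] - size) // 2
    c0 = (img.shape[1] - size) // 2
    return img[r0:r0 + size, c0:c0 + size]
```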
Due to the difference in spatial resolution, Sentinel-2 images had much smaller dimensions than Planet images. Therefore, we upsampled all Sentinel-2 images to 352 × 352 pixels using bicubic interpolation [37].
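This resampling is a one-liner with OpenCV's bicubic mode; the library choice is ours, and any resampler with a bicubic kernel would serve.

```python
import cv2
import numpy as np

def upsample(cutout: np.ndarray, size: int = 352) -> np.ndarray:
    """Bicubic upsampling of a Sentinel-2 cutout (about 200 x 200 px at 10 m)."""
    return cv2.resize(cutout, (size, size), interpolation=cv2.INTER_CUBIC)
```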
2.3. Base Algorithm
In this section, the base algorithm proposed by [33] is presented, a five-stage algorithm. The first stage involves an iterative search of the scenes. To reduce potential issues with targets that span multiple scenes, the current image is extended by 2 km using the spatially adjacent scenes. Then, 352 × 352 pixel cutouts are made from the images with 75% overlap between successive cutouts. There are two reasons for this overlap. The first is to reduce the likelihood that no cutout fully covers a target in the image. The second is that the previously generated datasets have centered targets, so a greater overlap increases the probability of obtaining a cutout with the target centered. However, there is a trade-off, as increasing the overlap also increases the computational cost of the algorithm. Thus, 75% was the empirically determined value that balances these factors.
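The sliding-window search described above reduces to a simple generator; the sketch below reproduces the 352-pixel patch and the 88-pixel step (75% overlap).

```python
import numpy as np

PATCH = 352
STEP = 88  # PATCH * (1 - 0.75): successive cutouts overlap by 75%

def iter_patches(scene: np.ndarray):
    """Yield (x, y, patch) for every 352 x 352 window of an (H, W, C) scene."""
    h, w = scene.shape[:2]
    for y in range(0, h - PATCH + 1, STEP):
        for x in range(0, w - PATCH + 1, STEP):
            yield x, y, scene[y:y + PATCH, x:x + PATCH]
```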
The second stage involves an image classification algorithm that determines whether the current cutout contains the target or not. This step acts as a filter for the subsequent steps, forwarding only the cutouts most likely to contain the target. After this, the segmentation of the cutout is performed using a pre-trained segmentation algorithm to generate the specific shape of the target.
In the fourth stage, post-processing of the generated segment occurs. First, the minimum bounding box of the segment is generated using the algorithm presented by [38], and the smallest dimension of this rectangle is checked: if it is less than 250 m, the segment is discarded. This threshold was chosen because we verified that the smallest landing strip in the training set was 434.23 m long, making 250 m a reasonable cutoff for discarding detections. This step is important because small scars in the forest, despite their size, can closely resemble landing strips.
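The size check can be sketched with OpenCV's rotated-rectangle fit, as below; the use of cv2.minAreaRect in place of the exact routine of [38], and the per-sensor pixel size, are our assumptions.

```python
import cv2
import numpy as np

MIN_SIDE_M = 250.0

def passes_size_check(mask: np.ndarray, pixel_size_m: float = 4.7) -> bool:
    """Keep a segment only if its minimum bounding box is at least 250 m wide."""
    contours, _ = cv2.findContours(
        mask.astype(np.uint8), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE
    )
    if not contours:
        return False
    # minAreaRect returns ((cx, cy), (width, height), angle)
    (_, _), (w, h), _ = cv2.minAreaRect(max(contours, key=cv2.contourArea))
    return min(w, h) * pixel_size_m >= MIN_SIDE_M
```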
Still in the fourth stage, the centroid of the segment is calculated and converted to a pixel coordinate. Knowing the position of the centroid within the cutout and the position of the cutout within the entire scene, it is possible to determine the centroid’s position in the complete scene. With this position, and using the georeferencing of the original image, the geographic location of the landing strip can be obtained.
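Translating the centroid to a geographic location then amounts to an affine-transform lookup, sketched here with rasterio (the function and variable names are illustrative):

```python
import rasterio

def centroid_to_coords(scene_path: str, patch_x: int, patch_y: int,
                       centroid_col: float, centroid_row: float):
    """Map a centroid inside a patch to coordinates in the scene's CRS."""
    with rasterio.open(scene_path) as src:
        col = patch_x + centroid_col  # position in the complete scene
        row = patch_y + centroid_row
        x, y = src.transform * (col, row)  # affine: pixel -> CRS coordinates
        return x, y
```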
In the fifth and final stage, clustering of the points generated by the algorithm is performed. This stage uses a modified version of hierarchical clustering [39], which merges points based on a distance threshold: if two points are within this threshold, they are merged into a cluster. The reason for this is the significant overlap between cutouts, which means a valid target is likely detected by more than one cutout. Clustering therefore merges these points to avoid multiple predictions of the same target. Given the required size of the target, a merging distance of 1250 m was used. Algorithm 1 presents the pseudo-code of the base algorithm; a sketch of the merging step follows it.
Algorithm 1 Landing Strip Detection Algorithm Presented in [33]

1:  function LandingStripDetection(image)
2:      Input: image, a georeferenced satellite image with dimensions W × H
3:      Output: a set of geographical coordinates of detected landing strips
4:      Step 1: Patch Extraction and Processing
5:      Patch size: 352 × 352 pixels
6:      Step size for patch movement: 88 pixels (75% overlap)
7:      Initialize the list of detected centroids: C ← ∅
8:      for y ← 0 to H − 352 with step size 88 do
9:          for x ← 0 to W − 352 with step size 88 do
10:             Extract patch P from image with top-left corner at (x, y)
11:             Step 2: Classification
12:             Apply the classification function f(P)
13:             if f(P) = 0 (Non-Landing Strip) then
14:                 Discard the patch
15:             else
16:                 Step 3: Segmentation
17:                 Generate the segment S
18:                 Step 4: Bounding Box Size Check
19:                 Calculate the minimum bounding box B of segment S
20:                 Let d be the smallest dimension of B
21:                 if d ≥ 250 meters then
22:                     Calculate the centroid c of the segment
23:                     Append c to the list C
24:                 end if
25:             end if
26:         end for
27:     end for
28:     Step 5: Clustering of Detected Points
29:     Perform clustering on the centroids C using hierarchical clustering
30:     Merge points within a distance threshold of 1250 meters
31:     return the final set of clustered centroids representing the predicted landing strips
32: end function
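The merging step of Stage 5 can be approximated with single-linkage hierarchical clustering cut at the 1250 m threshold; the sketch below, including the averaging of merged points, is our reading of the procedure rather than the authors' exact modification.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage

def merge_detections(points_m: np.ndarray, threshold_m: float = 1250.0) -> np.ndarray:
    """Merge centroids within threshold_m of each other; points_m has shape (N, 2), in meters."""
    if len(points_m) < 2:
        return points_m
    labels = fcluster(linkage(points_m, method="single"),
                      t=threshold_m, criterion="distance")
    # represent each cluster by the mean of its member points
    return np.array([points_m[labels == k].mean(axis=0)
                     for k in np.unique(labels)])
```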
2.4. Modified Algorithm
Analyzing the algorithm presented by the authors of [33], it was observed that, despite the high recall achieved, the number of false positive predictions was relatively high, even in the best result presented.
To address this issue, several improvements were introduced in the fourth step of the algorithm. Firstly, the Normalized Difference Water Index (NDWI) [40] is calculated for each pixel of the generated segment using the original image bands, without any prior processing. Since, in production, the algorithm receives the raw satellite image without the preprocessing steps mentioned earlier, it is possible to retrieve these data. This step checks whether any pixel in the image represents a water body; if so, the cutout is discarded. This is crucial because the Amazon region contains numerous water bodies, which can, depending on the cutout, resemble the shapes of landing strips. Additionally, when the minimum bounding box of the generated segment is calculated, the Circularity Ratio (CR) is also computed. If the CR is greater than 0.1, the cutout is discarded; this threshold was determined empirically through extensive testing.
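Both filters are simple per-segment computations, sketched below with McFeeters' NDWI, (Green − NIR)/(Green + NIR), water flagged at NDWI > 0, and the common circularity definition CR = 4πA/P²; the exact water threshold and CR variant used by the authors are our assumptions.

```python
import numpy as np

def has_water(green: np.ndarray, nir: np.ndarray) -> bool:
    """Flag the cutout if any pixel looks like water (NDWI > 0)."""
    g = green.astype(np.float32)
    ndwi = (g - nir) / (g + nir + 1e-9)
    return bool((ndwi > 0).any())

def circularity_ratio(area_px: float, perimeter_px: float) -> float:
    """CR = 4*pi*A / P**2: close to 1 for a circle, near 0 for elongated strips."""
    return 4.0 * np.pi * area_px / (perimeter_px ** 2 + 1e-9)

# a cutout is discarded when has_water(...) is True or circularity_ratio(...) > 0.1
```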
Furthermore, after calculating the segment’s centroid, this location is checked for proximity to any federal or state highway [41]. If the distance is less than 50 m, the location is discarded. This step is important for eliminating potential false positives, as the cutout provides only regional context rather than a complete view of the entire scene, and highways often have shapes that closely resemble landing strips.
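The highway filter is a plain distance query against the road vector data of [41]; a sketch with shapely, assuming both geometries are expressed in a metric CRS:

```python
from shapely.geometry import Point

def near_highway(centroid_xy: tuple, highways, threshold_m: float = 50.0) -> bool:
    """True if the centroid lies within 50 m of any highway LineString."""
    point = Point(centroid_xy)
    return any(point.distance(road) < threshold_m for road in highways)
```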
To improve the algorithm’s computational time, the entire process described above was parallelized using 4 threads, meaning that 4 scenes are processed in parallel, each going through the steps mentioned. The number of threads was chosen as the highest value that did not cause memory issues on the GPU.
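This scene-level parallelism can be expressed with a four-worker thread pool; process_scene and scene_paths below are placeholders for the per-scene pipeline and input list described above.

```python
from concurrent.futures import ThreadPoolExecutor

# 4 workers: the largest count that fit in GPU memory in our setup
with ThreadPoolExecutor(max_workers=4) as pool:
    detections = list(pool.map(process_scene, scene_paths))
```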
Additionally, a new step was added to the algorithm after the generation of the locations. This step involves obtaining images with dimensions of 2 × 2 km around all the generated locations. With these images, a new classification step is performed using a different classifier from the previous one. The reason for this new step is that, in the initial stage, due to the nature of the search conducted, the targets were unlikely to be centered, which could confuse the algorithm and result in a high number of false positives. However, with the locations of the predictions, it is now possible to have images with the targets centered, which enhances this classification step. In Algorithm 2, the modified algorithm is presented.
Algorithm 2 Modified Landing Strip Detection Algorithm

1:  function ModifiedLandingStripDetection(image)
2:      Input: image, a georeferenced satellite image with dimensions W × H
3:      Output: a set of geographical coordinates of detected landing strips
4:      Step 1: Patch Extraction and Processing
5:      Patch size: 352 × 352 pixels
6:      Step size for patch movement: 88 pixels (75% overlap)
7:      Initialize the list of detected centroids: C ← ∅
8:      for y ← 0 to H − 352 with step size 88 do
9:          for x ← 0 to W − 352 with step size 88 do
10:             Extract patch P from image with top-left corner at (x, y)
11:             Step 2: Classification
12:             Apply the classification function f(P)
13:             if f(P) = 0 (Non-Landing Strip) then
14:                 Discard the patch
15:             else
16:                 Step 3: Segmentation
17:                 Generate the segment S
18:                 Step 4: Enhanced Validation
19:                 Calculate the NDWI for the pixels of segment S
20:                 if any pixel in the segment represents a water body then
21:                     Discard the patch
22:                 end if
23:                 Calculate the minimum bounding box B of segment S
24:                 Calculate the Circularity Ratio CR of S
25:                 if CR > 0.1 then
26:                     Discard the patch
27:                 end if
28:                 Calculate the centroid c of the segment
29:                 if the distance from c to any federal or state highway < 50 meters then
30:                     Discard the location
31:                 else
32:                     Append c to the list C
33:                 end if
34:             end if
35:         end for
36:     end for
37:     Step 5: Clustering of Detected Points
38:     Perform clustering on the centroids C using hierarchical clustering
39:     Merge points within a distance threshold of 1250 meters
40:     Step 6: Enhanced Classification
41:     for each centroid c in C do
42:         Obtain an image of dimensions 2 × 2 km around c
43:         Apply the new classification function g on this image
44:         if g = 1 (Landing Strip) then
45:             Add c to the final list of predicted landing strips
46:         end if
47:     end for
48:     return the final set of predicted landing strips
49: end function
2.5. Training Parameters
As presented in the previous sections, the modified algorithm requires three distinct models: two for classification and one for segmentation. For the first classification algorithm, YOLOv8 was used, as in the original paper. For the second classification task, the GCViT network was employed due to its excellent results in classification tasks. YOLOv8 was again utilized for the segmentation task, as in the original paper.
For the training of neural networks, defining certain hyperparameters is necessary. Initially, the batch size for all training executions was set to 16, as this was the largest value that did not cause memory issues.
Additionally, to reduce the possibility of overfitting during training, learning rate scaling was implemented using the CosineAnnealingLR strategy [42], as defined by the following equation:

$$\eta_t = \eta_{\min} + \frac{1}{2}\left(\eta_{\max} - \eta_{\min}\right)\left(1 + \cos\left(\frac{t}{T_{\max}}\pi\right)\right)$$

where $\eta_t$ represents the current learning rate at epoch $t$, and $T_{\max}$ represents the maximum number of training epochs. The parameters $\eta_{\min}$ and $\eta_{\max}$ define the minimum and maximum learning rates, which are set before the start of training.
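In PyTorch, this schedule corresponds directly to torch.optim.lr_scheduler.CosineAnnealingLR; a minimal sketch, where the model, the training step, and the hyperparameter values are placeholders:

```python
import torch
from torch.optim.lr_scheduler import CosineAnnealingLR

eta_min, eta_max, num_epochs = 1e-5, 1e-2, 100  # placeholder values
model = ...            # classification or segmentation network
train_one_epoch = ...  # one pass over the training data

optimizer = torch.optim.SGD(model.parameters(), lr=eta_max)
scheduler = CosineAnnealingLR(optimizer, T_max=num_epochs, eta_min=eta_min)

for epoch in range(num_epochs):
    train_one_epoch(model, optimizer)
    scheduler.step()  # anneal the learning rate following the equation above
```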
We searched for the best hyperparameter values: both $\eta_{\max}$ and $\eta_{\min}$ were searched over predefined ranges divided into 20 equally spaced intervals. Additionally, we searched for the best optimizer for training, selecting between Adam and Stochastic Gradient Descent (SGD) [43].
For hyperparameter tuning, the recall metric was used for the classification training, while the segmentation training used Intersection over Union (IoU). IoU is calculated as the ratio of the number of pixels in the intersection between the predicted and ground-truth segments to the number of pixels in their union [44].
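For binary masks, IoU reduces to a few array operations:

```python
import numpy as np

def iou(pred: np.ndarray, truth: np.ndarray) -> float:
    """Intersection over Union between two binary segmentation masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    union = np.logical_or(pred, truth).sum()
    return float(np.logical_and(pred, truth).sum() / union) if union else 0.0
```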
For YOLOv8, the loss function chosen is the standard one defined by [12] for the network. For GCViT, binary cross-entropy [45] was chosen due to its extensive use in image classification tasks.
To reduce overfitting during training, data augmentation was applied to the selected images [46]. The selected operations included horizontal and vertical flips, translation limited to 20% of the image dimensions, rotation by a random angle between 0 and 90° counterclockwise, and removal of a rectangle with dimensions corresponding to 15% of the image at a random position. These operations are not all applied simultaneously: during each training epoch, each image undergoes a random combination of the aforementioned operations, with each operation having a 50% probability of being applied in that epoch.
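An equivalent pipeline can be written with the albumentations library (1.x API); the library choice and the exact parameterization of the rectangle removal are our assumptions.

```python
import albumentations as A

PATCH = 352
transform = A.Compose([
    A.HorizontalFlip(p=0.5),
    A.VerticalFlip(p=0.5),
    A.Affine(translate_percent=(-0.2, 0.2), p=0.5),  # translation up to 20%
    A.Rotate(limit=(0, 90), p=0.5),                  # random angle in [0, 90] degrees
    A.CoarseDropout(max_holes=1,                     # remove one rectangle
                    max_height=int(0.15 * PATCH),
                    max_width=int(0.15 * PATCH),
                    p=0.5),
])
# each operation fires independently with probability 0.5 every epoch
```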
2.6. Performance Metrics
As in the original work, to enable comparison in this study, a predicted location is considered a true positive (TP), or a correct detection, if it is within a certain distance threshold from a previously mapped location. Conversely, a predicted location is classified as a false positive (FP) if no mapped location exists within the chosen distance threshold. Finally, a mapped location is considered a false negative (FN) if no predicted location falls within the defined distance threshold.
To ensure the effectiveness of this metric, only one correct detection is considered per mapped location. For example, if two predicted locations are within the distance threshold of a mapped location, only one correct detection will be counted.
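Under these definitions, the counts follow directly from the pairwise distance matrix; the sketch below assumes planar coordinates in meters (a production version would use geodesic distances) and defaults to the 1500 m threshold adopted later in this section.

```python
import numpy as np
from scipy.spatial.distance import cdist

def evaluate(pred_xy: np.ndarray, true_xy: np.ndarray, threshold_m: float = 1500.0):
    """Count TP, FP, FN with at most one correct detection per mapped strip."""
    d = cdist(pred_xy, true_xy)                      # pairwise distances, meters
    tp = int((d.min(axis=0) <= threshold_m).sum())   # mapped strips with a nearby prediction
    fn = len(true_xy) - tp                           # mapped strips missed entirely
    fp = int((d.min(axis=1) > threshold_m).sum())    # predictions far from every mapped strip
    return tp, fp, fn
```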
When analyzing the impact of detections, it is evident that a false negative carries more weight than a false positive. A false negative means that a landing strip was not detected by the algorithm, implying that a potential focal point of illegal activity would not be identified by authorities. On the other hand, a false positive indicates a predicted location that does not correspond to a real landing strip, which is a less significant error since all locations must be visually validated by authorities. However, the number of false positives should be controlled, as a higher number would delay the visual validation process.
Therefore, to evaluate which network composition produces superior results, the recall metric will be used, defined as follows:

$$\text{Recall} = \frac{TP}{TP + FN}$$
The recall metric will be used to determine how many landing strips were detected by the algorithm in relation to the total number of landing strips. Additionally, the number of false positives will be analyzed to minimize such errors. Thus, the best composition will be the one that achieves the highest recall and the lowest number of FPs.
A distance threshold of 1500 m between predictions and actual targets will be used to calculate performance metrics. This is because landing strips, being of considerable length, may have predictions that do not exactly coincide with the center or any specific point of the strip. A smaller distance could result in counting errors, as a prediction that represents a point on the strip might still be considered incorrect if it is not exactly at the center.
2.7. Experiment Across the Amazon Region
To compare the results of the algorithms, tests were conducted in the same area as in the original paper, using the same images. Regarding the modified algorithm, two tests were performed: one without the second classifier (GCViT) and another with both classifiers, in order to evaluate the impact of the second classifier.
Figure 4 shows the test region where the algorithm was evaluated. A visual inspection of all targets present in June 2023 was carried out in this region, and images from that month were acquired to compare the performance of the original and modified algorithms. In the original paper [33], results for different dataset compositions are presented for this area; in this study, the comparison is made against the best-performing dataset composition from the original work.
After that, the modified algorithm will be applied across the entire Amazon biome. A total of 22,164 Planet images from June 2023 were acquired, amounting to 2.8 terabytes of data and covering the entire Legal Amazon region. The algorithm will be run on the same images used for the original algorithm and, as in the original paper, a visual inspection of the images was conducted to verify which landing strips mapped by MapBiomas remained in the same locations on the date the images were captured. This ensures that the evaluation reflects current conditions, allowing for a meaningful comparison between the modified and original algorithms.
4. Conclusions
The Amazon is the largest and most diverse biome on Earth, containing an immense wealth of natural resources. Because of this, irregular exploitation is an urgent issue in the Brazilian socio-environmental context, driving up crime rates and often leaving deep scars on indigenous tribes. Many of these activities face logistical problems, since the Amazon’s high forest density makes it complex to establish supply lines. Clandestine aviation therefore becomes an ally, enabling the delivery of necessary materials to any location in the forest with considerable speed.
In this context, this work presented a modification of the seminal algorithm for solving this problem [33], aiming to address some of its shortcomings. For this purpose, pre-existing MapBiomas mappings were used to compose the training datasets for the neural networks. The modifications caused a slight drop in the recall of identified targets in both tests; however, there was a significant reduction in the number of false positives, which was a notable issue in the original algorithm.
In the test conducted on a specific region of the biome, the recall dropped by less than 1%, but there was a 26.6% reduction in the number of false positives. For the test applied across the entire Amazon biome, the recall drop was slightly larger, at 1.7%, but the number of false positives decreased by 17.88%. These results indicate a substantial improvement in the algorithm, as the significant reduction in false positives leads to less time required for visual inspection of the predictions, thus enhancing the speed and efficiency of biome mapping. This, in turn, contributes significantly to Brazil’s environmental protection efforts.
One important point is that this improvement comes with a small loss in recall. However, since the recall values remain high, this trade-off is justified by the substantial reduction in false positives.
Regarding future work, it would be important to explore how the modified algorithm can be implemented for large-scale, ongoing mapping of the biome. Furthermore, the recently released YOLOv11 network [13] could be an interesting addition to the presented algorithm for further performance gains. It would also be worthwhile to apply the proposed methodology in other biomes worldwide to assess the overall effectiveness of the technique, to check for existing water-body maps of the Amazon that could be incorporated into the algorithm, and to conduct tests with other classical techniques to compare their performance against CNNs.