Marine Oil Pollution in an Area of High Economic Use: Statistical Analyses of SAR Data from the Western Java Sea

In this paper, we analyze more than 2000 Synthetic Aperture Radar (SAR) images of the Western Java Sea acquired by Sentinel-1 SAR-C and ENVISAT ASAR, with the aim to generate oil pollution statistics for a sea region of high economic use. The spatial distributions show that most oil pollution occurs along the major shipping routes and at oil production sites in that area. The majority of the spills have sizes of less than 1 km2 and an axial ratio smaller than 10. For two sets of SAR images, we compared the results obtained by different operators, who analyzed the same images. While more than 50% of the spills were not found by both operators, the overall spatial patterns derived from their results are the same. Our results indicate that the observed differences are mainly due to lookalikes, which can easily be confused with oil spills, but also due to small oil spills that were overseen by one of the operators. These assumptions are supported by the fact that the percentage of spills jointly found by both operators increased when only oil spills were considered that were found on SAR images acquired at higher mean wind speeds.


Introduction
Pollution by oil spills affects the marine environment, and to sensitive environments such as Marine Protected Areas (MPA) it poses a major threat. Through this, oil pollution also affects coastal communities, due to impacts on health, fishing industries and tourism [1]. While oil pollution caused by major accidents on platforms or ships is more often reported on by public media, smaller spills resulting from operating platforms or ships are very common, and because of their frequent occurrence, they introduce more oil into the marine ecosystem [2]. Among the multiple sources of operational oil pollution are routine operations on ships and vessels, primarily the release of waste, bilge water or ballast water used in tankers. On platforms, oil is frequently released into the environment during service operations. Though international regulations for the discharge of oil from ships and platforms exist, they do not prohibit oil discharge completely, if certain conditions are met. Most often, ships deliberately or by accident release more oil into the environment than permitted [3].
For the routine surveillance of oil pollution in marine environments, satellite based Synthetic Aperture Radar (SAR) has proven effective and has been in operational use for many years [4][5][6]. The advantages of using SAR images for oil detection include the wide-area coverage and the independence of daytime and weather conditions [7]. Oil spills dampen short gravity-capillary waves on the water surface, and since those small-scale waves are responsible for the radar backscattering, oil spills appear on SAR images as dark patches [8,9]. They are, however, not the only reason for dark patches [8]. Other phenomena can cause similar patterns on SAR imagery, the so-called lookalikes, examples of which are ship wakes, wind field modulations and areas of low wind speed, wind shadowing by land, and rain cells or natural surface films [10,11]. Especially the latter are

Region of Interest
The region of interest (ROI) of this study is the Western Java Sea, between 3 • S and 7 • S and between 105 • E and 111 • E, as shown by the red rectangle in Figure 1. The borders of the ROI are Sumatra to the West and Java to the South, with the Sunda Strait in between, connecting the Java Sea and the Indian Ocean. The northern border is the Karimata Strait, which forms the connection to the South China Sea. Beyond the ROI, the Java Sea continues towards the east into the Makassar Strait and into the Flores and Banda Seas.
In the ROI, several MPA are located, such as Seribu Islands and Karimun Jawa [25], making the region vulnerable to any kind of oil pollution. At the same time, two major oil production sites are located in the area, off the coasts of western Java and southern Sumatra, and some major shipping routes lead through the Java Sea, connecting the economic centers in China and Japan with Europe, Africa, Australia, and the Persian Gulf. We also chose this region, because it had already been subject to earlier studies [21,26].
The regional climate in the ROI is dominated by the monsoon, resulting in a biannual inversion of the wind direction. From November through March, the northwest monsoon is accompanied by a rainy season, which reaches its peak in January and February. In contrast, the southeast monsoon from May to September is accompanied by a dry season, with lowest precipitation in September. During the transition periods between these two seasons, windspeeds are lower and wind direction changes frequently [27].

SAR images
The SAR data used were acquired at C-band by the SAR-C aboard Sentinel-1A and Sentinel-1B and by the ASAR aboard ENVISAT, which were launched in 2014, 2016 and 2002, respectively. SAR images from the ROI were downloaded from the Alaska Satellite Facility Distributed Active Archive Center (ASF DAAC) [28]. A total of 1969 Sentinel-1A and -1B SAR images were available from the period from the first provision of Sentinel-1 data in early October 2014 until the end of the 2018/19 northwest monsoon season in late February 2019. These were used for the statistical analysis of oil spills in the ROI.
For each image, we calculated the mean windspeed using reanalysis data from the Copernicus Marine Environment Monitoring Service (CMEMS) [29]. The wind data were provided on a 0.25° × 0.25° grid at a 6-hourly temporal resolution. As oil spills can only be detected in a certain wind speed range [30], any oil pollution statistics can be biased by SAR data acquired under unfavorable wind conditions. In order to avoid such bias, we excluded all SAR data, whose corresponding mean wind speed was lower than 2 m/s or higher than 10 m/s. A total of 85 images were excluded, i.e., 4.3%, because of this criterion. The coverage of the ROI by the remaining 1884 SAR images is shown in Figure 2.
Note that the coverage is highest in the southern and western parts of the ROI, off the coasts of Java and Sumatra, where typical values are between 150 and 200 images. Where different satellite tracks overlap, the coverage may reach 300 images. In contrast, the coverage is rather low in the ROI's northeast, where typical values range between 50 to 100 images. In this part of the ROI, no SAR images were acquired before 2017, which explains the overall low coverage.
In addition, and to compare results of the visual oil spill detection performed by different operators, we used two sets of SAR images of the ROI: the first was a subset of the above-mentioned Sentinel 1 SAR imagery and consisted of 289 SAR images acquired from October 2014 to February 2017; the second set consisted of 100 ENVISAT ASAR images acquired from May 2006 to February 2012.

SAR images
The SAR data used were acquired at C-band by the SAR-C aboard Sentinel-1A and Sentinel-1B and by the ASAR aboard ENVISAT, which were launched in 2014, 2016 and 2002, respectively. SAR images from the ROI were downloaded from the Alaska Satellite Facility Distributed Active Archive Center (ASF DAAC) [28]. A total of 1969 Sentinel-1A and -1B SAR images were available from the period from the first provision of Sentinel-1 data in early October 2014 until the end of the 2018/19 northwest monsoon season in late February 2019. These were used for the statistical analysis of oil spills in the ROI.
For each image, we calculated the mean windspeed using reanalysis data from the Copernicus Marine Environment Monitoring Service (CMEMS) [29]. The wind data were provided on a 0.25 • × 0.25 • grid at a 6-hourly temporal resolution. As oil spills can only be detected in a certain wind speed range [30], any oil pollution statistics can be biased by SAR data acquired under unfavorable wind conditions. In order to avoid such bias, we excluded all SAR data, whose corresponding mean wind speed was lower than 2 m/s or higher than 10 m/s. A total of 85 images were excluded, i.e., 4.3%, because of this criterion. The coverage of the ROI by the remaining 1884 SAR images is shown in Figure 2.
Note that the coverage is highest in the southern and western parts of the ROI, off the coasts of Java and Sumatra, where typical values are between 150 and 200 images. Where different satellite tracks overlap, the coverage may reach 300 images. In contrast, the coverage is rather low in the ROI's northeast, where typical values range between 50 to 100 images. In this part of the ROI, no SAR images were acquired before 2017, which explains the overall low coverage.
In addition, and to compare results of the visual oil spill detection performed by different operators, we used two sets of SAR images of the ROI: the first was a subset of the above-mentioned Sentinel 1 SAR imagery and consisted of 289 SAR images acquired from October 2014 to February 2017; the second set consisted of 100 ENVISAT ASAR images acquired from May 2006 to February 2012.
Preferably, VV-polarization images were used, because of the higher radar contrast caused by oil pollution [31], and HH-polarization images otherwise. The pixel size of the Sentinel 1 SAR images was 10 m × 10 m and of the ENVISAT SAR images 30 m × 30 m. We recorded the coordinates of the estimated center of each oil spill identified in a SAR image, as well as its length L s and width W s . Preferably, VV-polarization images were used, because of the higher radar contrast caused by oil pollution [31], and HH-polarization images otherwise. The pixel size of the Sentinel 1 SAR images was 10 m × 10 m and of the ENVISAT SAR images 30 m × 30 m. We recorded the coordinates of the estimated center of each oil spill identified in a SAR image, as well as its length Ls and width Ws.

Statistical Analyses
In order to gain an understanding of the overall distribution of marine oil pollution in the Western Java Sea, we performed statistical analyses of all oil spills found on the SAR images. The inhomogeneous spatial coverage by all SAR images ( Figure 2) was taken into account to derive the normalized oil pollution, i.e., the average number of detected oil spills per SAR image, binned into a 0.05° × 0.05° grid. For each grid cell [i;j], the normalized oil pollution, , was calculated dividing the number of oil spills, whose center coordinates were inside the grid cell, , by the number of SAR images covering the grid cell, : The normalized oil pollution, therefore, does not depend on the inhomogeneous SAR coverage. Finally, the recorded length and width were used to calculate the axial ratio, Rs = Ls/Ws, and the spill's area, As, assuming an elliptical shape:

Different Operators
Analyses of long-term changes in the regional marine oil pollution often require combinations of data sets generated by different operators. In this case, problems may arise when those datasets were generated based on different approaches, or when the operators evaluated the SAR images differently. In order to have an idea of the magnitude of such differences, we compared results from two independent operators, who used the same set of SAR images for their oil spill detection.

Statistical Analyses
In order to gain an understanding of the overall distribution of marine oil pollution in the Western Java Sea, we performed statistical analyses of all oil spills found on the SAR images. The inhomogeneous spatial coverage by all SAR images ( Figure 2) was taken into account to derive the normalized oil pollution, i.e., the average number of detected oil spills per SAR image, binned into a 0.05 • × 0.05 • grid. For each grid cell [i;j], the normalized oil pollution, N ij poll , was calculated dividing the number of oil spills, whose center coordinates were inside the grid cell, (1) The normalized oil pollution, therefore, does not depend on the inhomogeneous SAR coverage. Finally, the recorded length and width were used to calculate the axial ratio, R s = L s /W s , and the spill's area, A s , assuming an elliptical shape:

Different Operators
Analyses of long-term changes in the regional marine oil pollution often require combinations of data sets generated by different operators. In this case, problems may arise when those datasets were generated based on different approaches, or when the operators evaluated the SAR images differently. In order to have an idea of the magnitude of such differences, we compared results from two independent operators, who used the same set of SAR images for their oil spill detection.
The two sets of SAR images, 289 Sentinel-1 SAR-C scenes and 100 ENVISAT ASAR scenes, were analyzed in previous studies [20,21] and were re-evaluated in this paper. All SAR image inspections were performed visually using the Sentinel Application Platform (SNAP) toolbox provided by the European Space Agency (ESA). Care was taken when recorded locations of the spills' centers were compared because of their uncertainty, which Remote Sens. 2022, 14, 880 5 of 17 is due to a subjective estimation of center coordinates by different operators. In addition, elongated oil spills, whose SAR signatures were interrupted, might have been interpreted differently, i.e., as two separate spills or as one single spill. Hence, a manual inspection of all datasets was performed and signatures of oil spills with close center coordinates were double checked.
The different datasets, including all parameters recorded for every detected oil spill, were then processed to generate maps of the spills' spatial distribution and of the mean surface area covered by the spills. Confusion matrices were generated to allow for quantitative estimates of the differences caused by different operators. Finally, we analyzed if a given wind speed range had an effect on the observed differences, e.g., because manifestations of marine oil pollution may have been confused with those of biogenic surface films at lower wind speeds, and if any effect of heavy rain events before the SAR acquisitions can be inferred from our results.

Spatial Distributions of Oil Spills
On the 1884 Sentinel-1 SAR scenes we evaluated, a total of 3441 oil spills were identified. That is, on average, almost two imprints of oil spills were found on every SAR image acquired in the ROI. The normalized oil pollution, i.e., the spatial distribution of the detected oil spills, derived from those data is shown in Figure 3. Highest numbers of oil spills per 100 SAR scenes were found in coastal regions. Three areas are especially eye-catching, where the normalized oil pollution well exceeds 10 spills per 100 SAR scenes, resulting in red patches on the map shown in Figure 3. Further areas with high numbers of detected oil spills, exceeding 2 spills per 100 SAR scenes (cyan and green colors), were found off western Java (Jakarta Bay) and along a long curved line that connects the Sunda Strait in the west with the central Javanese coast in the east.
Remote Sens. 2022, 14, x FOR PEER REVIEW 5 of 17 recorded locations of the spills' centers were compared because of their uncertainty, which is due to a subjective estimation of center coordinates by different operators. In addition, elongated oil spills, whose SAR signatures were interrupted, might have been interpreted differently, i.e., as two separate spills or as one single spill. Hence, a manual inspection of all datasets was performed and signatures of oil spills with close center coordinates were double checked. The different datasets, including all parameters recorded for every detected oil spill, were then processed to generate maps of the spills' spatial distribution and of the mean surface area covered by the spills. Confusion matrices were generated to allow for quantitative estimates of the differences caused by different operators. Finally, we analyzed if a given wind speed range had an effect on the observed differences, e.g., because manifestations of marine oil pollution may have been confused with those of biogenic surface films at lower wind speeds, and if any effect of heavy rain events before the SAR acquisitions can be inferred from our results.

Spatial Distributions of Oil Spills
On the 1884 Sentinel-1 SAR scenes we evaluated, a total of 3441 oil spills were identified. That is, on average, almost two imprints of oil spills were found on every SAR image acquired in the ROI. The normalized oil pollution, i.e., the spatial distribution of the detected oil spills, derived from those data is shown in Figure 3. Highest numbers of oil spills per 100 SAR scenes were found in coastal regions. Three areas are especially eyecatching, where the normalized oil pollution well exceeds 10 spills per 100 SAR scenes, resulting in red patches on the map shown in Figure 3. Further areas with high numbers of detected oil spills, exceeding 2 spills per 100 SAR scenes (cyan and green colors), were found off western Java (Jakarta Bay) and along a long curved line that connects the Sunda Strait in the west with the central Javanese coast in the east.  We compared these results with a map showing the ship traffic density in 2016 and 2017 [32] and found that most areas of increased normalized oil pollution coincide with the major shipping routes. In Figure 4, those routes are superimposed as pale blue lines to the map of normalized oil pollution, and they can explain the observed high frequency of marine oil pollution in the Sunda Strait, Jakarta Bay, and along the curved line off the Javanese coast. We also note that a narrow shipping route connecting the Sunda Strait and the Karimata Strait in the north leads to higher normalized oil pollution; however, a major route in the north-east of the ROI, which connects the Karimata Strait with the central Java Sea, cannot be inferred from our map, likely because of the low coverage of that area by SAR data (Figure 2). Moreover, the location of major shipping routes cannot explain the observed hot spots, i.e., the red patches on the map. We compared these results with a map showing the ship traffic density in 2016 and 2017 [32] and found that most areas of increased normalized oil pollution coincide with the major shipping routes. In Figure 4, those routes are superimposed as pale blue lines to the map of normalized oil pollution, and they can explain the observed high frequency of marine oil pollution in the Sunda Strait, Jakarta Bay, and along the curved line off the Javanese coast. We also note that a narrow shipping route connecting the Sunda Strait and the Karimata Strait in the north leads to higher normalized oil pollution; however, a major route in the north-east of the ROI, which connects the Karimata Strait with the central Java Sea, cannot be inferred from our map, likely because of the low coverage of that area by SAR data (Figure 2). Moreover, the location of major shipping routes cannot explain the observed hot spots, i.e., the red patches on the map. A more detailed view at the oil production sites Offshore South-East Sumatra (OSES) and Offshore and North-West Java (ONWJ) is provided in Figure 5. In this case, the major oil fields (after [33]) are superimposed as purple patches onto the map of normalized oil pollution, and the major gas fields as pink patches. Within the oil production site OSES ( Figure 5A), most spills were found at the oil fields Intan, Winduri, and Cintta (see the red patches in Figure 3). The normalized oil pollution at oil field Zelda is increased, though about four times smaller than at the other oil fields. Within the oil production site ONWJ ( Figure 5B), most spills were found in the center of the Adjurna oil field. Note that the above-mentioned curved line of increased normalized oil pollution runs through the northern part of the ONWJ site; hence, higher numbers of oil spills along this line can be attributed to high ship traffic ( Figure 4) rather than oil production. Additionally, note that major gas fields, marked as pink patches in all panels of Figure 5, do not coincide with areas of increased normalized oil pollution, unless they are in close vicinity to oil fields.
For each detected oil spill, the lengths of its major and minor axes (i.e., the spill's length and mean width) were recorded, and they were used to estimate the spill-covered area and the axial ratio (i.e., the inverse compression factor), both assuming an elliptical shape. A more detailed view at the oil production sites Offshore South-East Sumatra (OSES) and Offshore and North-West Java (ONWJ) is provided in Figure 5. In this case, the major oil fields (after [33]) are superimposed as purple patches onto the map of normalized oil pollution, and the major gas fields as pink patches. Within the oil production site OSES ( Figure 5A), most spills were found at the oil fields Intan, Winduri, and Cintta (see the red patches in Figure 3). The normalized oil pollution at oil field Zelda is increased, though about four times smaller than at the other oil fields. Within the oil production site ONWJ ( Figure 5B), most spills were found in the center of the Adjurna oil field. Note that the above-mentioned curved line of increased normalized oil pollution runs through the northern part of the ONWJ site; hence, higher numbers of oil spills along this line can be attributed to high ship traffic (Figure 4) rather than oil production. Additionally, note that major gas fields, marked as pink patches in all panels of Figure 5, do not coincide with areas of increased normalized oil pollution, unless they are in close vicinity to oil fields.
For each detected oil spill, the lengths of its major and minor axes (i.e., the spill's length and mean width) were recorded, and they were used to estimate the spill-covered area and the axial ratio (i.e., the inverse compression factor), both assuming an elliptical shape.  Figure 5. Same as Figure 3, but with a focus on major oil production sites; A: 'Offshore North-East Sumatra'; B: 'Offshore North-West Java'. Purple patches mark the locations of oil fields, and pink patches mark the locations of gas fields (redrawn after [33]). Figure 6 shows the respective distributions of the spill size in the upper panel and of the axial ratio in the lower panel. To both distributions, a Burr (Type XII) distribution was fitted (black curves), with the best-fit parameters added to each panel. The Burr (Type XII) distribution is a continuous distribution for non-negative variables and is defined by three parameters, α, c, and k, as [34]: In addition, the (inverse) quantile functions were added as red curves, and the horizontal black dotted lines denote the 95% quantile of the respective distribution.  Figure 3, but with a focus on major oil production sites; A: 'Offshore North-East Sumatra'; B: 'Offshore North-West Java'. Purple patches mark the locations of oil fields, and pink patches mark the locations of gas fields (redrawn after [33]). Figure 6 shows the respective distributions of the spill size in the upper panel and of the axial ratio in the lower panel. To both distributions, a Burr (Type XII) distribution was fitted (black curves), with the best-fit parameters added to each panel. The Burr (Type XII) distribution is a continuous distribution for non-negative variables and is defined by three parameters, α, c, and k, as [34]:  Approximately 50% of the detected oil spills covered sea surface areas smaller than 0.7 km 2 , and about 90% areas smaller than 5.5 km 2 (upper panel of Figure 6). Neglecting outliers, the mean surface area of the detected oil spills was about 5.6 km 2 , i.e., the largest 10% of the observed oil spills contributed to their mean surface area to an extent comparable to that of the remaining 90%.
The axial ratio of approximately 50% of the detected oil spills was found to be smaller than 11 (lower panel of Figure 6), and that of approximately 90% to be smaller than 50. In most cases, the shape of the observed oil spills was elliptical, with a mean axial ratio of 38, either because of a strong drift of the sea surface layer or a movement of the pollution source, or because of both. In addition, the (inverse) quantile functions were added as red curves, and the horizontal black dotted lines denote the 95% quantile of the respective distribution.
Approximately 50% of the detected oil spills covered sea surface areas smaller than 0.7 km 2 , and about 90% areas smaller than 5.5 km 2 (upper panel of Figure 6). Neglecting outliers, the mean surface area of the detected oil spills was about 5.6 km 2 , i.e., the largest 10% of the observed oil spills contributed to their mean surface area to an extent comparable to that of the remaining 90%.
The axial ratio of approximately 50% of the detected oil spills was found to be smaller than 11 (lower panel of Figure 6), and that of approximately 90% to be smaller than 50. In most cases, the shape of the observed oil spills was elliptical, with a mean axial ratio of 38, either because of a strong drift of the sea surface layer or a movement of the pollution source, or because of both.

Comparison of Oil Spill Statistics from Different Operators
In order to obtain an estimate of the influence that the subjective visual interpretation of SAR images has on the detection of oil pollution, we compared results obtained by different operators: in the first case, two independent operators had analyzed 289 Sentinel-1 SAR-C images, and in the second case, a different pair of independent operators had analyzed 100 ENVISAT ASAR images.

First Case: Sentinel-1 Data
The first data set, for which the results obtained by different operators were compared, consisted of 289 Sentinel-1 SAR-C scenes acquired from October 2014 to February 2017 [20]. The same SAR images were re-analyzed, and the location and dimensions of each detected oil pollution were recorded in the same way as performed before. The results of both analyses are compared in Table 1. Operator 1 found a total of 801 oil spills and operator 2 a total of 694 oil spills. Since no a priori knowledge about the real oil pollution, i.e., no sea truth, was available, an exact quantitative assessment of the performance of both operators is not possible. We can, however, make two extreme assumptions, which may indicate if one operator performed better than the other, i.e., if the results obtained by one of the two operators are more reliable than those obtained by the other. We note that both assumptions are simplifying and may not represent the real conditions, i.e., the real amount of oil spills in the ROI. Since no lookalikes (negatives) were recorded, we restrict ourselves to calculations of the hit rate, or recall, R, the precision, P, and the F1 score, F 1 , given as [35]: In a first simplifying scenario, we may exclude false positives, i.e., we may assume that all detected oil spills were indeed oil pollution, rather than any kind of lookalike, so that both operators produced only true positives and false negatives. Then, a total of 1063 real oil spills were detected by the 2 operators together, of which only 432 (41%) were found by both operators (true positives). Moreover, after missing 262 spills found by operator 2 (false negatives; 25% of all oil spills), the recall (hit rate) of operator 1 would be R 1 = 75% and, vice versa, after missing 369 spills found by operator 1 (false negatives; 35% of all oil spills), the hit rate of operator 2 would be R 2 = 65%. Since the precision P 1,2 of both operators would be 100% (i.e., both produced no false positives), their F1 scores would be F 11 = 0.86 (operator 1) and F 12 = 0.79 (operator 2).
In another simplifying (extreme) scenario, we may consider as real oil pollution only those oil spills that were found by both operators, so that all spills detected by only one operator are false positives. Then, all 432 oil spills were found by both operators (true positives; both operators had a hit rate of R 1,2 = 100%); operator 1 produced 369 false positives (46% false discovery rate of 801 detected oils spills) and operator 2 produced 262 false positives (38% false discovery rate of 694 detected oil spills). The respective precisions of operators 1 and 2 would be P 1 = 54% and P 2 = 62%, respectively, and hence, their F1 scores would be F 11 = 0.70 and F 12 = 0.77, respectively.
These two scenarios demonstrate that determining which operator performed better cannot be easily judged. We also note that those extreme scenarios can only be used to obtain rough estimates of the operators' accuracies, since no ground (sea) truth was available, nor were there any lookalikes (i.e., negatives) recorded.
Whereas quantitative comparisons of the results obtained by different operators are difficult, qualitative comparisons are easier to make. The two panels of Figure 7 show the spatial distributions of the oil spills found by the two operators, with the upper panel (A) showing the results obtained by operator 1, and the lower panel (B) shows the same for operator 2. The overall patterns in both distributions are very similar, and they also agree with those found in the analysis of a greater number of Sentinel-1 SAR images, as described in the previous section. Only small differences in the total number of spills can be found at some locations, e.g., in the Sunda Strait, Jakarta Bay, and in the area of the ONWJ oilfield off the Javanese Coast. These differences are likely to be due to a small number of oil spills identified by one operator only. Note that no spills were detected in the north-eastern part of the ROI, because no SAR images of that area were acquired in the time period considered in this paper (October 2014 to February 2017 [20]). These two scenarios demonstrate that determining which operator performed better cannot be easily judged. We also note that those extreme scenarios can only be used to obtain rough estimates of the operators' accuracies, since no ground (sea) truth was available, nor were there any lookalikes (i.e., negatives) recorded.
Whereas quantitative comparisons of the results obtained by different operators are difficult, qualitative comparisons are easier to make. The two panels of Figure 7 show the spatial distributions of the oil spills found by the two operators, with the upper panel (A) showing the results obtained by operator 1, and the lower panel (B) shows the same for operator 2. The overall patterns in both distributions are very similar, and they also agree with those found in the analysis of a greater number of Sentinel-1 SAR images, as described in the previous section. Only small differences in the total number of spills can be found at some locations, e.g., in the Sunda Strait, Jakarta Bay, and in the area of the ONWJ oilfield off the Javanese Coast. These differences are likely to be due to a small number of oil spills identified by one operator only. Note that no spills were detected in the northeastern part of the ROI, because no SAR images of that area were acquired in the time period considered in this paper (October 2014 to February 2017 [20]).

Second Case: ENVISAT Data
The second data set, for which results obtained by different operators were compared, consisted of 100 ENVISAT ASAR scenes acquired from May 2006 to February 2012 [21]. Again, those SAR images were re-analyzed, and the location and dimensions of each detected oil pollution were recorded in the same way as conducted before. The results of the operators' analyses are listed as a confusion matrix shown in Table 2. In total, operator 1 found 180 oil spills, and operator 2 found 262 oil spills; 148 oil spills were identified by both operators, and the overall sum of spills identified by the two operators together was 294. In the first scenario, again, we assume that there were no false positives, i.e., that all identified oil spills were indeed caused by oil pollution. Then, only 50% of all oil spills were found by both operators; the hit rate of operator 1 would be R 1 = 61%, since 114 oil spills were missed (false negatives; false discovery rate 39%), and the hit rate of operator 2, who missed 32 spills (false negatives; false discovery rate 11%), would be R 2 = 89%. The precisions of both operators would again be P 1,2 = 100% and hence, their respective F1 scores would be F 11 = 0.76 (operator 1) and F 12 = 0.94 (operator 2).
In the second scenario, again, only those 148 oil spills, which were found by both operators (Table 2), are considered as true positives. The hit rates of both operators would again be R 1,2 = 100%, without any false negatives; the false discovery rate of operator 1 would be 18% (32 of 180 detected oil spills), and the false discovery rate of operator 2 would be 44% (114 of 262 detected oil spills). The respective precisions of operators 1 and 2 would be P 1 = 82% and P 2 = 56%, respectively, and hence, their F1 scores would be F 11 = 0.90 and F 12 = 0.72, respectively.
We note that none of the operators is performing better, achieving a higher F1 score, in both extreme scenarios, as it was already found in the previous section. We also note that even in the first scenarios, both operators may have missed some oil spills, which, naturally, would result in lower F1 scores. Further, as no negatives (lookalikes) were recorded, the above statical assessment must remain simplified.
The two panels of Figure 8 show the spatial distribution of the oil spills found by the two operators, with the results obtained by operator 1 shown in the upper panel (A) and the results obtained by operator 2 shown in the lower panel (B). Similar to the results presented in Figure 7, the overall patterns of the detected oil spills are the same for both operators. Because of the overall larger number of oil spills found by operator 2 (Table 2), more bright patches can be seen in the lower panel; however, they do not affect the good qualitative agreement between both data sets.

Influence of Weather Conditions
The visibility of oil pollution on SAR imagery depends, among others, on the prevailing weather conditions. We analyzed the datasets with respect to the influence that the wind speed range and preceding rain events have on the detectability of oil spills.

Wind Speed Range
The previous section demonstrated that large differences can be found between results obtained by different operators. A reason for the observed differences can be that small oil spills, whose imprints may cover only a few SAR image pixels, were missed by one of the operators. Another possible reason is that oil spills were confused with lookalikes, especially biogenic surface films, or vice versa. Such erroneous classifications may have happened, when SAR images were analyzed that were acquired at low-to-moderate wind speeds, when both types of surface films cause similar radar contrasts [31]. While mineral oil spills can be detected on SAR images at wind speeds up to 10 m/s [30], the upper limit of wind speeds, at which biogenic surface films are manifested on SAR images, is about 7 m/s [36,37].

Influence of Weather Conditions
The visibility of oil pollution on SAR imagery depends, among others, on the prevailing weather conditions. We analyzed the datasets with respect to the influence that the wind speed range and preceding rain events have on the detectability of oil spills.

Wind Speed Range
The previous section demonstrated that large differences can be found between results obtained by different operators. A reason for the observed differences can be that small oil spills, whose imprints may cover only a few SAR image pixels, were missed by one of the operators. Another possible reason is that oil spills were confused with lookalikes, especially biogenic surface films, or vice versa. Such erroneous classifications may have happened, when SAR images were analyzed that were acquired at low-to-moderate wind speeds, when both types of surface films cause similar radar contrasts [31]. While mineral oil spills can be detected on SAR images at wind speeds up to 10 m/s [30], the upper limit of wind speeds, at which biogenic surface films are manifested on SAR images, is about 7 m/s [36,37].
To investigate if the observed differences caused by different operators depend on the chosen wind speed range, we re-analyzed the given results, but considered only those SAR images that were acquired at higher wind speeds, by raising the lower wind speed limit from 2 m/s to 5 m/s. In this subset of SAR images less manifestations of biogenic surface films can be expected and therefore, the risk of misinterpretations should be reduced. Moreover, since small oil spills have a shorter lifetime at high wind speeds, the average spill size should increase and therefore, the risk of being overseen by an operator should decrease. We note that a further increase in the lower wind speed limit would have resulted in a too small number of (remaining) oil spills, thereby hindering a significant analysis.
In Tables 3 and 4, the results are shown for the Sentinel-1 dataset and for the ENVISAT dataset, respectively. Both tables list for the two wind speed ranges the percentages of oil spills found by both operators, along with the mean area of the spills that were not found by both operators, i.e., that were missed by one operator. In both data sets, the percentage of oil spills found by both operators increased when the lower wind speed limit was raised: in the Sentinel-1 dataset, the increase was 9 percentage points, and in the ENVISAT dataset 15 percentage points. These results support our hypothesis that the confusion of oil spills and biogenic surface films is a main source of uncertainty in the SAR image analysis.
The mean area of spills, which were not detected by both operators, slightly decreased in the Sentinel-1 dataset, but it strongly increased in the ENVISAT dataset. Apparently, most of the small oil spills on Sentinel-1 SAR-C images, which were not detected by one of the operators when the lower wind speed limit was 2 m/s, were also missed when the lower limit was raised to 5 m/s. In contrast, the mean area of small spills on ENVISAT ASAR images more than doubled when the lower wind speed limit was raised from 2 m/s to 5 m/s. However, since there was no general trend observed in both datasets, our hypothesis, that the mean area should generally increase, is not supported by these results. We also note that different pairs of operators analyzed the two datasets, that the sizes of the two datasets were different, and that their pixel sizes differed by a factor of three, which altogether makes it difficult to draw final conclusions.

Heavy Rain Events
Heavy rain disrupts marine surface films [38], and it takes some time until a coherent film has formed again on the sea surface. As a result, the time between a heavy rain event and the SAR image acquisition may affect the detectability of oil pollution. We used the datasets from [21] to investigate if heavy precipitation with rain rates exceeding 5 mm/h may affected the results presented in this paper. For every detected oil spill, we calculated the time between the respective SAR image acquisition and the latest heavy rain event using reanalysis data from the Copernicus Climate Data Storage (CDS) [39], which are provided on a 0.25 • × 0.25 • grid at an hourly temporal resolution. The temporal distribution is shown in Figure 9.
the time between the respective SAR image acquisition and the latest heavy rain event using reanalysis data from the Copernicus Climate Data Storage (CDS) [39], which are provided on a 0.25° × 0.25° grid at an hourly temporal resolution. The temporal distribution is shown in Figure 9. Two maxima in the temporal distribution can be seen, at 10 h and 18 h, which are likely to be due to heavy convective precipitation in the afternoon hours. There is, however, no clear trend visible, which would indicate that a certain time period between the latest strong rainfall and the SAR image acquisition is needed to allow for a detection of oil pollution on the SAR imagery. Apparently, a larger data basis is needed to draw definite conclusions. We also note that the spatio-temporal resolution of the reanalysis data might be too low, since the oil spills may form again at the sea surface within the first hour after the rainfall, and since tropical rain cells are often of sizes 2-40 km [40] and are not resolved by the used reanalysis data.

Discussion
Our results on the spatial distribution of marine oil pollution in the Western Java Sea confirm those from earlier studies [20,21], as similar overall pollution patterns were found. Most of the oil spills were detected along major shipping routes and in close vicinity to oil production sites. Other regions where a higher-than-average number of oil spills can be found, such as popular fishing grounds or near the shore especially at river outflows, [17] were not included in the analysis. However, most pollution that was found near the coast is located close to harbors. The most prominent example in our results is in Jakarta Bay, where one the busiest Indonesian harbors is located. There are also harbors on both sides of the Sunda Strait, which are connected by frequent traffic and smaller harbors along the Javanese coast. Near all these harbors, oil pollution was repeatedly found close to the coast.
Moreover, the majority of the detected oil spills was smaller than 1 km 2 , and most oil spills were of elliptical shape, with an axial ratio smaller than 10. We did not find a Two maxima in the temporal distribution can be seen, at 10 h and 18 h, which are likely to be due to heavy convective precipitation in the afternoon hours. There is, however, no clear trend visible, which would indicate that a certain time period between the latest strong rainfall and the SAR image acquisition is needed to allow for a detection of oil pollution on the SAR imagery. Apparently, a larger data basis is needed to draw definite conclusions. We also note that the spatio-temporal resolution of the reanalysis data might be too low, since the oil spills may form again at the sea surface within the first hour after the rainfall, and since tropical rain cells are often of sizes 2-40 km [40] and are not resolved by the used reanalysis data.

Discussion
Our results on the spatial distribution of marine oil pollution in the Western Java Sea confirm those from earlier studies [20,21], as similar overall pollution patterns were found. Most of the oil spills were detected along major shipping routes and in close vicinity to oil production sites. Other regions where a higher-than-average number of oil spills can be found, such as popular fishing grounds or near the shore especially at river outflows, [17] were not included in the analysis. However, most pollution that was found near the coast is located close to harbors. The most prominent example in our results is in Jakarta Bay, where one the busiest Indonesian harbors is located. There are also harbors on both sides of the Sunda Strait, which are connected by frequent traffic and smaller harbors along the Javanese coast. Near all these harbors, oil pollution was repeatedly found close to the coast.
Moreover, the majority of the detected oil spills was smaller than 1 km 2 , and most oil spills were of elliptical shape, with an axial ratio smaller than 10. We did not find a correlation between spill sizes and axial ratios. It is however likely that a large number of small oil spills were not found, particularly on ENVISAT ASAR images, whose spatial resolution is three times lower than that of the Sentinel-1 SAR-C images. The different spatial resolutions may also explain why a larger quantity of small oil spills, with sizes smaller than 0.5 km 2 , were found on Sentinel-1 SAR-C images. We further note that the highest normalized oil pollution, i.e., the largest number of oil spills per 100 SAR scenes and per grid cell, was found at the major oil fields. The visual analyses performed by different operators revealed the same qualitative result, that the overall patterns in the oil pollution are the same. However, not even half of the spills were found by two operators, who were independently analyzing the same set of SAR images. The percentage of spills found by both operators increased by approximately 10 percentage points when only SAR images were considered, which were acquired at higher wind speeds. That is, by raising the lower wind speed limit from 2 m/s to 5 m/s, we reduced the risk that biogenic surface films were mistaken for oil spills, because the latter are more likely to be disrupted at higher wind speeds than oil spills [30,37]. These differences confirm the results of a similar study, in which two operators classified 7 out of 22 oil spills found with the same confidence level, where the confidence level depended, among other factors, on the wind speed [19].
Based on 289 Sentinel-1 SAR-C images, the mean area of those oil spills that were not found by both operators was slightly reduced when a higher wind speed range was considered. In contrast, the mean area of the oil spills, which were not found by both operators on 100 ENVISAT ASAR images, increased considerably when the lower wind speed limit was raised. Only the latter result supports our assumption that differences in the results obtained by different operators are caused by confusion of oil spills and lookalikes, such as biogenic surface films, especially for large, elongated spills. Further comparisons based on larger quantities of SAR images are needed to draw definite conclusions.
Even though the differences in the results obtained by different operators could be reduced by raising the lower wind speed limit, they are still too large to allow for a seamless combination. This is especially true when the development of oil pollution over time is investigated. For qualitative studies of oil pollution in a given area, these differences are less important, as the overall patterns of detected oil pollution do not depend on the operator. It must also be considered that the classification of dark patches into oil spills or lookalikes may change over time, as a single operator gains more experience in SAR image analysis, and particularly in distinguishing dark patches of different origin. Additionally, the detail, at which said operator inspects the SAR images can vary from day to day or even from image to image, even if the operator endeavors to keep the given efforts on a constant (high) level.
This study is however based on a small number of SAR images and therefore, our results cannot be easily applied to other datasets. For more significant conclusions on the influence that different operators have on oil pollution statistics, further studies employing larger datasets examined by more than two operators are needed.

Conclusions
For our statistical analyses, a total of 1969 Sentinel 1 SAR-C scenes of the Western Java Sea acquired from October 2014 to February 2019 were visually inspected with regard to imprints of marine oil pollution. The results of the spatial analysis show that most spills were found along major shipping routes and at oil production sites. Approximately half of the spills have an area smaller than 1 km 2 and have an axis ratio smaller than 10; however, oil spills over six times that size were also found.
When comparing the results from independent operators examining the same set of SAR images, we found that there are quantitative differences while the overall patterns of marine oil pollution were the same. The percentage of spills, which were found by both operators, increased when the lower limit of the wind speed range was raised, for which SAR imagery was considered. Our results indicate that fewer biogenic surface films were mistaken for oil spills at higher wind speeds, because of the different wind speed ranges, at which both types of surface films manifest on SAR imagery. Moreover, the fact that small oil spills can be easily missed appears to be a major source for differences in the results from two operators.
Oil pollution statistics based on visual observations of SAR imagery may be used as training datasets for automated oil pollution detection systems. Hence, our results are important for an assessment of potential errors in those systems. Further research in this respect will be conducted. Future research will also see an investigation of radar polarization (VV vs. HH) on the detection performance for low wind conditions and incidence angle.