Land Cover Classiﬁcation of Complex Agroecosystems in the Non-Protected Highlands of the Galapagos Islands

: The humid highlands of the Galapagos are the islands’ most biologically productive regions and a key habitat for endemic animal and plant species. These areas are crucial for the region’s food security and for the control of invasive plants, but little is known about the spatial distribution of its land cover. We generated a baseline high-resolution land cover map of the agricultural zones and their surrounding protected areas. We combined the high spatial resolution of PlanetScope images with the high spectral resolution of Sentinel-2 images in an object-based classiﬁcation using a RandomForest algorithm. We used images collected with an unmanned aerial vehicle (UAV) to verify and validate our classiﬁed map. Despite the astounding diversity and heterogeneity of the highland landscape, our classiﬁcation yielded useful results (overall Kappa: 0.7, R 2 : 0.69) and revealed that across all four inhabited islands, invasive plants cover the largest fraction (28.5%) of the agricultural area, followed by pastures (22.3%), native vegetation (18.6%), food crops (18.3%), and mixed forest and pioneer plants (11.6%). Our results are consistent with historical trajectories of colonization and abandonment of the highlands. The produced dataset is designed to suit the needs of practitioners of both conservation and agriculture and aims to foster collaboration between the two areas.


The Humid Highlands of the Galapagos and Their Value
Vast expanses of agricultural land in the Galapagos have become abandoned in the last decades, making this UNESCO Natural World Heritage Site vulnerable to invasive plants, which thrive in disturbed environments [1,2]. Presently, the humid highlands of the Galapagos record higher numbers of invasive plants when compared to the dry lowlands, and these plants not only threaten agricultural systems but also the remaining patches of native-dominated ecosystems that still exist in the non-protected areas of this Ecuadorian archipelago [3]. Crop production and cattle ranching, when properly implemented, can effectively be used to control the spread of invasive plants or even rehabilitate invaded lands [4,5]. Given the direct influence of agriculture on both the introduction and control of introduced species, local governance seeks to incentivize agroecological models that serve to  Most previous studies either exclude the agricultural areas or suffer from high cloud cover concentrations over the humid highlands (Table 1). Only the survey performed by the National System of Information of Rural Lands and Technological Infrastructure (SIGTIERRAS) has high-resolution images where the agricultural zone is visible. However, these images are a decade old, and the sensor was sensitive only to wavelengths within the visible spectrum. Furthermore, two of the primary studies that are used by the Galapagos National Park (GNP) and government institutions (CLIRSEN and TNC, 2006, and SIGTIERRAS 2010) are not published, so wider access to this valuable information is greatly restricted. Given the sensitive nature of ecosystems around and within the agricultural areas and the dynamic rate at which they change, it is imperative that the humid zones be precisely mapped with a replicable methodology to support monitoring efforts and to inform land management strategies [29].

Objectives
The purpose of this work is to build upon previous studies and make updated land cover classification maps of Galapagos agroecosystems available using some of the highest-resolution freely-available images. Specifically, our objectives for the present investigation are to: • Develop a classification methodology, using remote sensing, specifically in complex agroecosystem landscapes. • Identify regions of remaining native forests in and near the agricultural zone. • Identify the distribution of invasive plant species like Psidium guajava, Rubus niveus, Zygsigum jambos, and Pennisetum purpureum that threaten agricultural lands and adjacent protected areas.

•
Identify the distribution of Erythrina smithiana living fences in Santa Cruz, which might act as an obstacle for migrating tortoise populations.

•
Identify the distribution of economically significant agricultural land cover like cattle ranching pastures, annual crops, tree crops, and cash crops, such as coffee.
Having access to updated information about the distribution of invasive species and remaining native forests will assist landowners and institutions like the GNP and the Ministry of Agriculture (MAG) in their goals to manage land uses, conserve protected species, and prevent the spread of invasive plants. Current land cover distributions can help determine the optimal conditions for prized crops, as well as predict the range of invasive plants in islands where plants may be at a different stage of colonization. This information, together with previous efforts using comparable mapping techniques, will help plan for future conditions as both local and global drivers, such as land use or climatic patterns, continue to change. Inhabitants of the highlands of the Galapagos devote their land to three general activities: cattle ranching, crop production, and tourism activities. According to the 2014 agricultural census, pastures cover nearly 60% of the land surface area of Galapagos agroecosystems, either cultivated or naturally germinated. A commonly grown variety is elephant grass (Pennisetum purpureum), which is used as Figure 1. Study area. Each of the inhabited islands has a designated agricultural zone. The map above shows the geographical location of the agricultural zones (blue outline), and the panels below show a detail of each agricultural zone with cloudless PlanetScope images.

Preliminary Definition of Land Cover Units
We focused on defining and mapping vegetation types that were of interest to both the conservation and agricultural sectors, including invasive species, like guava, and valued cash crops, like coffee.
Remote Sens. 2020, 12, 65 6 of 39 We based the agriculturally-relevant categories on those used by the 2014 agricultural census of the Galapagos [40]. Similarly, we based conservation-related categories on those used by the official classification systems of the MAG and on previous classification studies of the protected areas of the GNP [42,43].
We used level hierarchies to address the complexity of landscapes, breaking down broader categories into increasingly specific categories (Figure 2). At level 1, categories allow choosing between Land and the Pacific Ocean. At level 2, the 'Land' category is broken down into four categories: 'bare ground,' 'built environment,' 'vegetation,' and 'freshwater.' Bare ground is volcanic rock or soil surface area with no vegetation cover. This category excludes recently tilled land for crops. Built environment is surface covered by concrete, pavement, or other impervious surfaces. Vegetation represents all plants that cover the land surface, and freshwater describes naturally-occurring or artificial bodies of freshwater, such as ponds or reservoirs. At level 3, the 'vegetation' category is further broken down into 'introduced' and 'native' species, and levels 4 and 5 correspond to more specific categories that can be recognized as individual vegetation stands or patches at a mapping scale of 1:10,000. Introduced species ( Table 2) are plants that were intentionally or unintentionally brought to the islands by humans and have become naturalized since their introduction, such as Citrus and Erythrina spp. Native species (Table 3) are those that arrived on the islands by their own means, transported by the wind, water, or other animals [44]. Introduced species include food crops (transitory and permanent crops), pastures, and invasive species. The pastures category includes grass species planted by farmers for cattle grazing naturally-occurring species that are dispersed by the wind [40]. Invasive species are a subset of naturalized plants that produce reproductive offspring in large numbers and over a considerable distance, thus having the potential to spread over large areas and negatively impact native biota [45]. Some of the most common and aggressive species include Cedrela odorata (cedar), Cinchona pubescens (quinine), Psidium guajava (guava), Rubus niveus (blackberry), Lantana camara (supirosa), and Syzygium jambos (pomarrosa) [42].

Category Description
Cedrela-Cedar Cedrela odorata, or spanish cedar, is a semi-deciduous tree 10-30 m tall, which outcompetes most other tree species and grows into a dense canopy [46].

Cinchona-
The red quinine tree (Cinchona pubescens) is a 15 m broad-leaved evergreen, which

Category Description
Cedrela-Cedar Cedrela odorata, or spanish cedar, is a semi-deciduous tree 10-30 m tall, which outcompetes most other tree species and grows into a dense canopy [46].

Cinchona-Quinine
The red quinine tree (Cinchona pubescens) is a 15 m broad-leaved evergreen, which was once widespread in Santa Cruz's Miconia region (~580 masl). [47] Coffea-Coffee The Coffeaand coffee category includes bushes of the arabica or robusta varieties since this is the most important cash crop and the only product that is exported. Coffee is often shade-grown with other tree crops like Citrus or banana (Musa spp.), so this category is limited to what is visible from above [40].

Cultivated Grass
Cultivated grass is a category that groups monocrops of less common cultivated grasses for pasture or landscaping, including grasses of Brachiara or Paspalum genera [40].

Erythrina-Porotillo
Erythrina smithiniana or porotillo is a shrubby tree of the Fabaceae family, which grows up to~8 m and is commonly planted as stakes along the edges of farms to form 'living fences' [12].

Lantana-Supirosa
Lantana camara or Supirosa is an ornamental shrub up to 3 m tall that invades a wide variety of habitats, from tropical to temperate regions [48,49] Mixed forest Mixed forest is a category describing a forest where there is a mixture of introduced, invasive, or native species, and the dominance of a single species cannot be established [42].

Permanent Crops
Permanent Crops are those that take over one year to reach productive age, and that can produce for several subsequent years without requiring replanting after each harvest. Some of the most common perennial crops include coffee (Coffea spp.), plantain, banana (Musa spp.), sugar cane (Saccharum spp.), pineapple (Ananas spp.), and orange (Citrus spp.) [40]. The permanent crops category also includes all other perennial or tree crops, such as pineapple (Ananas spp.), sugar cane (Saccharum spp.), or other tree crops, commonly of Fabaceae, Persea, or Carica genera [40].

Pennisetum-Elephant grass
Pennisetum purpureum or Elephant grass is the most common type of cultivated grass for cattle fodder and grows tall, robust bamboo-like clumps [40].

Psidium-Guava
Psidium guajava or guava is a small tree (8 m tall) that has escaped cultivation and become common in mesic forests (>150 masl) of several of the large islands [50,51].

Rubus-Blackberry
Rubus niveus or blackberry is a widespread thorny shrub that grows in gaps of native vegetation and forms dense thickets up to 4 m high over grasslands, bracken, bush, and forest areas alike [52].

Silvopasture
Silvopasture describes an intentional combination of pasture for cattle grazing and trees, which provide additional fodder for cattle, fruit crops, wood, or shade [53].

Syzygium-Pomarrosa
Syzygium jambos or pomarrosa is a small tree (3-15 m) that forms dense thickets and threatens several highland habitats, especially in San Cristobal and Isabela [2].

Evergreen Forest and Shrubland
The Evergreen forest and shrubland is sometimes referred to as the 'Scalesia', 'brown', and 'Miconia' zones. This region contains well-developed soil with vegetation characterized by an abundance of shrubs, herbs, ferns, and trees like Darwiniothamnus tenuifolius, Lycopodium spp., Scalesia pedunculata, Miconia robinsoniana, and Zantoxylum fagara. Epiphytes like mosses and liverworts commonly cover the trunks and branches of trees in this zone [42,46,51].

Evergreen Seasonal Forest and Shrubland
Evergreen seasonal forest and shrubland is an ecosystem previously referred to as 'transition zone', where soil and understory are more developed than the deciduous forest and species like Clerodendrum molle, Cordia lutea, Chiococca alba, Psidium galapageium, Tournefortia spp. can be found [42,46,51].

Humid Tallgrass
Humid tallgrass is a region previously known as 'pampa zone', which contains virtually no trees or shrubs, and its vegetation consists mainly of ferns, grasses, and sedges like Cyathea weatherbyana, Habenaria monorrhiza, or Pteridium aquilinum [42,43].

Pioneer
The pioneer category describes herbaceous, quick-growing native vegetation that thrives in recently disturbed areas, such as Plumbago scandens, Senna occidentalis, Mentzelia aspera, Sida rhombifolia, or Heliotropium spp., but likely also includes introduced herbaceous species like Amaranthus spp., or Pothomorphe peltata [46].

Land Cover Classification
A detailed description of the mapping process is found in Figure 3 and the sections below.

Multispectral Data
New freely-available sensors, such as the 10-meter spatial resolution Sentinel-2 images from the UE's Copernicus Programme [54], and 3-meter spatial resolution 4-Band PlanetScope images from Planet Labs [55], can help overcome the challenges that have limited mapping of these areas. Planet Labs is a commercial venture, but its image collections are accessible to university students and faculty for free through its education and research program. New freely-available sensors, such as the 10-meter spatial resolution Sentinel-2 images from the UE's Copernicus Programme [54], and 3-meter spatial resolution 4-Band PlanetScope images from Planet Labs [55], can help overcome the challenges that have limited mapping of these areas. Planet Labs is a commercial venture, but its image collections are accessible to university students and faculty for free through its education and research program.
PlanetScope sensors have a high spatial resolution (3 m) and are sensitive to both visible and Near Infrared (NIR) wavelengths ( Figure 4). The high spatial resolution of PlanetScope images allows visualizing considerably more details than NASA's freely available Landsat collections or even commercially-available SPOT (Satellite Pour l'Observation de la Terre) sensors. Meanwhile, Sentinel-2 sensors have a high spectral resolution (Aerosol, Visible, Red Edge, NIR, Narrow NIR, Water Vapor, and Short Wave Infrared or SWIR) and have a 10, 20, and 60-meter spatial resolution ( Figure 4). New freely-available sensors, such as the 10-meter spatial resolution Sentinel-2 images from the UE's Copernicus Programme [54], and 3-meter spatial resolution 4-Band PlanetScope images from Planet Labs [55], can help overcome the challenges that have limited mapping of these areas. Planet Labs is a commercial venture, but its image collections are accessible to university students and faculty for free through its education and research program.  The multispectral images used to make the mosaic and land cover classification are listed in Table 4. These images were chosen by searching through Planet and Copernicus image collections for low (usually <10%) cloud cover images from the warm/humid season (January-May) that were recorded less than a month apart from each other for each island to avoid variability in the island mosaics. Even if cloud cover was greater than 10%, the images were considered useful if land areas were visible. Images from 2018 were used as a reference, and we included images a maximum of one year apart to complete areas that suffered from cloud coverage. We used a 10-meter spatial resolution Digital Topographic Model (DTM) generated from SIGTIERRAS orthophotos (2010). The DTMs were provided by authorities from the Galapagos Government Council (CGREG). Elevation and Slope are useful predictors of vegetation variability [56,57]. We calculated the slope from the DTM layer in ArcMap.

Unmanned Aerial Images and GPS Control Points
From 2017-2019, we surveyed about 83 agricultural production units (UPA) on all four islands. Most surveyed UPAs were recommended for mapping by MAG because they are considered to be representative of agrarian production on each island. Additionally, some UPAs were chosen because they are located in known tortoise habitat. During the surveys, we collected high-resolution (<2 cm per pixel) visible wavelength images of the UPAs with an unmanned aerial vehicle (UAV-DJI Mavic Pro Platinum, 1/2.3 CMOS Sensor), as well as 826 control points of the most salient landscape features and vegetation types within the UPAs that were visible from above, wherever terrain accessibility allowed, using a handheld GPS (Garmin Map64, accuracy: +/− 3 m). We covered an area of~1252 ha with our drone flights, corresponding to about 2% of the study area (Table 5, Figure 5). We aligned the UAV images and generated a three-dimensional mesh that we used to stitch together a high-resolution (~0.04 m) orthomosaic using AgiSoft Photoscan (version 1.4.5) for each UPA.

Image Pre-Processing
We used PlanetScope surface reflectance products, which were already atmospherically corrected to bottom-of-atmosphere reflectance. We generated the mosaic of these scenes in ENVI 5.3 and applied a 20-pixel edge feathering between images. Sentinel-2 images were atmospherically corrected in SNAP 6.0 using the Sen2Cor plugin with a Maritime setting. Lastly, we enhanced Sentinel-2's 20 m and 60 m bands to 10 meters using Sen2Res [58]. The Sen2Res algorithm separates band-dependent information (reflectance) from information that is common to all bands (geometry of image elements), then unmixes lower resolution bands, maintaining their reflectance while

Image Pre-Processing
We used PlanetScope surface reflectance products, which were already atmospherically corrected to bottom-of-atmosphere reflectance. We generated the mosaic of these scenes in ENVI 5.3 and applied a 20-pixel edge feathering between images. Sentinel-2 images were atmospherically corrected in SNAP 6.0 using the Sen2Cor plugin with a Maritime setting. Lastly, we enhanced Sentinel-2's 20 m and 60 m bands to 10 meters using Sen2Res [58]. The Sen2Res algorithm separates band-dependent information (reflectance) from information that is common to all bands (geometry of image elements), then unmixes lower resolution bands, maintaining their reflectance while propagating the geometry of scene elements to keep sub-pixel details [58]. All multispectral image mosaics, the DTM, and slope layers were clipped using a mask of the study area. The Sentinel-2 images were used as a reference for orthorectification of the PlanetScope images.

Image Fusion
We used the variational fusion method proposed by Gasparovic et al. (2018) to fuse PlanetScope and Sentinel-2 images to improve classification accuracy. This method introduces the geometry information of the higher resolution image (PlanetScope) by aligning all edges of the higher resolution bands with each lower resolution (Sentinel-2) multispectral bands of similar spectral characteristics [59,60]. Figure 4 highlights the spectral overlap with Sentinel-2 and PlanetScope fused bands. High-resolution PlanetScope bands 1, 2, 3, and 4, were each fused with the 10 m Sentinel-2 bands 2, 3, 4, and 8, respectively. We also fused PlanetScope's Band 4 with Sentinel's Band 8A, 11, and 12. There is no direct equivalent to fuse with Sentinel-2's Red Edge bands, so following Gasparovic et al. (2018), we generated a synthesized band (S) using Equation (1), where B3 is PlanetScope's Band 3 (Red), and B4 is its Band 4 (NIR). Table 6 summarizes the band combinations that were used to obtain the fused images. Image Fusion was conducted with the open-source Orfeo ToolBox V.6.6.1.

. Vegetation Indices
The fused multispectral images were used to generate vegetation indices, namely the Normalized Difference Vegetation Index (NDVI), Structural Index (SI), and Normalized Difference Vegetation Red Edge (NDRE). NIR wavelengths are used to calculate vegetation indices like the NDVI and SI, which are known to be useful for land cover classification [22,61,62]; studies also suggest that including Sentinel-2's Red Edge band in a vegetation index, such as the NDRE, can yield more accurate results for the classification of agricultural areas [16,63].

Object-Oriented Classification with RandomForest
Random Forest (RF) is a widely used algorithm for remote sensing, being a powerful option for integrating different imagery sources and ancillary data sources into image classification workflows [64,65]. RF classification has been demonstrated to work well in land cover classification tasks like this one because it is effective with relatively small samples of training data and not particularly sensitive to noise [27]. The method is also ideal because it can achieve good accuracy without detailed hyperparameter optimization, and provides a robust estimate of the trained model's uncertainty [23,24,27].

Segmentation
We created the objects with a multiresolution algorithm where PlanetScope bands and fused Sentinel-2 bands (Red, NIR) were given equal weight. The images were divided into segments of relatively homogenous pixel composition, where the mean of band values of pixels within each segment is used to represent "objects." We set the segmentation scale parameter at 40 pixels, where each pixel is about 9 m 2 . During the segmentation process, the similarity of pixels was determined by two values ranging from 0 to 1. The first value is a shape criterion, where higher values mean that the color of pixels is given less influence over what is included or excluded from each segment. The second value is a compactness criterion, where higher values emphasize spatial proximity, causing resulting objects to be more compact. We used a trial-and-error approach to determine the values (shape: 0.7; compactness: 0.3) where objects such as built environment, living fences, and pastures were selected most completely while excluding adjacent objects of different categories.

Training Data Sample
On each island, at least 20 object samples were selected for each category using GPS control points and drone images acquired during fieldwork as reference. These object samples represent areas where the cover category covers the total extension of the sample. From this data sample, we selected 50 stratified random pixel sample subsets for training the classification model.

Random Forest Classification
RF classification was performed in R Studio open-source statistical software, where we used the randomForest, Caret, and raster packages to produce all classifications. For each island classification, randomForest sampled 1000 classification trees and tried four variables at each split to calculate the out-of-bag samples (rfOOB). The rfOOB are used to create a cross-validated prediction error for the model and to formulate a measure of feature importance [66].
We used two sets of metrics to gauge the importance of 20 different variables from three different sources (orthophoto, PlaneScope, Fused S2 + PS;) in the randomForest model. The first metric we used is the Variable Importance (Caret Package), which is the mean of the scaled (from 0-100) class-specific decreases in accuracy. The second set of metrics we used included the Mean Decrease Impurity Importance (MDI) or Mean Decrease in Gini, as well as the Mean Decrease Accuracy (MDA, randomForest package). MDI measures how important a variable is for estimating the value of a target class across all of the decision trees that make up the forest. MDA evaluates the importance of each variable by looking at how much prediction error increases when removing that variable while all others are left unchanged [67,68].

Verification
Throughout the image classification process, we used Google Earth, high-resolution UAV images, GPS control points, and expert knowledge to inform, correct, and fine-tune the training samples for the image classification. We ran the classification algorithm multiple times, and after each iteration, we identified regions where we suspected the classification had issues, and then selected additional samples or removed existing samples to correct our results. Categories with insufficient data (Lantana, Erythrina) to select training samples that would produce consistent results were dropped from the classification.

Validation
To validate the map, we created a confusion matrix and calculated Cohen's Kappa [69][70][71]. We also calculated the receiver operating characteristic (ROC) area under the curve (AUC) for our land-cover methods, which is sometimes used to demonstrate the accuracy of machine learning methods [72]. However, this metric assumes that grid contents are homogenous. Given that we are dealing with high-resolution data and land cover classes that are often not discrete, we also assessed the accuracy of our maps by calculating the root mean square error (RMSE) and a correlation coefficient (R 2 ) like land cover classification studies that use high-resolution images [73][74][75][76].
First, we generated a grid of 60 × 60 m squares. We randomly selected 15 squares on each island that overlapped areas where we had high-resolution images and control points but that did not overlap with the areas chosen for training the algorithm. This way, we maintained independent data sources for the geometry of the classified images and our reference polygons [69]. We created two versions of these areas, one to serve as the reference polygons and the other to serve as the prediction. We intersected the prediction polygons with the classified maps to extract the information corresponding to those areas. For the reference polygons, we cut them into sections that visually matched the high-resolution UAV images and the GPS control points, and we manually classified them.
To characterize the effectiveness of our classification for estimating land cover fractions, we calculated the percentages of the polygons that were occupied by the different classes, and obtained a root mean square error (RMSE) and a correlation coefficient (R 2 ) for each island and for each class c using Equations (2) and (3) [73][74][75][76], respectively: wheref i,c is the predicted land cover fraction of class c for sample i, f i,c is the true land cover fraction of class c for sample i, f c is the mean class c land cover fraction of the samples, and N is the total number of samples. If the predicted cover perfectly matched the reference cover, we would expect an R 2 value of 1 and an RMSE of 0 for that class. The island-wide results were obtained by summing simultaneously over samples and classes in Equations (2) and (3). The AUC is a performance metric that estimates how well a model is able to distinguish between classes, typically ranging between 1 (perfect classification) and 0.5 (no predictive capability); values less than 0.5 are characteristic of a classifier that is anticorrelated with the true categories [77,78]. The AUC only characterizes binary predictors, so we calculated the AUC values for each land cover category versus all other categories, using the same 60 reference polygons to evaluate our classification. An AUC value of 0.7 is usually determined as the threshold for usefulness in an application, while scores >0.9 denote high performance and are the most useful for classification [79].

Object Oriented Classification with Random Forest
Previous research has shown that although RF can handle high dimensional data, classification accuracy remains relatively unchanged when only the most important predictor variables are used [80,81]. Therefore, we ran the RF classification several times and recorded the importance ranking of the less important variables of each iteration. In addition, a correlation and collinearity analysis was performed, leaving a set of only the most important and uncorrelated variables. Four data layers were removed from our data layer stack used in the final classification because they were mostly redundant and did not offer new information for classification: PlanetScopes's Band 2 and its NDVI, as well as the fused Band 6 and 7 (Red Edge). Table 7 summarizes the final layer stack used, as well as the MDA and MDI scores for each layer. Table 8 details the variable importance for sorting each of the land cover categories. These metrics suggest that for most land cover categories, elevation was the most important variable for their classification. All used PlanetScope layers were also consistently useful for classification, as were the fused NIR and Narrow NIR bands. The fused NDVI and NDRE were particularly helpful for differentiating the Cedrela-cedar cover. The classification method yielded a cumulative classification error ranging from 2.76% for Floreana to 6.2% for Santa Cruz (Table 9). Permanent crops were the most commonly misclassified cover, as it was sometimes misclassified as a transitory crop or as mixed forest. Table 9 summarizes the error rates, as well as the most common classification errors (>0.1 error rate) revealed in the confusion matrix.  Table 10 lists the spatial coverage of each land cover type on all islands and details the percentage of terrestrial surface that each category covers within the agricultural area as well as within the 1 km buffer zone from our study. A shapefile ( Figure S1) of the classified images can be found online in the supplementary materials section. Appendix A contains a breakdown for individual islands: Santa Cruz (Table A1), San Cristobal (Table A2), Isabela (Table A3), and Floreana (Table A4). Figure 6 depicts the spatial distribution of each land cover type on the four islands. Figure 6 and Table 10 show that there is a dramatic change in vegetation inside the agricultural area relative to the surrounding protected areas. Inside the agricultural area, the most common cover is Psidium-Guava, covering about 4958 ha, or nearly 20% of the agricultural area (Table 10). Meanwhile, the dominant land cover in the surrounding areas is Evergreen Seasonal Forest and Shrublands, covering 13,379 ha or 38% of the surrounding protected areas. The land cover tables and maps reveal other intriguing patterns. Most Cedrela is located outside of the agricultural area, like in Santa Cruz, where 76% of the 3098 ha Cedrela stands are located in GNP land directly adjacent to the southwest boundary of the agricultural area ( Figure 6, Table A1). Similarly, most Pennisetum stands are located outside of the agricultural areas, like in Isabela, where 93% of the 432 ha of Pennisetum is situated to the southwest of the agricultural area ( Figure 6, Table A3). Similarly, most mixed forest is found outside the agricultural area, like in Santa Cruz where 77% of 1166 ha of this land cover type surrounds Cedrela stands and straddles the altitudinal gradient almost as high as 700 masl to the north and as low as 120 masl to the southwest of the agricultural area ( Figure 6, Table A1). There are also native ecosystems that thrive within agricultural areas. For example, most evergreen forest and shrubland is found within the agricultural areas, like in San Cristobal, where 74% of its 2200 ha of this ecosystem is found around and to the East of El Junco (the island's largest freshwater body), but well within the agricultural area ( Figure 6, Table A2). San Cristobal's agricultural area is also covered to a comparable extent (2766 ha) by invasive species (Table 11, Figure A2). Identifiable patches of invasive species (Cedrela, Cinchona, Pennisetum, Psidium, Rubus, and Syzygium) constitute about 21% of the study area's surface area, most of it (7189 ha) distributed within the agricultural areas of all islands, compared to 5685 ha identified in the surrounding areas. Isabela is the island with the highest percentage of its agricultural area covered by invasive plants (36%, 1872 ha) and is also the island with the highest percentage of its surrounding areas (29%, 2149 ha) covered by invasive plants.

Spatial Coverage and Distribution of Land Cover Categories
Pastures (Cultivated grass, Silvopastures, and Pennisetum patches) constitute about 10.5% of the study area (Table 11, Figure A3). The vast majority (5620 ha) of pastures are located within the agricultural regions of all islands, compared to 733 ha found in the surrounding park areas. The agricultural area of Santa Cruz has the most extensive pastures (3726 ha), representing 32% of the agricultural zone's surface area.
Food crops (permanent crops, transitory crops, and Coffea patches) constitute about 8% of the study area across all islands (Table 11, Figure A4). Most food crops (4612 ha) are located within the agricultural areas, compared to 302 ha of GNP areas classified as food crops. Furthermore, most food crops (3914 ha) are categorized as permanent rather than transitory crops. Floreana's agricultural area has the highest percentage of its agricultural area covered by permanent and transitory crops (27%, 74 ha), but Santa Cruz has the largest extension (2549 ha) covered by food crops of all islands.
The estimates from Table 11 are conservative because the mixed forest and pioneer vegetation categories cover about 9.6% of the surface area, and these categories are, by definition, a mixture of other categories (Table 11, Figure A5). These vegetation types are about the same surface area (2926 ha) within the agricultural zones than in the surrounding park areas (2900 ha). San Cristobal is the We have also grouped categories corresponding to broader types, such as invasive vegetation, native vegetation, pastures, and food crops ( Table 11). Each of the more general categories from Table 11 has been represented as thematic maps (Appendix B). It should be noted that the estimate for invasive species is very conservative because, unlike previous studies [42], it excludes the mixed forest category from its calculation. Cumulatively, native vegetation categories (deciduous forest, evergreen forest and shrubland, evergreen seasonal forest and shrubland, and humid tallgrass, cover 50% of the study area, mostly distributed outside of the agricultural zones ( Figure 6, Figure A1). Around 4709 ha of native vegetation are found within the agricultural zone, compared to 25,647 ha covering the surrounding areas. The exception to this pattern is San Cristobal, where 30% (2535 ha) of its agricultural area was categorized as native vegetation (Table 11, Figure A1). San Cristobal's agricultural area is also covered to a comparable extent (2766 ha) by invasive species (Table 11, Figure A2). Identifiable patches of invasive species (Cedrela, Cinchona, Pennisetum, Psidium, Rubus, and Syzygium) constitute about 21% of the study area's surface area, most of it (7189 ha) distributed within the agricultural areas of all islands, compared to 5685 ha identified in the surrounding areas. Isabela is the island with the highest percentage of its agricultural area covered by invasive plants (36%, 1872 ha) and is also the island with the highest percentage of its surrounding areas (29%, 2149 ha) covered by invasive plants.
Pastures (Cultivated grass, Silvopastures, and Pennisetum patches) constitute about 10.5% of the study area (Table 11, Figure A3). The vast majority (5620 ha) of pastures are located within the agricultural regions of all islands, compared to 733 ha found in the surrounding park areas. The agricultural area of Santa Cruz has the most extensive pastures (3726 ha), representing 32% of the agricultural zone's surface area.
Food crops (permanent crops, transitory crops, and Coffea patches) constitute about 8% of the study area across all islands (Table 11, Figure A4). Most food crops (4612 ha) are located within the agricultural areas, compared to 302 ha of GNP areas classified as food crops. Furthermore, most food crops (3914 ha) are categorized as permanent rather than transitory crops. Floreana's agricultural area has the highest percentage of its agricultural area covered by permanent and transitory crops (27%, 74 ha), but Santa Cruz has the largest extension (2549 ha) covered by food crops of all islands.
The estimates from Table 11 are conservative because the mixed forest and pioneer vegetation categories cover about 9.6% of the surface area, and these categories are, by definition, a mixture of other categories (Table 11, Figure A5). These vegetation types are about the same surface area (2926 ha) within the agricultural zones than in the surrounding park areas (2900 ha). San Cristobal is the island with the most amount of vegetation within its agricultural area categorized as mixed forest or pioneer (1175 ha).

Validation
Our cross-validation yielded an overall kappa index of agreement (KIA) of 0.70. Eight of our categories had a KIA over 0.7, six categories had a KIA between 0.4 and 0.7, and three categories had a KIA below 0.4 (Table A5). The highest KIA corresponded to Pennisetum, and the lowest KIA corresponded to Coffea. Native land cover categories all presented high (>0.9) KIA values. Meanwhile, our ROC-AUC results are generally consistent, yielding 11 categories with an AUC above 0.7, four categories with an AUC between 0.5 and 0.7, and three categories with an AUC below 0.5 (Table A5). Cedrela was the only category with an AUC value above 0.9, while Pioneer, Bare Ground, and Freshwater had AUC values below 0.5. Humid tallgrass and Cinchona were not represented in our reference polygons because these land covers are usually found outside of the agricultural areas, and the most significant extensions of these vegetation covers are located in areas that are hard to reach. Table 12 lists validation results for each island and Table 13 details the results for each category. Only the most common land covers for the agricultural region are represented in these tables. We had an average R 2 score of 69 for all islands, which ranged from 58 (Floreana) to 83% (Santa Cruz). Our average RMSE was 21%. We had the most problems classifying pioneer (R 2 = 0.321, RMSE = 32.81%), Rubus (R 2 = 0.139, RMSE = 13.16), Coffea (R 2 = 0.191, RMSE = 8.84%), transitory crops (R 2 = 0.26%, RMSE = 17.8%), and cultivated grass (R 2 = 0.412%, RMSE = 32.81%). The classification that explained the largest amount of variation in the ground truth data were Pennisetum (R 2 = 1), Syzygium (R 2 = 1%, RMSE = 7.47%), Evergreen forest and Shubland (R 2 = 0.938, RMSE = 15.01%), and Cedrela (R 2 = 0.907, RMSE = 14.53%).

Discussion
Our results suggest that the agricultural zones of the Galapagos and their surrounding areas are incredibly diverse, and different vegetation covers thrive within very close proximity to each other. The complexity of Galapagos agroecosystems presented a challenge to mapping accurately and precisely the main vegetation covers occurring in the highlands of this UNESCO Natural World Heritage Site. These maps are the first attempt, using the latest satellite images available, to integrate vastly different land cover types (e.g., native ecosystems, staple food crops, invasive plant species) within the same decision-making tool, so that concerns of the agricultural and conservation sectors are both represented.

Methodology and Sources of Error
In this paper, we presented a replicable methodology that uses free and open-source data to monitor land use and land cover change in complex agricultural systems. Agricultural landscapes are a complex patchwork of food crops and other vegetation types, where even small-scale patches of vegetation can be a valuable habitat for animals of different kinds [82][83][84][85][86]. Having images with high spatial resolution is crucial for capturing small habitat fragments in the analysis, so methods that use freely-available high-resolution image collections provide a much-needed and inexpensive alternative to purchasing commercial high-resolution data for fine-scale analysis. Mapping an agricultural matrix in high resolution is a challenge because categorically different vegetation types can have similar spectral signatures and be cultivated in proximity with other vegetation types; our object-oriented random forest classification of fused images tackles this challenge by combining data sources with complementing resolutions, taking into account the geometrical properties of landscape features in addition to their reflectance, and by classifying elements with an algorithm that provides reliable results with limited training datasets [24,27].
Unfortunately, two land cover types, Lantana and Erythrina, had to be dropped from the final mapping outcomes because we did not have enough reference points to have consistent results during classification. In the case of Erythrina, their proximity to other dense vegetation types, such as permanent crops, Cedrela, and Psidium, was an issue during segmentation. Erythrina trees are usually planted in long stands as living fences are often pruned, so their crowns tend to be relatively small. Therefore, the size of Erythrina stands was insufficient to be adequately captured by the sensors, even at 3 m resolution. The small size and proximity to other vegetation types caused segments to include neighboring pixels of different land cover types and confuse their spectral signatures. Similarly, in the case of Lantana, we did not have enough examples of patches that were large enough to produce a clear spectral response. Additionally, we were limited in our reference data available because some areas lacking information were too remote or of difficult access to reach them by foot to collect GPS control points. On the other hand, weather conditions in the highlands also made image collection with a UAV difficult and sometimes impossible within the timeframe available for fieldwork, as was the case with Isabela's highest regions within our study area. Therefore, areas that were easier to access or had favorable weather conditions were more represented in our samples.
Additionally, land cover units grouping several species, instead of mapping individual plants, were adopted because these types of general categories are used in policy documents or previous studies. While this makes our results relevant to the intended users of the map (i.e., MAG) and compatible with existing data sources, these categories forced us to group together vegetation types into land cover units that likely had a wide range of spectral signatures. For example, permanent crops, transitory crops, silvopastures, and mixed forest are land cover units that are heterogeneous by definition. For instance, permanent crops are often tree crops, but MAG also includes pineapples in this category since they are technically perennial crops [40]. Pineapple plantations have a very different spectral signature than tree stands. This inclusion might explain why permanent crops were the most frequently misclassified category (error rate = 28% in Santa Cruz, Table 9). Similarly, silvopastures are, by definition, a mixture of grass and tree species, both of which could be of different types and have vastly different reflectance, and this probably contributed to their misclassification and high error rate (error rate = 38% in Santa Cruz, Table 9). Likewise, the mixed forest is defined as an area where invasive species co-dominate with native species [42]. Given there is a wide range of species, both native and introduced, that could fit that description, it is understandable that humans and computer algorithms could easily misclassify these areas (error rate = 12% in Cristobal and Floreana, Table 9).
Another challenge to mapping the agricultural area is that this is a very dynamic landscape, and its land cover is always changing. For example, both transitory crops and pioneer vegetation are, by definition, short-lived; farmers harvest transitory crops in less than one year, and succession quickly leads other plants to take over disturbed areas where pioneer plants once thrived. This short life probably confused their classification because the spectral life of specific land cover types is shorter than the two-year timeframe when the different satellite images were collected. Short spectral life might be one of the main reasons why the 'pioneer' land cover unit received the lowest R 2 score (0.032) during map validation. Despite their ephemeral nature, it was important to include these categories to remain consistent with other data sources and to try to capture a snapshot of current vegetation cover. Furthermore, temporary vegetation covers like 'pioneer' or heterogeneous categories like 'mixed forest' might signal that these areas are currently in flux, so their presence helps represent the dynamic nature of the agricultural areas.
The landscape and the classification units themselves are not homogenous, so traditional methods for assessing accuracy are limited by the assumption that each area in the map can be unambiguously assigned to a single category, and by expressing classification errors as either "right" or "wrong" without reference to the magnitude of the error [87]. We initially attempted a point-based accuracy assessment, but this yielded misleadingly low results because GPS control points of individual plant species could not be unequivocally assigned to any one category. Therefore, we opted for a validation method that could accommodate for the landscape's and our classification scheme's ambiguity. Expressing our accuracy in terms of percentages and correlations RMSE and R 2 was more relevant to the users of our classification because we are comparing manually-classified reference polygons with our classification results. However, we included other metrics common in land cover classification and machine learning (KIA, AUC) to reach readers that are more accustomed to them.
Our accuracy assessment results are in a similar range as those from similar object-based land cover classifications [24,69]. Our overall KIA value of 0.7 suggests that our classification results are significant and are useful overall. Categories with KIA or AUC values >0.7 correspond to land cover types with a forest or pasture structure (Table A5). The land covers that were confused most often were bare ground, freshwater, and pioneer cover types. (Table A5). All three land cover types covered relatively small surface area extensions, so their representation in the validation reference polygons was minimal. This error was especially strong for freshwater (KIA: 0.34, AUC: 0.314) because most visible freshwater occurs in microreservoirs, for which reflectance can change drastically depending on whether or not they are filled. Only one microreservoir fell within the reference polygons for classification. Bare ground (KIA: 0.271, AUC: 0.386) was understandably confused with transitory crops (KIA: 0.446, AUC 0.628) because these areas are usually denuded from other vegetation, and so the spectral response of the bare soil gets confused with that of the crops themselves. Bare ground is also hard to distinguish from built environment (KIA: 0.516, AUC: 0.685) because most roads of the humid highlands remain dirt roads and are not paved yet. Pioneer land cover (KIA: 0.22, AUC: 0.31) was probably confused with several other categories due to their ephemeral nature, as described above. Of the four categories with an AUC value between 0.6 and 0.7, two correspond to land cover types that grow below the forest canopy, such as Coffea (KIA: 1.927, AUC: 0.676) and Rubus (KIA:0.417, AUC: 0.603), so it makes sense that their AUC values are not as high, as their spectral signatures can become confused with other vegetation types. The other two categories with an AUC between 0.6 and 0.7 are built environments and transitory crops, which, as mentioned earlier, both become confused with bare ground.
Our results suggest that our classification map likely represents general patterns (large patches) relatively well, especially for categories that had a clear spectral signature and, thus, had a high correlation coefficient as well, such as Cedrela, Pennisetum, and Syzygium. However, the landscape's heterogeneity means that the map likely becomes more inaccurate as one approaches smaller scales. The fact that the two largest islands, Santa Cruz and San Cristobal, had the highest R 2 values (0.83 and 0.7, respectively) might be due to these islands having larger farms with a more regular and well-delimited configuration of their crops than smaller farms from Isabela or Floreana.
While human digitation using high-resolution UAV images would usually have more accurate results than using segments mapped by a computer, the fact that we are using satellite images with a 3 m spatial resolution means that we are within the margin of error for most GPS devices, including our drone. This might cause misalignment with the high-resolution images that we used to obtain the geometry for the reference polygons, thus lowering our accuracy results. Furthermore, given that we used the most homogenous land cover patches visible within our UAV images to help us pick areas to train the algorithm, remaining unused areas of our images were also the most mixed and harder to categorize, which might have affected our accuracy scores as well.

Vegetation Cover Abundance and Distribution
The fact that the areas surrounding the agricultural areas are GNP lands under conservation means that we expected to find a difference in the vegetation covers within and outside the agricultural areas. As expected, we found that most native vegetation is distributed outside the agricultural zone while invasive species dominate the area within the agricultural areas ( Figures A1 and A2). However, we also expected to see this pattern due to the long history of agricultural exploitation within the Galapagos. The most aggressive agricultural expansion occurred in San Cristobal from 1879 to 1904 under the rule of Manuel J. Cobos, and this set the tone for commercial development of agriculture in other islands as well. During this time the highlands of San Cristobal were denuded of native vegetation, P. guajava and avocado trees were brought for pig food, Sygzygium trees were imported to provide shade for wool sheep, pastures were introduced for extensive cattle grazing to sell cowhides, up to 3000 ha of sugar cane were planted for sugar and liqueur production and export, and over 100,000 coffee plants covered 110 ha of plantation [88]. In comparison, today, all food crops found in our study area combined barely cover a third of the area that was once covered by sugar cane alone. San Cristobal has the most substantial extensions of Sygzygium trees in the entire archipelago (374 ha), and P. guajava covers 23% of the agricultural area. While agricultural activity as a whole has become a fraction of what the island once supported, the area covered by coffee plantations has increased to 191 ha. Given the level of agricultural exploitation, perhaps what is surprising is that San Cristobal also shows the most amount of native vegetation within its agricultural area, which is likely due in part to the work of the GNP reforesting the areas around El Junco with Miconia robinsoniana.
Other islands followed similar patterns but at different times. Floreana's first successful inhabitants took residence in the 1930s, and they subsisted by modifying the highlands by knocking down trees, clearing the land, hunting wild cattle, and introducing seeds for crops. For example, the Cruz family, who arrived to Floreana in 1937, dedicated their time to agricultural production and cattle ranching. As their family grew, so did their land and the pastures that covered them; because the population was so small (55 people by 1975), life in Floreana was only assured through agricultural production. By 1965, farmers planted entire hectares of individual transitory crops year-round, and they harvested the fruit from citrus trees that grew wild inside and outside of the agricultural areas [88]. These patterns are visible in our results, as there are large areas of land dedicated to cattle production and mixed forest surrounds the agricultural area. We also see evidence of more recent events, such as the spread of R. niveus and P. guajava across the landscape [88].
In Isabela, there was a 900 ha farm called "La Hacienda" that was established in 1897 by Carlos Gil Quezada. Quezada's main economic activity was cattle ranching, and through the 1950s, he captured wild cattle that roamed the pampas of Sierra Negra volcano and exported them to Guayaquil [88,89]. Given that La Hacienda is reported to be relatively near the southeastern border of the present-day agricultural zone, it is entirely possible that the pastures visible to the southwest of the agricultural area are the same areas that were used as cattle hunting grounds by Gil Quezada. It would seem unusual that pastures are found outside of the agricultural zone, and the species found in those grasslands remain unconfirmed. We do not have direct verification of the vegetation of this site, but the spectral signature of this region is distinct, and during the development of the land cover map of the GNP [42], park rangers indicated to the authors that these are Pennisetum grasslands, so we categorized this region in the same manner. The observed large expansions of invasive species in Isabela are consistent with the recorded process of extensive cattle grazing and subsequent abandoning of lands since the 1980s as meat production became less profitable, leaving Isabela's last remaining stronghold of agricultural production concentrated in the lowest (easternmost) section of the agricultural zone [88], which is the same region where transitory and permanent crops can be seen today.
The prevalence of pastures in the agricultural area of Santa Cruz is well reported [40,90,91] and is consistent with our results. The presence of Cedrela odorata stands in GNP lands (but barely in the agricultural area) makes sense because, since its intentional introduction in the 1940s, the Ministry of the Environment prohibited the exploitation of Cedrela in 2007 due to its threatened status in the continental area. However, by 2009, it allowed the extraction of C. odorata once again, but only within the agricultural zone [92]. The distribution of P. guajava in Santa Cruz is also consistent with our current understanding of how cattle and tortoises are dispersing this invasive plant [51], and the presence of other invasive plants like R. niveus was also expected in all four inhabited islands [93]. It should be noted that plant species like R. niveus and Coffea commonly grow in the understory of a vegetated area. When trying to capture vegetation that lies in the understory, the spectral signal becomes mixed with that of the tree canopy, so ours is an underestimate of the true extent of these vegetation types.

Comparing and Combining Results with Existing Datasets
Given that this is the first study to map the agricultural areas in high-resolution, we can compare our results with self-reported surface areas of the agricultural zone from the 2014 agricultural census [40]. Table 14 lists the main categories used by the agricultural census on the left and the corresponding categories of this study to the right. We would expect to see discrepancies between the results for several reasons: the agricultural census does not encompass all producers, but rather a sample of 755 of the most productive and representative ones, whereas our study encompasses the entirety of the agricultural area. Therefore, our estimated land covers might include areas that are not tended by anyone. This might account for dramatic increases in reported permanent crops, for example, as many fruit trees within the agricultural area might simply be growing wild, and respondents might not have taken them into account for their responses. Furthermore, both sources have their own inaccuracies: while our study is susceptible to misclassifications, self-reported values may be susceptible to under or over-estimating surface areas. This might be the case for Transitory crops, where our classification might easily categorize 'tilled land' and possibly even fallow land as transitory crops. Perhaps not coincidentally, the two values become very similar if tilled and fallow lands are included in the estimate. Furthermore, respondents to the census may have different definitions of what falls within each category. For example, some farmers might not consider P. guajava an invasive plant, as they might see it as an important element of their silvopastures. This is the reason why, for Table 14, we categorized Pennisetum as a pasture and excluded it from the invasive category because farmers use this grass species for their cattle and would likely not consider it 'invasive'. Furthermore, the category "Pioneer and forest species" is a category that encompasses any vegetation that has no agricultural value for farmers, so it is very general. Despite the many ways in which these data might not match, the dramatic decrease in pastures and the astounding increase in invasive plants across all four islands are probably real trends. This would be consistent with the observed trend of decreasing pastures between the 2000 census (14,555 ha) and the 2014 census (11,126 ha) [41]. This would also be consistent with the fact that the region's worst-recorded drought occurred in 2016, which MAG officials reported drove many cattle ranchers to abandon this activity as many cattle died due to the dry conditions [personal communication]. A drop in pastures, combined with an increase in invasives, is exactly what we would expect from people abandoning their lands. Pastures have a spectral signature that is very different from other land cover types, so it is unlikely that they were so heavily misclassified. However, it is possible that even though we grouped silvopastures as part of our estimate for pastures, farmers might still identify more heavily wooded areas as "pastures" while our algorithm might have identified the same region as a forest of some type. Lastly, although it is hard to compare specific numbers due to differences in methodologies and missing information, authors did find a general decrease of pastures in the agricultural areas and an increase in invasive plants in Santa Cruz and San Cristobal from 1987 to 2006 [90]. The present study serves as much needed baseline data with a replicable methodology that can complement existing studies [42] to give us, for the first time, a truly complete view of land cover for the entire archipelago (Appendix C, Table A6).

Broader Significance
Our methods and our findings are relevant to complex agricultural landscapes in general and in ongoing conversations about conservation and food security, particularly in the context of other oceanic islands, as well as 'islands' of protected forests within an agricultural matrix. Our replicable methods are useful to map other islands, other agricultural landscapes, and other time periods beyond the ones presented here.
The fact that the agricultural sector has been neglected, especially in comparison to more robust economic sectors like tourism, development, or conservation, is not unique to the Galapagos. There is a global trend of abandonment of the agricultural sector and a migration of rural population into urban centers [94,95]. Furthermore, oceanic islands worldwide are typical tourism hotspots and suffer from similar pressures as does the Galapagos: reduced space for food production and an overdependence on food imports to sustain their human populations [96][97][98]. Our results are relevant in the context of global food security and particularly in food security on islands because of familiar socioeconomic and environmental drivers of pattern in island ecosystems [98,99]. Baseline data such as that provided in this study is essential to track changes in land use and land cover in our food systems and to anticipate the ways in which climate change can make them more vulnerable to collapse, as well as to help us identify the land uses and land covers that are most resilient to environmental change [99,100].
The vast extent of pastures found in our study should prompt decision-makers to reconsider whether this land use is the most sound in a place where the space to grow food is so limited. Our study confirms that soil depth and precipitation patterns in this region can support an astounding diversity of permanent and transitory crops, and pasture monocultures to feed cattle might not efficiently support local food security. It is relevant that decisionmakers of the Galapagos have promoted silvopastoral systems as a more sustainable alternative to pasture monocultures. Given that most pastures and silvopastures of the Galapagos are grain-fed, having an estimate of the distribution of silvopastoral systems allows us to compare their performance in the context of more variable and extreme seasonality.
Our results are also relevant in the conversations about novel ecosystems and island biogeography. Our focus on native ecosystems and invasive species make our results applicable to other island ecosystems because the biogeographical features of oceanic islands lead to high endemism rates and simultaneously make them more susceptible to invasion from introduced species [101]. In addition, there is a clear spatial correlation between human activities and invasive species, even long after these activities were abandoned [1,102]. For example, even though San Cristobal has not had sheep raised for wool in recent history, the fact that Manuel Cobos once used Syzygium on this island to give shade to sheep [88] is likely the reason why over a hundred years later Syzygium remains concentrated on San Cristobal rather than in other islands. Human agency profoundly modifies the agricultural landscape and its surrounding areas, and our results support previous observations that introduced species can become integrated into native landscapes, perhaps irreversibly [38,103]. For example, the extent of colonization by Psidium guajava or Rubus niveus pose a particular challenge for conservation projects that would like to see protected areas returned to a pre-human state. These plants now serve as an essential food source for endemic fauna like finches and tortoises and, thus, have become profoundly integrated into the life cycles of the emblematic species who disperse the seeds across their habitat [51,[104][105][106]. Our results show examples of what previous authors have called novel ecosystems in which humans have crossed some ecological thresholds that gave rise to a historically new and stable species assemblage [107]. However, our results also demonstrate the capacity of human agency to modify the environment in the face of highly aggressive invasive plants. For example, Cedrela odorata's distribution in Santa Cruz and the prevalence of native ecosystems within the agricultural area of San Cristobal both suggest that the work of private landowners and the GNP can be effective at controlling the spread of invasive plants and propagating the land cover of our choosing.

Next Steps
We envision that one of the potential next steps following this baseline map of the highlands of the Galapagos would be the use of additional vegetation indices to recategorize existing land covers with greater detail. New and more specific categories must be created to tease apart the differences in general categories, such as the current 'mixed forest,' 'permanent crops,' or transitory crops'. It is also advisable that future iterations of land cover maps be able to distinguish introduced plants like Lantana and Erythrina.

Conclusions
In this paper, we have presented a replicable and reliable methodology for land cover classification combining freely-available high-resolution data sources and taking advantage of their complementing spectral and spatial resolutions. The methods described are well suited to classify complex agricultural landscapes by taking into account the geometry of scene elements in addition to their reflectance.
We found that our spatial resolution was insufficient to reliably detect rows of living fences of Erythrina since their crown size was often less than the 3 m pixel size of our data sources. However, our methods were able to reliably identify stands of invasive plants like C. odorata, S. jambos, and P. guajava. Our results suggest that out of the 25,261 ha of the agricultural zone, invasive plants cover most of the surface area (28%), mostly dominated by P. guajava (4959 ha). Pastures for raising cattle cover 22% of the agricultural zones, the majority of which are located in Santa Cruz. Most native vegetation is distributed outside of the agricultural areas, but almost 19% of the agricultural zone's land cover was identified as native vegetation, mainly located in San Cristobal (2535 ha). Food crops of different kinds cumulatively cover a similar percentage (18%) of the surface area, and about 944 ha of these are dedicated to coffee cultivation, mostly located in Santa Cruz. About 12% of the agricultural sector is covered by vegetation that could not be clearly identified as either native or invasive vegetation.
Multiple institutions and researchers have expressed the need for a high-resolution and updated and land cover classification of Galapagos agroecosystems. Local agencies in the agricultural sector can now use these data for identifying current agricultural resources and planning land management projects. For example, the classification maps will inform the GNP and the MAG about priority areas for intervention to control invasive species; having clear estimates of surface area covered by different invasive plants allows institutions to make realistic estimates of approximate costs for their control as part of long-term plans for the region. Authorities can now ascertain the trends of the region's food basket with greater confidence, to ascertain whether food crops and pastures have increased or diminished in relation to previous years as well as in relation to future assessments. These data help local authorities plan strategic interventions, such as estimating how many people are necessary to make the most of land that is already being planted, as well as how many people would be necessary to reactivate areas currently dominated by invasive plants. Additionally, researchers can use these baseline data to anticipate changes in land cover as socioeconomic and environmental drivers continue to change observed land use and climate patterns. The land cover categorization and mapping strategy is designed to be reproducible so that it can be applied to future images to track changes in these agroecosystems.
The dataset presented here is designed to suit the needs of both agricultural and conservation practitioners. We aim to encourage collaboration between these two sectors to ensure the well-being of Galapagos inhabitants and the sustainable preservation of its ecosystems and to serve as a model for land use mapping in other environments globally where agroecosystems host essential species for conservation.              Figure A5. Distribution of mixed forest and pioneer vegetation cover in and around the agricultural zones of the inhabited islands of the Galapagos.