Who and Where Are the Observers behind Biodiversity Citizen Science Data? Effect of Landscape Naturalness on the Spatial Distribution of French Birdwatching Records

: The study of spatial bias in opportunistic data produced by citizen science programs is mainly approached either from a geographical angle (site proximity, accessibility, habitat quality) or from the angle of human behavior and volunteer engagement. In this study we linked both by analyzing the effect of observer proﬁle on spatial distribution of recordings. We hypothesized that observer proﬁle biases spatial distribution of records and that this bias can be explained by landscape naturalness. First, we established observer proﬁles from analysis of the temporal and spatial distributions of their records as well as record contents. Second, we mapped a naturalness gradient at regional and local scales. Using a dataset of more than 7 million bird records covering a time span of 15 years from the west of France, we deﬁned four types of observer: garden-watchers, beginners, naturalists, and experts. We found that recording intensity could be related to naturalness at regional level; most visited areas were those where naturalness was on average lower i.e., close to population basins and highly accessible due to well-developed road infrastructure. At local level (neighborhood of recording sites), we found that experts and naturalists recorded in areas of higher naturalness index than those of garden-watchers and beginners. These results highlight how records contributed by different types of observer may lead to complementary coverage of different areas of the landscape. Future studies should therefore fully consider observer heterogeneity and how different observer proﬁles are inﬂuenced by local landscape naturalness.


Introduction
Citizen science-the involvement of volunteers in data collection, analysis, and interpretation-holds great promise for nature conservation [1]. Species observation data collected through citizen science allows large-scale monitoring (spatial and temporal) that would be impossible or too expensive to carry out with only professional scientists [2][3][4]. Citizen science (CS) has already made enormous contributions to conservation science, and this approach of harnessing the power of data, information and voluntary skills has the potential to do much more [5]. For a review of studies which directly contribute to understanding of the role of CS in conservation sciences, see [6].
Citizen sciences cover a large range of programs with a great diversity of methodological approaches. This diversity ranges from CS programs with standardized protocols intended for volunteers with good naturalist skills, to CS programs open to any observer which promote the collection of opportunistic data [2,7,8].
Used the landscape naturalness gradient approach to explain the spatial distribution of observations at regional level; 3.
Explored the links between observer profiles and landscape naturalness in the neighborhood of their records.

Bird Dataset and Preprocessing
The CS bird dataset used in this study was collected by the French BirdLife International Partner (Ligue pour la Protection des Oiseaux, LPO). The data were considered as opportunistic because most came from participants who recorded whatever, wherever, and whenever they chose. Participants could contribute their records online after their outing or, since 2017, directly in the field using the smartphone application NaturaList to geolocate record. Observations may involve heard birdsongs or direct, visual observations. There is no minimum or maximum outing duration and observers are free to declare all or part of their observations. Moreover, observers do not need any prerequisites to register and they do not receive any training. The only obligation is to create an online account by specifying some mandatory personal information (e.g., identity, date of birth, address, contact number). After record declaration, a data validation procedure by a member of LPO is used to control and filter out unlikely records, which means that each record submitted must be validated to be added to the database. The dataset used in this study included more than 7 million records collected by nearly 10,000 participants. It covered a time span of 15 years (from 2005 to 2019) and a spatial extent of 32,000 km 2 , corresponding to the administrative region of Pays-de-la-Loire in western France.
The data were prepared as follows: first, data collected and managed by several local groups were aggregated so that each record was associated with a single observer identity. Second, the aggregated dataset was checked so as to include only those records for which the observed species, date and time, X, Y coordinates, as well as observer information were available. Finally, the unified database was cleaned of incomplete and erroneous records.

Observer Profiles
Our aim was to define observer profiles based on an original set of metrics integrating spatial, temporal, and content characteristics of recording. Since the dataset used covered a period of 15 years, during which the objectives, methods and data collection tools had greatly evolved, it was assumed that observer profiles had also evolved over time leading to a certain heterogeneity in recording behavior [35]. We used the analysis of bird records to deduce observer habits using metrics inspired by previous work on "engagement characteristics" of participants in CS [7,28,33,34]. Initially 22 metrics were developed, of which 11 were retained after exploratory analyses, principal components analysis (PCA), and correlation tests. The table of the 22 metrics, the correlation matrix, and the PCA plots on axes 1-2 and 3-4 of the 11 metrics are given for information in Appendix A. These metrics provided information on three aspects of observer behavior: temporal habits (T) (cumulative number of recording days, recording day frequency, recording effort, percentage of records made during the week (Monday to Friday)), spatial habits (S) (spatial amplitude, spatial density, minimum geometric extent), and record quality (C) (number of records, species richness, Species Generalization Index (SGI), Species Synanthropy Index (SSI)) ( Table 1). Table 1. The 11 metrics used to analyze observer profiles, in three categories (temporal habits (T), spatial habits (S), and record quality (C)), with their description and methods used for their calculation. Only observers with at least two recording days and five observations were retained for analyses, i.e., 5896 observers out of a total of 9878. The objective of these thresholds was to retain only observers who had shown a certain commitment to the program. Following this, we explored similarities and differences in observation habits using hierarchical cluster analysis (HCA) and Ward's minimum increase of sum-of-squares (of errors) method, without predefining a number of desired clusters. Observer profiles were identified from the dendrogram by analysis of inertia. We used the partitioning with the greatest relative inertia loss, identified with hierarchical clustering on principle components (HCPC).

Mapping the Naturalness Index (NI) of the Study Area
In order to interpret the spatial distribution of the recording, we chose to focus on the degree of naturalness of the landscape (naturalness index). This integrated method positions landscapes on a relative, quantitative scale ranging from "pristine" to "urban" [36] or more simply from "wild" to "not wild" [37]. This replicable method for assessing landscape quality has been used in numerous studies worldwide [37][38][39][40] and it includes most of the spatial variables already tested and cited in the introduction (proximity, accessibility/remoteness, habitat quality) facilitating comparisons with other work.
To quantify and map the naturalness index of the landscape, we adapted previous methodology [37,40,41] to our regional context and to the available spatial data. Among the four attributes generally used, we retained three: hemeroby of landcover (also called naturalness of landcover), human impact on landscape, and remoteness from road access. The fourth attribute, ruggedness, was excluded from our study because it was not relevant for the landscapes of our study area which are generally low-lying (maximum 416 m above sea level).

Landscape Hemeroby
The hemeroby concept provides a measure of anthropogenic impact on landscapes and habitats using a scale, in which the highest values (ahemerob) correspond to "natural" or undisturbed landscapes while artificial landscapes obtain the lowest values (metahemerob) [42,43]. To use the most recent and accurate data, a composite land cover map from different thematic datasets related specifically to natural and semi-natural vegetation, agriculture, forests, rivers, and human infrastructure was created. The details of the data used are given in Appendix B. To create the composite layer map, each dataset was first rasterized at 20 m then reclassified on a hemeroby scale and finally aggregated with the other data, giving priority to the most precise and recent data in cases of overlapping. For hemerobic reclassification, we applied a predefined hemeroby scale of landcover [43][44][45] which we adapted to our landcover dataset. The hemeroby scale ranged from level 6 ("ahemerob", i.e., no human impact) to level 1 ("metahemerob", i.e., destroyed, originally biocenosis). For each hemerobic level (1-6), we applied a second level of precision (1.1, 1.2, . . . ) in order to consider the precision of the dataset used. The second level of hemerobic classification was based on local, expert knowledge. The complete table of hemerobic reclassifications is given in Appendix C. To account for the influence that the pattern of land cover immediately adjacent to the observer has upon hemeroby, the average hemeroby score of all cells within 250 m of the target cell was calculated as proposed by [37]. The final hemeroby scale ranged from 1 to 255.

Human Influence on Landscape
The "human influence on the landscape" attribute was quantified and mapped by two human presence proxies: human population and modern human artefact density. Similarly to Müller et al. (2015), we first used human population density as a proxy for human influence on landscape. To measure the population density, we used "Filosofi" data, which are the most precise population data available in France, collected at household level. To guarantee personal confidentiality, the French national institute of statistics and economic studies (INSEE) makes the data available in aggregate form on a 200 m grid. We used the Filosofi dataset for 2016, rasterized at a resolution of 20 m. The modern human artefact indicator refers to the quantity of artificial structures within the visible landscape, including railways, pylons, buildings, and other built structures. Similarly to [37], a number of modern human artefacts were extracted from the BD TOPO (IGN) and aggregated in a single dataset. The density calculation of the human modern artefacts area was performed with the focal statistic tool (ESRI, ArcGIS Pro 2.). For each cell, this tool calculates the location of a statistic in the neighborhood of a radius of 500 m circle (0.7854 km 2 ). The radius of the moving window was chosen by an empirical method and according to [46]. We first tested three radial distances (500 m, 1 km, and 2 km) and kept the best compromise between visual representation and the normal distribution of the resulting data. The modern human artefacts map result is expressed in area of built-up land per square kilometer. The two layers were aggregated with equal weighting and reclassified on a relative scale from 1 to 255.

Remoteness from Roads and Paths
Remoteness from access is a very common indicator of naturalness maps (e.g., [37,38] and [40]). Remote areas are important both for species sensitivity to human presence and disturbance [47] and for the opportunity of solitude which some observers may seek [48]. We calculated the Euclidean distance in meters from all roads and paths including a buffer of 1 km around our study site. Only motorways were not considered because we assumed that it was not possible to go walking from these fenced roads. The remoteness from access layer was reclassified on a relative scale from 1 to 255. To produce the final map of naturalness index, we computed the sum of the three layers (hemeroby of the landscape, human influence on landscape and remoteness from access) with equal weighting. The final map had a resolution of 20 m and the naturalness index ranged from 1 (minimum naturalness index) to 255 (maximum naturalness index). All spatial analyses were carried out with ESRI ArcGIS pro 2.7.

Statistical Analyses
An illustrated workflow of the data analyses is given in Figure 1.
Land 2022, 11, x FOR PEER REVIEW 6 of 24 layer was reclassified on a relative scale from 1 to 255. To produce the final map of naturalness index, we computed the sum of the three layers (hemeroby of the landscape, human influence on landscape and remoteness from access) with equal weighting. The final map had a resolution of 20 m and the naturalness index ranged from 1 (minimum naturalness index) to 255 (maximum naturalness index). All spatial analyses were carried out with ESRI ArcGIS pro 2.7.

Statistical Analyses
An illustrated workflow of the data analyses is given in Figure 1.  2.4.1. Testing the Effects of Naturalness Index on the Spatial Distribution of Recordings, at Regional Scale Using GIS, a pre-existing 2 × 2 km grid used for French bird atlases was retained for the analysis across the study area; cells encompassing less than 80% of regional land cover (mainly boundary cells) were removed, leading to a grid of 7924 cells. First, mean naturalness index (NI) was calculated for each cell. Second, the exact locations of bird records were intersected with the regular grid ( Figure 1). The number of records was calculated within each cell and all cells were then divided into five classes of recording intensity (RI) of equal intervals ranging from very low recording intensity to very high ( Figure 1). We retained five classes which corresponded to the best compromise between a sufficiently large number for the analysis but small enough to guarantee a good, visual mapped representation. Moreover, because only a few cells had a very large number of records, the RI were log transformed to perform the classification with a normalized data set. In order to test the effect of NI on RI classes, differences in mean NI between classes of log RI were tested using non-parametric Kruskal-Wallis (KW) tests because of heteroscedasticity of the data ( Figure 1). Differences in mean NI were tested between each combination of RI classes with Dunn's post-hoc test of multiple comparisons using rank sums. We used Bonferroni method to adjust the p-value for multiple comparisons.

Testing the Hypothesis of a Homogeneous Distribution of the Records among Observer Profiles and Recording Intensity Classes
To test the hypothesis of a homogeneous distribution of the records according to observer profile and RI class, a Chi square test was applied on a contingency table. This table gathers the number of records of each observer profile in the different classes of RI. In order to interpret the intensity and the direction of each relationship (positive or negative), Pearson residuals were calculated and plotted.

Testing the Effect of the Naturalness Index on the Recording Intensity of Observer Profiles at the 400 m 2 Record Neighborhood Scale
To test whether naturalness index (NI) in the neighborhood of bird records could explain recording intensity of the different observer profiles (OP), we calculated the mean NI of the total records for each OP. Contrary to Section 2.4.1, NI was calculated at bird record scale in order to directly consider the neighborhood around each recording point (and not at the scale of the 4 km 2 grid). For this, we used the highest resolution of the NI map which is 20 m (i.e., 400 m 2 ). For each OP, mean NI of record neighborhoods were presented in violin plots to reflect the full distribution of the data, smoothed by a kernel density estimator.
We then estimated a confidence interval (CI) of the NI means by bootstrapping method replication. For each OP, we resampled, replacing the original samples 999 times. We used the one.boot and perc function of the "simpleboot" R package for bootstrapping the mean statistic and extracting the CI at 95%. We then plotted the distribution of NI means as a histogram and added the initial sample NI mean and the CI boundaries (p = 0.05) for each of the four OP.
To test the differences in mean landscape NI between the OPs, we also calculated the CI for the pairwise comparisons. We used the two.boot function from the "simpleboot" package to bootstrap the difference between means of OP with 500 resamples and extract the CI at 95%. This number of bootstrapping was the maximum supported by the memory of our computer given the size of the original samples. We then plotted the distribution of differences as a histogram and added the initial sample mean and the confidence interval boundaries (p = 0.05) for each of the pairwise comparisons.
Finally, to ensure that the observed differences were not linked to the lack of data in some parts of the study area, this analysis concerned only the high and very high RI classes. In addition to the R functions and packages already mentioned, we also used the following packages for our analyses and graphs: "ade4", "corrplot", "ggplot2", "ggpubr", "graphics", "questionr", "rstatix", "vcd".

Four Observer Profiles
We found that the 11 metrics used to describe observer habits clustered participants into four distinguishable groups. Inertia analyses revealed that this classification best optimized the trade-offs between the number of groups and the within group sum of squares. The dendrogram of the distance matrix of records and the analysis of the loss of relative inertia as a function of the number of groups are given for information in Appendix A ( Figures A2 and A3).
Based on analysis of each group's metrics we named the four observer profiles as follows: "garden-watchers", "beginners", "naturalists", and "experts" ( Table 2). Table 2. A summary of variation in metrics between observer profiles across the dataset as a whole. N.B. All 15 years of data over the entire region are pooled in this analysis. When mean is given, standard deviation is added as (sd).

Garden-Watchers
Garden-watchers represented 20.47% of observers and only 0.59% of total records. Garden-watchers had few recording days, which were widely spaced in time but very spatially concentrated (less than two 4 km 2 cell on average). Members of this group were not necessarily beginners (they had been registered for more than 2 years on average), the vast majority recorded mainly at weekends (80% of records), and mostly synanthropic species.

Beginners
The beginners represented the biggest group in terms of numbers of observers (46.49%) but collected only 3.79% of total records. They were new to CS; they had been registered on average less than a year but had a fairly high level of activity. On the other hand, they made only a few records per recording day. The records were distributed among 4 km 2 cells on average; they were therefore not limited to the home base but covered a small part of the study area. The beginners were clearly distinguishable from the other profiles by a minority of records made during the week (20.69%). Finally, the species observed were on average less generalist than the other groups (generalization index = 29.29) but relatively synanthropic (index of synthropy = 36.23).

Naturalists
Naturalists formed around a third of observers (30.73%) and a third of records (36.31%). These were experienced observers, who had been registered for an average of 5 to 6 years.
Their observations were distributed among 45.27 4 km 2 cells on average. This suggests naturalists observe not only around the home base, but also at a fairly stable list of sites, since the density of records is much higher (41.05) than for the other observer profiles. Finally, this was the profile with the lowest mean of synanthropic species records but the highest mean of generalist species records.

Experts
Only a small proportion of observers (136 observers i.e., 2.31%) formed this group but they collected the majority of all records (59.31%). They were distinguished by their seniority (registered10 years on average) and a large number of recording days (mean = 1578). They recorded the most frequently (every 4 or 5 days) and made many recordings per day. They covered an average of 427 cells (i.e., 10 times more than the naturalists). In addition, they recorded both rare and common species, more closely resembling professional, protocoled recording. They recorded a large number of species but the birds observed were often generalist species (SGI = 30.45).

The Naturalness of the Pays-de-la Loire Region
At the scale of the Pays-de-la-Loire region, we found an average hemeroby of 127.28 (sd 60.11), an average human influence of 247.23 (sd 12.01), and a remoteness from access of 7.26 (sd 8.16). With an aggregation of the three attributes, we created the naturality index (NI) map ( Figure 2). We found an average NI of 128.15 (sd 14.48). However, NI is far from homogeneous across the study area. We found a higher NI on the eastern and northern part of the territory (dark greens patches are mainly forest areas) and along the Loire river (with lakes and associated wetlands). Conversely, the patches of low naturalness index corresponded to the region's main cities: Nantes (to the west), Angers (to the east), Laval (to the north west), and Le Mans (to the north east).

Mean Naturalness Index (NI) Varied between Classes of Recording Intensity (RI)
Mean NI differed significantly between classes of RI (KW, P = 1.38 × 10 −8 ). Less-visite areas had higher mean naturalness than intensely visited areas (Table 3). We found a mea naturalness of 129.6 (sd = 7.35) for very low RI class; 128.6 (sd = 7.50) for low RI class; 128. (sd = 9.70) for moderate RI class; 127.3 (sd = 13.03) for high RI class and 125.5 (sd = 13.40 for very high RI class. Multiple comparison tests revealed that NI was significantly higher in classes of low

Mean Naturalness Index (NI) Varied between Classes of Recording Intensity (RI)
Mean NI differed significantly between classes of RI (KW, P = 1.38 × 10 −8 ). Lessvisited areas had higher mean naturalness than intensely visited areas (Table 3). We found a mean naturalness of 129.6 (sd = 7.35) for very low RI class; 128.6 (sd = 7.50) for low RI class; 128.1 (sd = 9.70) for moderate RI class; 127.3 (sd = 13.03) for high RI class and 125.5 (sd = 13.40) for very high RI class. Multiple comparison tests revealed that NI was significantly higher in classes of low recording intensity than in classes of high recording intensity. Except for very high/high, high/moderate, and very high/moderate pairwise combinations, all other combinations were significantly different (Figure 3 and Appendix D).  On the left, a map of the spatial distribution of the 5 RI classes, from very low to very high On the right, violin plots of mean NI among RI classes. Shapes of violin represent the kernel density of the NI data. Black points are the mean NI values given in bold type underneath. Significant differences are indicated by letters a-c. Non-significant (ns) differences (p > 0.05) between classes are denoted by bars.

Numbers of Birds Recorded Varied among Recording Intensity Classes and Observer Profiles
The number of bird records differed significantly among RI classes and observer profiles (X 2 = 205,254, p < 0.05). The standardized Pearson residuals show strong associations (both positive and negative) between RI and OP. The expert profile showed a strong positive association with the very high recording intensity class whereas the three other observer types (garden-watcher, beginner, and naturalist) showed strong negative relationship with this very high RI class. Conversely, the expert profile was negatively related to high and moderate RI classes, whereas the naturalist and beginner profiles were positively associated with these two classes. Garden-watchers were also positively related to the moderate and low recording intensity classes (Figure 4 and Appendix E). On the left, a map of the spatial distribution of the 5 RI classes, from very low to very high On the right, violin plots of mean NI among RI classes. Shapes of violin represent the kernel density of the NI data. Black points are the mean NI values given in bold type underneath. Significant differences are indicated by letters a-c. Non-significant (ns) differences (p > 0.05) between classes are denoted by bars.

Numbers of Birds Recorded Varied among Recording Intensity Classes and Observer Profiles
The number of bird records differed significantly among RI classes and observer profiles (X 2 = 205,254, p < 0.05). The standardized Pearson residuals show strong associations (both positive and negative) between RI and OP. The expert profile showed a strong positive association with the very high recording intensity class whereas the three other observer types (garden-watcher, beginner, and naturalist) showed strong negative relationship with this very high RI class. Conversely, the expert profile was negatively related to high and moderate RI classes, whereas the naturalist and beginner profiles were positively associated with these two classes. Garden-watchers were also positively related to the moderate and low recording intensity classes (Figure 4 and Appendix E).
(both positive and negative) between RI and OP. The expert profile showed a strong positive association with the very high recording intensity class whereas the three other observer types (garden-watcher, beginner, and naturalist) showed strong negative relationship with this very high RI class. Conversely, the expert profile was negatively related to high and moderate RI classes, whereas the naturalist and beginner profiles were positively associated with these two classes. Garden-watchers were also positively related to the moderate and low recording intensity classes (Figure 4 and Appendix E).

Figure 4.
Relationships between observer profiles and recording intensity in terms of numbers of bird records. Matrix of residuals from the Pearson's Chi-squared tests of association, using "corrplot" package in R. Blue indicates an over-representation, and red indicates an under-representation. Circle diameter is proportional to numbers of birds recorded for each observer profile/recording intensity class combination.

More Specialist Observers Record in Landscapes of Higher Naturalness Index
Depending on OP, NI varied in 400 m 2 landscapes neighboring bird records ( Figure 5). We found a mean neighborhood NI of 110. 33 Figure A4).

More Specialist Observers Record in Landscapes of Higher Naturalness Index
Depending on OP, NI varied in 400 m 2 landscapes neighboring bird records ( Figure  5). We found a mean neighborhood NI of 110. 33 Figure A4).
For the four OPs, we observed a majority of records in areas of intermediate naturalness. However, while garden-watchers and beginners recorded mostly in areas of low NI, these were the areas avoided by naturalists and experts. We found the differences in NI to be relatively small between naturalist and expert records (1.37) as well as between garden-watcher and beginner records (3.37). In contrast, differences in NI were strong between naturalist-expert and garden-watcher-beginner. The plots of the six pairs of differences as histograms with the initial sample mean and the confidence interval boundaries (p = 0.05) are given for information in Appendix F ( Figure  A5) Focusing on the most visited areas of the study site (high or very high recording intensity classes), we found that 4,465,633 records (corresponding to 80.86% of the total records) were concentrated in 5492 km 2 (corresponding to 17.32% of the study area). In these For the four OPs, we observed a majority of records in areas of intermediate naturalness. However, while garden-watchers and beginners recorded mostly in areas of low NI, these were the areas avoided by naturalists and experts.
We found the differences in NI to be relatively small between naturalist and expert records (1.37) as well as between garden-watcher and beginner records (3.37). In contrast, differences in NI were strong between naturalist-expert and garden-watcher-beginner. The plots of the six pairs of differences as histograms with the initial sample mean and the confidence interval boundaries (p = 0.05) are given for information in Appendix F ( Figure A5) Focusing on the most visited areas of the study site (high or very high recording intensity classes), we found that 4,465,633 records (corresponding to 80.86% of the total records) were concentrated in 5492 km 2 (corresponding to 17.32% of the study area). In these areas, we found that mean record neighborhood NI differed according to observer profile ( Figure 6). In the high RI class, we found the same pattern as at the scale of all RI classes combined, i.e., mean NI increased from garden-watchers to beginners to naturalists and then experts. However, in the very high RI class, it was the naturalists' records which had a higher mean neighborhood NI than the experts. The means of NI of the records of the four observer profiles are given for each RI class in Appendix F ( Figure A6).

Towards a Better Understanding of Observer Profiles
This study has built upon the framework proposed by [33], which uses clustering of temporal, behavioral metrics to distinguish the types of participant in environmental CS programs. We added spatial metrics and, as suggested by [28] and initiated by [7], we developed quality metrics based on indices of species specialization [49] and synanthropy [50] to better assess not only the numbers but also the types of bird species recorded by participants.
The four observer profiles we obtained are partially comparable with those of other studies. For example, the experts, who represented only 2.31% of the volunteers but 59.31% of records, could correspond to so-called "super users" described by [7]. Ref. [28] also found a repeated pattern across datasets and taxonomic groups of a few volunteers contributing many records and many volunteers contributing few records and this is referred to as the 80:20 rule [51]. In our study also, beginners contribute very few records but form the largest group of observers.
Using temporal metrics alone, [28] distinguished three volunteer profiles: "dabbler", "steady", and "enthusiast". By adding a "distance of recordings from home" spatial variable, [34] distinguished "one-session volunteers" who travelled least and committed the shortest amount of time, "long-term volunteers" who travelled furthest and committed the most time and "short-term volunteer", mid-way between the two other types. By introducing quality metrics in our study, we obtained classes that reflected not only differ- Figure 6. Violin plots of the mean NI of the four observer profiles (garden-watchers, beginners, naturalists, and experts), focusing only on high RI class (left) and very high RI class (right). Shapes of violin represent the kernel density of the NI data records.

Towards a Better Understanding of Observer Profiles
This study has built upon the framework proposed by [33], which uses clustering of temporal, behavioral metrics to distinguish the types of participant in environmental CS programs. We added spatial metrics and, as suggested by [28] and initiated by [7], we developed quality metrics based on indices of species specialization [49] and synanthropy [50] to better assess not only the numbers but also the types of bird species recorded by participants.
The four observer profiles we obtained are partially comparable with those of other studies. For example, the experts, who represented only 2.31% of the volunteers but 59.31% of records, could correspond to so-called "super users" described by [7]. Ref. [28] also found a repeated pattern across datasets and taxonomic groups of a few volunteers contributing many records and many volunteers contributing few records and this is referred to as the 80:20 rule [51]. In our study also, beginners contribute very few records but form the largest group of observers. Using temporal metrics alone, ref. [28] distinguished three volunteer profiles: "dabbler", "steady", and "enthusiast". By adding a "distance of recordings from home" spatial variable, ref. [34] distinguished "one-session volunteers" who travelled least and committed the shortest amount of time, "long-term volunteers" who travelled furthest and committed the most time and "short-term volunteer", mid-way between the two other types. By introducing quality metrics in our study, we obtained classes that reflected not only differences in temporal and spatial engagement (e.g., sedentary garden-watchers versus mobile experts) but also differences relating to observer expertise (e.g., beginners versus naturalists).
Such groupings are useful in the case of CS programs with numerous and diverse participants, such as opportunistic bird CS programs. However, for other taxonomic groups and programs, dividing participants into clear types may not be appropriate. For example, ref. [7] suggested that volunteers in an opportunistic butterfly recording CS program could be classed along four continuous scales of recording intensity, spatial extent, recording potential, and rarity recording. Another important limit of profiling is that it is static whereas we can assume that volunteers will move from one profile to another over time. It is possible that beginners or garden-watchers will develop their skills through practice, observe more species, less generalist and synanthropic species, ultimately evolving to become naturalists or possibly experts.
Despite these limits and as demonstrated by [28], observer profiling can be used in comparisons between CS programs but also, as we have seen in this study, for analysis within a single CS program. It could also be useful for long-term programs that are experiencing a change in participant profiles. As mentioned by [31], there has been a noticeable shift toward people collecting and providing data with limited training and little or no direct social interaction with experts or other citizen scientists, before submitting them via an online reporting platform. In addition, during pandemic lockdowns certain forms of participation in CS increased, generating more data and more data heterogeneity. These recent and rapid transformations have sometimes taken managers of long-running CS programs by surprise, and they need tools to manage and analyze these new data. In this study we focused on how observer profiling can be used to better understand spatial biases in recording effort and intensity.

Spatial Bias in CS
Recording Intensity at Regional Scale: Influence of Landscape Naturalness At regional level, we observed that well-known spatial patterns explained variation in recording effort. Our results showed that the most frequented sites in terms of recording intensity (RI) had lower average naturalness index (NI) (lower than less frequented sites and even lower than the overall regional average). We also found that the most frequented sites (cells with high or very high RI) were close to towns and easy to access (i.e., close to roads) (see map, Figure 3). This result is similar to studies by [12,19,24,26,27], conducted at a range of spatial scales, in several regional contexts, and involving a diversity of CS programs (taxonomic groups, observer profiles). The tendency toward intense recording activity in accessible and urbanized areas therefore constitutes a relatively well-documented bias.

Relationships between Observer Profiles and Landscape Naturalness at Finer Scale
While regional scale recording intensity was mostly driven by spatial patterns of proximity and accessibility, a closer look at the scale of record neighborhood revealed different preferences in landscape naturalness among observer profiles. Thus, although all observer profiles mainly visited the same broad scale areas, they did not frequent the same sites within those areas. In areas of high or very high RI, experts and naturalists tended to seek areas of higher NI which was not the case for garden-watchers and beginners. The different profiles therefore appeared to complement each other in terms of spatial coverage. This result is of particular importance as it could be used by organizers of CS programs wishing to improve spatial homogeneity in recording.

Implications and Future Directions
Increasingly, knowledge of observer profiles is being used to improve CS data quality for analytical purposes. Commonly, data are filtered to reduce bias and improve subsequent predictive modelling of species distributions [52]. Correcting bias linked to variation in participant expertise has for example successfully been used to improve species distribution models based on CS data [53].
Knowledge of observer profiles could also help CS organizers to guide particular types of observer with known observing habits and spatial preferences toward areas which may be under-recorded. Completing existing CS databases with a more guided approach is often necessary before data can be fully exploited [52]. Such knowledge may also be used to adjust future recruitment strategies toward particular observer profiles, depending on program objectives.
It could be interesting to further analyze the reasons for observer profile preferences. A considerable body of literature is concerned with human perceptions of naturalness [38,[54][55][56][57]. Interdisciplinary collaboration with the human and social sciences should lead to a deeper understanding of landscape preferences and related bird recording habits [58].
Since observer profiles in our study integrated spatial metrics, we have seen that part of the bias in recording is related to spatial distribution of naturalness gradient in our study area. The Pays-de-la-Loire region is relatively heavily urbanized with little or no wilderness. The population density is of 120 inhabitants/km 2 , the road network is dense, and rural landscapes have been transformed by agriculture. In such contexts, the naturalness index (NI) is used to quantify a gradient of human modification the NI [59,60], and can enable us to highlight even subtle variation in human pressure [40] but no observer is ever completely remote from towns or roads. In other regional contexts landscape types, areas of high NI and species richness may be under-recorded, even by experts, due to very low accessibility [29]. CS programs should consider landscape configuration and areas of probable under or over-sampling, at the program planning stage.

Data Availability Statement:
The CS data that support our research findings are the property of LPO, details are available from the corresponding author on request.

Acknowledgments:
The authors warmly thank all the observers and database management structures that transmitted observations: LPO Vendée, LPO Loire-Atlantique, Groupe Naturaliste de Loire-Atlantique, Bretagne Vivante, LPO Anjou, LPO Sarthe and Mayenne Nature Environement. We are also grateful to Claudie Auffray and Christiane Paillat for their daily administrative support.

Conflicts of Interest:
The authors declare no conflict of interest.

Appendix A
We first performed exploratory analyses (correlations and principal components analyses (PCA)) on a set of 22 metrics (Table A1, Figure A1). Following these analyses, we retained 11 metrics (see Table 1). Mean rarity of recorded species Following a notation of the rarity of the species present in the study area, mean of the rarity of the observations was calculated per volunteer

C10
Percentage of records of "rare" and "very rare" species Number of records of "common" + "very common" species × 100/total records We then applied the ascending hierarchical cluster analysis (AHC) to define observer profiles ( Figures A2 and A3). We retained the second best clustering result (four groups) so as to be able to distinguish and analyze a reasonable number of different profiles. We then applied the ascending hierarchical cluster analysis (AHC) to define observer profiles ( Figures A2 and A3). We retained the second best clustering result (four groups) so as to be able to distinguish and analyze a reasonable number of different profiles.      Appendix F Figure A4. Distribution of NI means with confidence interval boundaries (p = 0.05) from the 999 bootstrapping resampling for the four observer profiles.  Appendix F Figure A4. Distribution of NI means with confidence interval boundaries (p = 0.05) from the 999 bootstrapping resampling for the four observer profiles.