Mapping Historical Data : Recovering a Forgotten Floristic and Vegetation Database for Biodiversity Monitoring

Multitemporal biodiversity data on a forest ecosystem can provide useful information about the evolution of biodiversity in a territory. The present study describes the recovery of an archive used to determine the main Schmid’s vegetation belts in Trento Province, Italy. The archive covers 20 years, from the 1970s to the 1990s. During the FORCING project (an Italian acronym for Cingoli Forestali, i.e., forest belts), a comprehensive process of database recovering was executed, and missing data were digitized from historical maps, preserving paper-based maps and documents. All of the maps of 16 forest districts, and the related 8000 detected transects, have been georeferenced to make the whole database spatially explicit and to evaluate the possibility of performing comparative samplings on up-to-date datasets. The floristic raw data (approximately 200,000 specific identifications, including frequency indices) still retain an important and irreplaceable information value. The data can now be browsed via a web-GIS. We provide here a set of examples of the use of this type of data, and we highlight the potential and the limits of the specific dataset and of the historical database, in general.


Introduction
Forest ecosystems support a great proportion of the world's terrestrial taxa [1,2].In particular, they harbor a notably high species diversity of many taxa, including birds, invertebrates, and soil microbes [3][4][5][6].Moreover, their ecosystem services are essential for supporting human life and activities [7].The long-term maintenance of forest biodiversity is considered a key task of global guiding significance and has long been included in many national forest management plans and international policy actions.Forest biodiversity conservation is also currently considered for forests where the production of timber and pulpwood is the main goal [8][9][10].
To assess the state of naturalness of a forest, and the level of species diversity supported by the forest, it is essential to provide detailed information about the distribution of the plant species that form the forest community [11], as well as the historical trends defined by the occurrence of these species up to the actual establishment of the community [12][13][14][15][16][17][18].For the same reasons and for the same purposes, it is also essential to provide detailed information on the dynamics of the landscape, with particular emphasis on the vegetation component [19][20][21].Consequently, the study of a whole vegetation community, or of one or more "umbrella" species, can offer a methodological approach for evaluating the quality and the state of the entire ecosystem [22][23][24].Many previous studies have focused on the dynamics of invasive plants [25].Recently, vegetation studies were also focused on the traditional ecological knowledge transformation in the context of the study of the cultural evolution of local alpine communities [21].
Many research programs have studied flora and vegetation biodiversity.These programs have acquired field data and conducted floristic inventories for many years.Currently, monitoring programs can also exploit technological progress in data management and new remote sensing tools that facilitate the collection of ecosystem data from extensive areas or from areas that are difficult to access.However, it is also the case that historical floristic data are essential to allow effective comparisons between previous and present situations [12,26].The information provided by such historical approaches and perspectives cannot be entirely replaced by remote sensing techniques.In the context of vegetation and floristic studies, with particular emphasis on the integration of data from various sources, historical floristic data are progressively becoming more important [27].It is certainly the case that floristic data collection is becoming increasingly expensive.For this reason, scarce data-gathering resources are primarily invested in areas and studies in cases where there is a greater risk of biodiversity loss.The consequence of this selective allocation is that floristic monitoring is reduced in most, or all other, regions.However, when we try to paint a portrait of long-term vegetation evolution in an area that is currently monitored, we often find that there are no historical floristic data to support comparisons.As a result, there is no way to obtain information on the important temporal component of the overall pattern of the vegetation and of its evolution.If such information were available, it would facilitate the identification of the changes and the anthropic or natural driving forces responsible for the current conditions [13,17,18].
In this context, the recovery of historical floristic and vegetation databases can represent a very important component in the long run.
The FORCING project (an Italian acronym for Cingoli Forestali, i.e., forest belts) is a project financed by the Province of Trento to support the recovery of an old archive that contained vegetation data collected between 1978 and 1991 in Trentino (Italy).The original sampling project was financed in the 1970s by the Edmund Mach Foundation (Province of Trento) to characterize the so-called Schmid's vegetation belts in terms of species presence.The original project involved approximately 15 people, including botanists, forest guards and officials, technical staff, and land surveyors.The project identified ca.200,000 vascular plants, grasses, trees, and shrubs belonging to approximately 1300 floral species present at various elevations in the province.An estimate of the total cost of the project in 2015 is approximately 1,350,000 euros.
Schmid's vegetation belts were proposed in a systematic formulation by the Swiss botanist Emil Schmid [28].The Schmid's belts are the basic units constructed on biocoenotic basis to describe the vegetation and include any vascular plant species with equal or similar natural area, both horizontally and vertically, namely the species with similar climatic and soil requirements.The method, based on complete census of plant species at each study site, generates data on species composition and relative abundance of each species.
This systematic methodology, absolutely unique in its proposition and descriptive accuracy, was further optimized by Schmid [29] and Famiglietti and Schmid [30], and in such profile exerted a significant influence on some currents of the Italian geobotany school [31].
The asserted current definition of vegetation belt corresponds to that proposed by Pignatti, which speaks of a "portion of space, located in a mountainous area, in which there are similar bioclimatic conditions and therefore has the same potential from the vegetation point of view" [32].
The identification of the forest belts had a specific forest management meaning because it was believed that Schmid's belts could become a permanent component of the forest assessment planning process for the years to come [31,33].Information on Schmid's belts furnished some indications of the potential vegetation of the area being analyzed.Through the years, however, the management and ecological value of Schmid's belt information and approaches were revised, and these information sources and methods lost their potential as a management information tool.Nevertheless, the enormous sampling effort that these studies entailed furnished a substantially large vegetation database that long lay forgotten in the archives of Fondazione Edmund Mach [34].
This database represents a unique historical basis for assessing vegetation change occurring over the past 20-40 years.This information resource is potentially useful for both mapping historical floristic information and for identifying the effects of envisaged climate change events and processes and their impacts on the human activities and the ecosystem services of the Trentino region's forests.
The main aims of the present paper are as follows: (i) to present and valorize an important historical archive; (ii) to perform selected tests to check the integrity and congruence of the archive; (iii) to explore the degree of completeness of the archive and to evaluate its usefulness as a support tool for scientific studies or nature conservation plans; and (iv) to present a web-GIS platform where the data and the historical maps can be browsed, along with some relevant technical details.

Study Area
The study area is the Province of Trento in the northeastern Italian Alps, a mountain region of approximately 6212 km 2 (Figure 1).The territory is characterized by strong variability in forest species composition due to the presence of different geomorphological conditions and elevations.The forest varies from Mediterranean vegetation, where forests of Quercus ilex occur, up to alpine areas where Pinus mugo and alpine herbaceous vegetation represent the principal forest cover.
To assess the spatial distribution of species richness and to standardize the subsequent analysis, we decided to divide the study area into a grid of 5 km ˆ5 km cells, following the same cut procedure adopted for the Trentino Flora Inventory [35].
ISPRS Int.J. Geo-Inf.2016, 5, 100 3 of 14 and ecological value of Schmid's belt information and approaches were revised, and these information sources and methods lost their potential as a management information tool.Nevertheless, the enormous sampling effort that these studies entailed furnished a substantially large vegetation database that long lay forgotten in the archives of Fondazione Edmund Mach [34].This database represents a unique historical basis for assessing vegetation change occurring over the past 20-40 years.This information resource is potentially useful for both mapping historical floristic information and for identifying the effects of envisaged climate change events and processes and their impacts on the human activities and the ecosystem services of the Trentino region's forests.
The main aims of the present paper are as follows: (i) to present and valorize an important historical archive; (ii) to perform selected tests to check the integrity and congruence of the archive; (iii) to explore the degree of completeness of the archive and to evaluate its usefulness as a support tool for scientific studies or nature conservation plans; and (iv) to present a web-GIS platform where the data and the historical maps can be browsed, along with some relevant technical details.

Study Area
The study area is the Province of Trento in the northeastern Italian Alps, a mountain region of approximately 6212 km 2 (Figure 1).The territory is characterized by strong variability in forest species composition due to the presence of different geomorphological conditions and elevations.The forest varies from Mediterranean vegetation, where forests of Quercus ilex occur, up to alpine areas where Pinus mugo and alpine herbaceous vegetation represent the principal forest cover.
To assess the spatial distribution of species richness and to standardize the subsequent analysis, we decided to divide the study area into a grid of 5 km × 5 km cells, following the same cut procedure adopted for the Trentino Flora Inventory [35].Trentino Flora Inventory is a huge and reliable data base updated and maintained by Museo Civico di Rovereto using both repeated field campaigns and historical herbarium samples.It is also the main source of floristic data in the region and published statistical information about species Trentino Flora Inventory is a huge and reliable data base updated and maintained by Museo Civico di Rovereto using both repeated field campaigns and historical herbarium samples.It is also the main source of floristic data in the region and published statistical information about species absence, presence, and frequency that are calculated according to the 5-km grid.Therefore, the choice to use the same cut for the FORCING project database allows us to compare both raw data and processing results with Trentino Flora Inventory.The summed number of species present in each cell was mapped out to obtain the species richness map.

Original Field Sampling
In the original sampling activity the Trentino region was divided into 16 homogeneous zones, each corresponding to a morphological sub-valley system (Figure 2).Simultaneously, the region was divided into five altitudinal belts.Usually, the sampling activity was conducted zone by zone by one team composed of two field workers.The entire sampling campaign required approximately 13 years.Within each belt, individuated linear segments were defined to traverse the mountainsides in a direction perpendicular way to the contour lines.These segments were termed "series".The field workers, walking through each series, defined a sampling area on the terrain that covered every 50 m of altitudinal gap.The result was a transect 10 m in length and 1 m in width.Along this transect, all floristic species were collected along with an indication of their abundance, as well as other information related to the morphological/geological conditions of the sampling site.No herbarium samples of the species were collected, or at least conserved; therefore, we must rely on the original recordings and on the data coming from the database.To recover further information we met one of the former sampler of the 1970s-1990s (now retired) we had the possibility to discuss some issues with him and we tested that they were well trained on species identification.
ISPRS Int.J. Geo-Inf.2016, 5, 100 4 of 14 absence, presence, and frequency that are calculated according to the 5-km grid.Therefore, the choice to use the same cut for the FORCING project database allows us to compare both raw data and processing results with Trentino Flora Inventory.The summed number of species present in each cell was mapped out to obtain the species richness map.

Original Field Sampling
In the original sampling activity the Trentino region was divided into 16 homogeneous zones, each corresponding to a morphological sub-valley system (Figure 2).Simultaneously, the region was divided into five altitudinal belts.Usually, the sampling activity was conducted zone by zone by one team composed of two field workers.The entire sampling campaign required approximately 13 years.Within each belt, individuated linear segments were defined to traverse the mountainsides in a direction perpendicular way to the contour lines.These segments were termed "series".The field workers, walking through each series, defined a sampling area on the terrain that covered every 50 m of altitudinal gap.The result was a transect 10 m in length and 1 m in width.Along this transect, all floristic species were collected along with an indication of their abundance, as well as other information related to the morphological/geological conditions of the sampling site.No herbarium samples of the species were collected, or at least conserved; therefore, we must rely on the original recordings and on the data coming from the database.To recover further information we met one of the former sampler of the 1970s-1990s (now retired) we had the possibility to discuss some issues with him and we tested that they were well trained on species identification.

The Original Database
The archive was originally built on a DEC PDP/11 database management system platform, and subsequently saved in a series of tables in ASCII format.These tables contained all of the information regarding the species collected, with additional information about the environmental conditions of the sampling sites, e.g., geology, morphology, and canopy cover.The data were not geolocalized through the use of a coordinate system, but were manually drawn in the form of a series of paper maps-specifically, 16 maps-each corresponding to a zone (see Table 1).The data were originally mapped at a scale of 1:25,000 and then reported in the final maps at a scale of 1:100,000.The ASCII

The Original Database
The archive was originally built on a DEC PDP/11 database management system platform, and subsequently saved in a series of tables in ASCII format.These tables contained all of the information regarding the species collected, with additional information about the environmental conditions of the sampling sites, e.g., geology, morphology, and canopy cover.The data were not geolocalized through the use of a coordinate system, but were manually drawn in the form of a series of paper maps-specifically, 16 maps-each corresponding to a zone (see Table 1).The data were originally mapped at a scale of 1:25,000 and then reported in the final maps at a scale of 1:100,000.The ASCII tables do not permit the user to process any query or to overlay the table with other types of data.It has been necessary to recover the database operating on the existing data structure to scan the paper maps in digital form, combining and mixing the data from the 16 zones and restoring and reorganizing the architecture of the original database, eliminating any redundancy.In addition, the sampling points have been georeferenced through the use of a Python script that automatically overlaid them with altitude, slope, and aspect maps, associating them with geographic coordinates to each point.The final result is a new database in SQlite3/Spatialite format, with the possibility of inquiring and overlaying these data structures with a database client and GIS software.In the context of data sharing and dissemination [36] a web-GIS platform has been developed within the project.Through this platform, it is possible to explore the data stored in the new database and to view the distribution of the species collected.

Software Used
The scanned maps were imported in Grass GIS version 6.4 [37].The maps were georeferenced in the reference system WGS84 UTM Zone 32N through the use of the module of GIS Grass georectify, and exported in GeoTIFF format.
The original scale of the maps was 1:100,000.
We then proceeded to the manual digitization of the series reported in the maps using Qgis 2.0 [38].
The location of the sampling points (transects) was performed automatically through a script developed in Python language using the pysqlite modules for the interface to the sqlite3 database and the Python bindings for the gdal and ogr libraries for the manipulation of raster and vector geodata (Figure 3).
The new database was developed using the relational database management system sqlite3 with the geographical extension spatialite version 4.2.0.
The development of the web-GIS, is hosted on servers in the Fondazione Edmund Mach and is accessible at http://meteogis.fmach.it/forcing/.
The web-GIS was developed using PHP version 5.5 for the server side with the use of Mapserver version 6.0.2 for the provision of geospatial data.The client side has been developed in Javascript language using GeoExt libraries (version 3.4.1) to build the client interface for the web-GIS.

Cluster Analysis
To evaluate the completeness of the database from a vegetation/floristic point of view, a classification of the cells on the basis of species composition has been performed.Such methodology was realized with a hierarchical agglomerative clustering (HAC) approach.The HAC approach has been widely used in vegetation studies [39][40][41][42][43].It produces a nested hierarchy of groups of similar objects, based on a matrix containing the pairwise distances between all objects [44].Specifically, a cluster analysis based on the relative Euclidean (chord) distance with Ward's clustering algorithm was performed in this study.
The existence of significant differences in the floristic composition between clusters was contrasted using a Multiple Response Permutation Procedure (MRPP) test [45].The MRPP test provides a significant difference test between two or more groups of samples.A significant P-value (p < 0.05) indicates that the vegetation groups are significantly different in terms of species composition.
Subsequently, for each group, the indicator value of species was calculated using the Dufrene-Legendre Indicator Species Analysis [46] to characterize the groups in terms of the most representative species.This approach enables the researcher to evaluate and compare the floristic archive in qualitative terms, comparing these groups with existing group information based on bibliographic data on the main species assemblages found in Trentino.

Rarefaction Analysis
Comparing raw species counts for various assemblages and evaluating a seemingly diagnostic ratio involving species counts and numbers of sampled will quite generally produce misleading results due to differences in terms of sampling effort [47].It is well known that increasing the surface area sampled (and, thus, increasing the number of samples acquired) increases the probability of obtaining new species, an elementary application of the fundamental ecological species-area relationship [48].A robust way to compare diversity between different sampling surfaces and

Cluster Analysis
To evaluate the completeness of the database from a vegetation/floristic point of view, a classification of the cells on the basis of species composition has been performed.Such methodology was realized with a hierarchical agglomerative clustering (HAC) approach.The HAC approach has been widely used in vegetation studies [39][40][41][42][43].It produces a nested hierarchy of groups of similar objects, based on a matrix containing the pairwise distances between all objects [44].Specifically, a cluster analysis based on the relative Euclidean (chord) distance with Ward's clustering algorithm was performed in this study.
The existence of significant differences in the floristic composition between clusters was contrasted using a Multiple Response Permutation Procedure (MRPP) test [45].The MRPP test provides a significant difference test between two or more groups of samples.A significant P-value (p < 0.05) indicates that the vegetation groups are significantly different in terms of species composition.
Subsequently, for each group, the indicator value of species was calculated using the Dufrene-Legendre Indicator Species Analysis [46] to characterize the groups in terms of the most representative species.This approach enables the researcher to evaluate and compare the floristic archive in qualitative terms, comparing these groups with existing group information based on bibliographic data on the main species assemblages found in Trentino.

Rarefaction Analysis
Comparing raw species counts for various assemblages and evaluating a seemingly diagnostic ratio involving species counts and numbers of sampled will quite generally produce misleading results due to differences in terms of sampling effort [47].It is well known that increasing the surface area sampled (and, thus, increasing the number of samples acquired) increases the probability of obtaining new species, an elementary application of the fundamental ecological species-area relationship [48].A robust way to compare diversity between different sampling surfaces and estimate the potential number of species found, and, at the same time, the completeness of the archive, is to calculate sample-based rarefaction curves [49,50].
The rarefaction technique permits the researcher to evaluate and compare the species richness of various sampling sites, effectively adjusting for differences in different sampling effort.Furthermore, observations of the shape of the rarefaction curve yield implications and inferences regarding the level of completeness of the floristic archive [39].This technique was applied to the 16 zones to compare them and individuate any differences in species diversity in the light of the substantial differences in sampling effort.
All the statistical procedures were conducted using R software [51].All of the geoprocessing operations were performed using Grass and Quantum GIS software.

Results and Discussion
The digitalization and geolocalization of the samples produced 8148 records.Of these records, 7059 were geolocalized and reported in map form (86.63%).The number of linear transects geolocalized was 517 of 609 (83.52%), and the total number of species detected was 1285 species, for a total number of surveys equal to 190,761.The estimated distance covered by the entire sampling team was 65,536 m.
Table 1 shows the total number of series sampled and the number of species detected for each zone.The map in Figure 4 shows the total number of samples overlaid with the original paper maps.
The rarefaction technique permits the researcher to evaluate and compare the species richness of various sampling sites, effectively adjusting for differences in different sampling effort.Furthermore, observations of the shape of the rarefaction curve yield implications and inferences regarding the level of completeness of the floristic archive [39].This technique was applied to the 16 zones to compare them and individuate any differences in species diversity in the light of the substantial differences in sampling effort.
All the statistical procedures were conducted using R software [51].All of the geoprocessing operations were performed using Grass and Quantum GIS software.

Results and Discussion
The digitalization and geolocalization of the samples produced 8148 records.Of these records, 7059 were geolocalized and reported in map form (86.63%).The number of linear transects geolocalized was 517 of 609 (83.52%), and the total number of species detected was 1285 species, for a total number of surveys equal to 190,761.The estimated distance covered by the entire sampling team was 65,536 m.
Table 1 shows the total number of series sampled and the number of species detected for each zone.The map in Figure 4 shows the total number of samples overlaid with the original paper maps.
The web-GIS, hosted within servers provided by the Edmund Mach Foundation, allows for the exploration and overlay of the database content with several geographic layers, i.e., the digitized series map, digital elevation model, and the OpenStreetMap topographic layer.Furthermore, it is possible to view the richness overview map, organized by 5-km square cells or by municipality districts.Data exploration is possible in two ways: (i) after enabling the layer of the richness divided by cells, the selection function makes it possible to select one or more cells.Then, by clicking on the species list function, a new window will appear with the list of species detected inside the selected area; and (ii) clicking in the "species search" button allows the user to select a species with the aid of an auto-complete field and to view the geographical distribution.Six clusters were identified (Figure 5), with the first cluster composed of only one cell.An MRPP analysis (based on 999 permutations) showed statistical significance of the distance between the The web-GIS, hosted within servers provided by the Edmund Mach Foundation, allows for the exploration and overlay of the database content with several geographic layers, i.e., the digitized series map, digital elevation model, and the OpenStreetMap topographic layer.Furthermore, it is possible to view the richness overview map, organized by 5-km square cells or by municipality districts.Data exploration is possible in two ways: (i) after enabling the layer of the richness divided by cells, the selection function makes it possible to select one or more cells.Then, by clicking on the species list function, a new window will appear with the list of species detected inside the selected area; and (ii) clicking in the "species search" button allows the user to select a species with the aid of an auto-complete field and to view the geographical distribution.
Six clusters were identified (Figure 5), with the first cluster composed of only one cell.An MRPP analysis (based on 999 permutations) showed statistical significance of the distance between the clusters with an observed delta value equal to 0.4964, an expected delta value equal to 0.5782, and a p-value less than 0.001.Table 2 summarizes the results of the cluster analysis, with the species most represented individuated by the Species Analysis Indicator.The cluster analysis identifies macro differences among groups following an ecological and climatic gradient.Clusters 2 and 6 identify ecological conditions influenced by the warmer air coming from Garda Lake and the Po plain (in the southern part of the province) or where local microclimates are warmer (see Figure 1).These groups are localized in the southern part of Trentino or in those areas of Trentino where warm air can ascend via the Adige Valley.Group 4 identifies the main vegetation of central Trentino, mountain forest dominated by spruce, while groups 5 and 3 highlight the more alpine area that is strictly unaffected by southern Mediterranean influence.Cluster 1 identifies a single cell and appears to be idiosyncratic, most likely due to strong anthropic influence.These results support and confirm the species distribution in Prosser et al. [35].
ISPRS Int.J. Geo-Inf.2016, 5, 100 8 of 14 clusters with an observed delta value equal to 0.4964, an expected delta value equal to 0.5782, and a p-value less than 0.001.Table 2 summarizes the results of the cluster analysis, with the species most represented individuated by the Species Analysis Indicator.The cluster analysis identifies macro differences among groups following an ecological and climatic gradient.Clusters 2 and 6 identify ecological conditions influenced by the warmer air coming from Garda Lake and the Po plain (in the southern part of the province) or where local microclimates are warmer (see Figure 1).These groups are localized in the southern part of Trentino or in those areas of Trentino where warm air can ascend via the Adige Valley.Group 4 identifies the main vegetation of central Trentino, mountain forest dominated by spruce, while groups 5 and 3 highlight the more alpine area that is strictly unaffected by southern Mediterranean influence.Cluster 1 identifies a single cell and appears to be idiosyncratic, most likely due to strong anthropic influence.These results support and confirm the species distribution in Prosser et al. [35].The species richness map overlain with a hill-shaded layer is shown in Figure 6.

Rhododendron hirsutum
The species richness map overlain with a hill-shaded layer is shown in Figure 6.The comparison between the zones shows disparities both in terms of the number of species detected (ranging from a minimum of 245 species of zone 2 to a maximum of 606 species of zone 7) and sampling effort (minimum detected in the zone 2 with 245 reliefs and a maximum of 1089 reliefs in zone 8/9) (see Table 1).Observation of the rarefaction curves shown in Figure 7 serves to highlight that, apart from zones 1, 2, 12, and 15, the pattern associated with the zones of interest is characterized by a number of samples less than 400.Then, ideally cutting the curves to the same number of samples, the state described is different from that suggested by the analysis of raw data shown in Table 1.Zone 8/9, which is characterized by a high number of species and high number of samples, shows an estimated average species richness that falls approximately in the middle compared to the other areas.The zones with greater estimated average species richness are zones 7 and 13.Zone 7 is located near Garda Lake, hosts different vegetation belts and is an area where the local biodiversity is extremely high.Zone 13 is located at the confluence of the Non Valley and Adige Valley.In this case, as well, the altitude shift is very significant.On the contrary, the zones with the lowest average estimated species richness are the zone 5 and 3.These areas are characterized by more homogeneous forest vegetation composition.Additionally, in this case, a comparison with Flora Trentina confirms the trends and situation [35].
It is necessary to remember that the sampling was conducted out to produce Schmid's vegetation belts and was not intended as a complete inventory of forest flora.Considering the shape of the rarefaction curves, it is also possible to note when the curves reach a plateau, evidence that the floristic inventory is nearly complete.Considering the peculiar sampling approach adopted for this work in 1970, we did not expect to reach the plateau in any of the zones.Nevertheless, the rarefaction analysis highlights that zone 16 shows a tendency to reach a final asymptote.This finding shows that the data for certain areas can be considered relatively complete and have the potential for broader The comparison between the zones shows disparities both in terms of the number of species detected (ranging from a minimum of 245 species of zone 2 to a maximum of 606 species of zone 7) and sampling effort (minimum detected in the zone 2 with 245 reliefs and a maximum of 1089 reliefs in zone 8/9) (see Table 1).Observation of the rarefaction curves shown in Figure 7 serves to highlight that, apart from zones 1, 2, 12, and 15, the pattern associated with the zones of interest is characterized by a number of samples less than 400.Then, ideally cutting the curves to the same number of samples, the state described is different from that suggested by the analysis of raw data shown in Table 1.Zone 8/9, which is characterized by a high number of species and high number of samples, shows an estimated average species richness that falls approximately in the middle compared to the other areas.The zones with greater estimated average species richness are zones 7 and 13.Zone 7 is located near Garda Lake, hosts different vegetation belts and is an area where the local biodiversity is extremely high.Zone 13 is located at the confluence of the Non Valley and Adige Valley.In this case, as well, the altitude shift is very significant.On the contrary, the zones with the lowest average estimated species richness are the zone 5 and 3.These areas are characterized by more homogeneous forest vegetation composition.Additionally, in this case, a comparison with Flora Trentina confirms the trends and situation [35].
It is necessary to remember that the sampling was conducted out to produce Schmid's vegetation belts and was not intended as a complete inventory of forest flora.Considering the shape of the rarefaction curves, it is also possible to note when the curves reach a plateau, evidence that the floristic inventory is nearly complete.Considering the peculiar sampling approach adopted for this work in 1970, we did not expect to reach the plateau in any of the zones.Nevertheless, the rarefaction analysis highlights that zone 16 shows a tendency to reach a final asymptote.This finding shows that the data for certain areas can be considered relatively complete and have the potential for broader possible use.Generally speaking, the results of the rarefaction analysis produce trends comparable with other bibliographic sources showing an increasing dynamic without reaching the asymptote [52,53].Furthermore, the general distribution of the species richness in the Province of Trento agrees with the overall species richness distribution highlighted by other papers that use the same 5 × 5 km grid [54][55][56][57].The altitudinal effect of the grid is also clear and is the same as that which can be observed in Prosser [35,54].
Although the 5-km grid was useful to carry out the cited comparisons with the Trentino Flora Inventory [35] it must be said that the dimension of the grid may influence the number of plant communities taken into account in each cell.On the other side since the majority of Trentino region is located in relatively homogeneous mountain areas, this effect is limited to some specific areas.Several other studies, also performed in a heterogeneous environment, have used a similar grid to analyze plant biodiversity [58][59][60].Furthermore, our choice to use such a grid size was also because it fully overlaps with the data provided by the Forest Service of the Province of Trento, from where we could infer several other soil and dendrometrical parameters.
Therefore, the main results of this comparison confirm the accuracy and quality of the original sampling and strongly suggest the possibility to use some data for more complete analysis, although limited to forest environments.
The high species richness detected in the valley areas, or in high stressed areas is consistent with what has been amply demonstrated in alpine region, i.e., in the peak areas are strongly influenced by the extreme climatic conditions that operate by a filter in the number of species.Furthermore the decrease of micro-habitat, together with periodic stress input produce a positive effect on the overall number of species thanks to the introduction of pioneer and non-native species [61].

Conclusions
In this study, we recovered a substantial archive of forest floristic data and maps and made it available to the scientific community via a freely accessible web-GIS.The number of records contained in the archive and the preliminary investigations showed that the archive has great potential for use in forest studies.The simple floristic data analysis we performed shows that floristic Generally speaking, the results of the rarefaction analysis produce trends comparable with other bibliographic sources showing an increasing dynamic without reaching the asymptote [52,53].Furthermore, the general distribution of the species richness in the Province of Trento agrees with the overall species richness distribution highlighted by other papers that use the same 5 ˆ5 km grid [54][55][56][57].The altitudinal effect of the grid is also clear and is the same as that which can be observed in Prosser [35,54].
Although the 5-km grid was useful to carry out the cited comparisons with the Trentino Flora Inventory [35] it must be said that the dimension of the grid may influence the number of plant communities taken into account in each cell.On the other side since the majority of Trentino region is located in relatively homogeneous mountain areas, this effect is limited to some specific areas.Several other studies, also performed in a heterogeneous environment, have used a similar grid to analyze plant biodiversity [58][59][60].Furthermore, our choice to use such a grid size was also because it fully overlaps with the data provided by the Forest Service of the Province of Trento, from where we could infer several other soil and dendrometrical parameters.
Therefore, the main results of this comparison confirm the accuracy and quality of the original sampling and strongly suggest the possibility to use some data for more complete analysis, although limited to forest environments.
The high species richness detected in the valley areas, or in high stressed areas is consistent with what has been amply demonstrated in alpine region, i.e., in the peak areas are strongly influenced by the extreme climatic conditions that operate by a filter in the number of species.Furthermore the decrease of micro-habitat, together with periodic stress input produce a positive effect on the overall number of species thanks to the introduction of pioneer and non-native species [61].

Conclusions
In this study, we recovered a substantial archive of forest floristic data and maps and made it available to the scientific community via a freely accessible web-GIS.The number of records contained in the archive and the preliminary investigations showed that the archive has great potential for use in forest studies.The simple floristic data analysis we performed shows that floristic data are congruent with the Trentino forests' vegetation features and can be retained as reliable even if they cannot be considered a complete inventory of the Trentino forest flora because the focus of the original sampling was on forest vegetation.In some areas, the sampling appears to have attained a higher level of completeness.The data could be used to compare past floristic composition with present composition, to produce scenarios of future biodiversity shifts due to climate change, or to compare different regions with similar floristic composition [62].
Through the geolocalization of the transects, it could be possible to compare the present vegetation features and evaluate some of the temporal patterns of vegetation (e.g., new species, invasive species, changes in land use), and it could also be possible to resample near the same sites.
The data could be useful in the context of research projects, as well as in planning and evaluating projects for nature conservation.
The use of historical floristic databases must always take into account the purposes of the original sampling, which can limit their utility for particular purposes; nevertheless their value is immense, especially if, as in the present case, the floristic data are georeferenced and can be compared with the present state of the vegetation, can be considered more complete, and their use could be potentially wider.

Figure 2 .
Figure 2. Territorial division of the Province of Trento in 16 zones.

Figure 2 .
Figure 2. Territorial division of the Province of Trento in 16 zones.

14 Figure 3 .
Figure 3. Particular of the historical map overlaid with the sampling points recovered and georeferenced in this work.

Figure 3 .
Figure 3. Particular of the historical map overlaid with the sampling points recovered and georeferenced in this work.

Figure 4 .
Figure 4. Distribution of the transects and sampling points around the entire study area.

Figure 4 .
Figure 4. Distribution of the transects and sampling points around the entire study area.

Figure 5 .
Figure 5. Cluster analysis results overlaid on the Trentino Province area boundary (in red).

Figure 5 .
Figure 5. Cluster analysis results overlaid on the Trentino Province area boundary (in red).

Figure 6 .
Figure 6.Species richness map overlaid with a hill-shaded layer.

Figure 6 .
Figure 6.Species richness map overlaid with a hill-shaded layer.

14 Figure 7 .
Figure 7. Rarefaction analysis conducted on the 16 zones.The zone numbers are referenced at the end of each line.

Figure 7 .
Figure 7. Rarefaction analysis conducted on the 16 zones.The zone numbers are referenced at the end of each line.

Table 1 .
Summary of total number of series and species detected for each zone.

Table 2 .
Summary results of the species indicator analysis, with the species most represented and a short description for each zone.