Open Access to the Digital Biodiversity Database: A Comprehensive Functional Model of the Natural History Collections

: The Natural History Collections of Adam Mickiewicz University (AMUNATCOLL) in Pozna´n contain over 2.2 million specimens. Until recently, access to the collections was limited to specialists and was challenging because of the analogue data ﬁles. Therefore, this paper presents a new approach to data sharing called the Scientiﬁc, Educational, Public, and Practical Use (SEPP) Model. Since the stakeholder group is broad, the SEPP Model assumes the following key points: full open access to the digitized collections, the structure of metadata in accordance with certain standards, and a versatile tool set for data mining or statistical and spatial analysis. The SEPP Model was implemented in the AMUNATCOLL IT system, which consists of a web portal equipped with a wide set of explorative functionalities tailored to different user groups: scientists, students, ofﬁcials, and nature enthusiasts. An integral part of the system is a mobile application designed for ﬁeld surveys, enabling users to conduct studies comparing their own ﬁeld data and AMUNATCOLL data. The AMUNATCOLL IT database contains digital data on specimens, biological samples, bibliographic sources, and multimedia nature documents. The metadata structure was developed in accordance with ABCD 2.06 and Darwin Core standards. zoological samples, such as soil samples, litter, bird nests, and deadwood. The geographic locations of the ﬁeld observations saved in the application and the collected samples can be displayed at all times on the map so that the user has a constant view of his or her location while working in the ﬁeld. To deliver even more control over ﬁeldwork, the application can remember the user’s course over the route. The mobile application can be linked to an individual user account established on the AMUNATCOLL portal. Data transferred to a personal account can be processed using all analytical tools implemented on the portal. If these data meet the conditions speciﬁed by the AMUNATCOLL administrator, they can be included in the general database and made publicly available to all system users.


Introduction
Natural history collections (NHCs) have always been fundamental to the study of the diversity and variability of organisms [1]. Specimens collected in the field are essential for descriptions of taxa [2,3] and have become reference points for species identification by successive generations of researchers. However, for several decades, the use of collections has not been limited to taxonomic research. Specimens as information resources are being increasingly widely used in biogeographic, environmental, and ecological research [4]. The value of physical biodiversity specimens has increased significantly through the use of

Materials and Methods
In this paper, we share our experience with the development and implementation of the project "AMU Nature Collections-online (AMUNATCOLL): Digitization and sharing of the natural data resource of the Faculty of Biology at Adam Mickiewicz University in Poznań" [53]. The AMUNATCOLL project was cofinanced by the European Union through the European Regional Development Fund under the Operational Program Digital Poland (OP PC), Priority axis: II E-administration and open government; Action: 2.3 Digital accessibility and usability of public sector information; Submeasure: 2.3.1 Digital access to public sector information from administrative sources and science resources.
The idea of digitizing and sharing data on biodiversity was born at the Faculty of Biology of AMU (FBAMU), which for over 100 years has been a regional centre for biodiversity research and a storage site for NHCs. This idea was implemented in cooperation with the Poznań Supercomputing and Networking Center (PSNC), which is affiliated with the Institute of Bioorganic Chemistry of the Polish Academy of Sciences. The PSNC is an internationally known node of European research in the field of scientific information infrastructure and an important research and development centre in the field of information and communication technologies [54].

Natural History Collections of Adam Mickiewicz University in Poznań
The NHCs were deposited in a museum unit specifically created for this purpose at the FBAMU in 2004 [55]. These collections document the scientific research of several generations of naturalists conducted in many regions of the world. Most are collections gathered by biologists associated with the University of Poznań, which was established in 1919. The oldest specimens date back to the end of the 18th century. The AMU NHCs are characterized by high biodiversity, which corresponds to the diverse methodology of specimen documentation and storage. Therefore, three main groups of collections can be distinguished: botanical, mycological, and zoological.
The botanical collections comprise approximately 500,000 specimens, including over 350,000 vascular plants. Specimens of vascular plants are stored in two herbaria affiliated with Index Herbariorum [56]: POZ and POZG. The POZ herbarium (approx. 190,000 sheets) consists of many collections, mainly from Poland but also from various regions of Europe and North America. This herbarium contains over 240 nomenclatural types of various ranks. The main part of the POZG herbarium is the collection of vascular plants by S. Lisowski (approx. 120,000 sheets), an outstanding expert in the flora of Central Africa. This collection includes over 100 nomenclatural types of different ranks. Bryophytes and algae constitute an important part of the botanical collections. The collection of bryophytes includes approximately 90,000 herbarium bags with mosses (POZG) and over 40,000 bags with liverworts (POZW). The herbarium of algae (POZA) comprises over 10,000 microscopic specimens and dried specimens, including collections of stoneworts by I. Dąmbska. A total of over 20,000 macrofungi (POZM) and 30,000 lichen (POZ) specimens have been deposited in mycological collections.
The zoological collections include over 1,700,000 invertebrate specimen catalogued thus far, as well as 50,000 chordates. Over 80% of the collection is arthropods, the most numerous of which are arachnids and insects. The collection of molluscs is also very rich (over 200,000 specimens). The arachnid phylum, numbering over 660,000 identified specimens, is represented primarily by mites (over 500,000 specimens).
Less numerous but equally interesting are spiders, harvestmen, and pseudoscorpions. In the insect phylum, among more than 500,000 specimens, the most abundant are collections of beetles, wasps, sawflies, bees, ants, and butterflies. The collections of fleas, dragonflies, mayflies, and true bugs are also very interesting. The nomenclatural types (1000 specimens) collected therein prove the high scientific value of the zoological collections.

Definition and Course of Digitization
The applied digitization procedure is consistent with the broad approach introduced by Nelson and Ellis [52]: "We define digitization as the conversion of specimen data from analogue to digital signals. This includes transcribing text data from specimen labels and other specimen-related documents into digital records of those labels and documents regardless of input mode (e.g., voice, keyboard, scanning/optical character recognition (OCR); the translation of physical specimens to digital images of those specimens, including two-dimensional, three-dimensional (3D), computed tomography (CT), and other digital image types that visually represent the physical specimen; the conversion of analogue audio and video recordings to digital recordings; the conversion of textual location descriptions into digital georeferences within an accepted geographical coordinate system and the conversion of other specimen-related data into digital format with technologies that are or might become available". Considering the specificity of the AMUNATCOLL project, these activities were oriented around selected elements specified in the above definition, with particular emphasis on label text transcription; document (e.g., herbarium sheets) scanning; photography with the light microscope, stereoscopic microscope, scanning microscope, and full-matrix CANON cameras; and conversion of textual location description into digital georeferences. The life cycle of the AMUNATCOLL project, which aims to provide access to the digital biodiversity data stored at the FBAMU, has four main phases: initiation, preparation, and implementation of the project and the operation of the IT system ( Figure 1). The basic methodological assumptions of the system development life cycle (SDLC) [57,58] are used in the implementation of this project. The AMUNATCOLL project was carried out for 39 months. Over 70 employees participated in the project implementation. Approximately 60 people were specifically involved in digitizing the collections, including 5 who specialized in the process of geotagging NHC data [59].
(e.g., herbarium sheets) scanning; photography with the light microscope, ster microscope, scanning microscope, and full-matrix CANON cameras; and conve textual location description into digital georeferences.
The life cycle of the AMUNATCOLL project, which aims to provide acces digital biodiversity data stored at the FBAMU, has four main phases: initiation, tion, and implementation of the project and the operation of the IT system (Figur basic methodological assumptions of the system development life cycle (SDLC) [5 used in the implementation of this project. The AMUNATCOLL project was car for 39 months. Over 70 employees participated in the project implementation. A mately 60 people were specifically involved in digitizing the collections, includin specialized in the process of geotagging NHC data [59].

Conceptual Model of Digitized Natural History Collections Open to Science Education, Public and Practical (SEPP) Use
The mission of the AMUNATCOLL IT system is to develop and promote kn about biodiversity by collecting and openly sharing digitized data and providing their analysis for research, education, application and social purposes ( Figure 2) sume that the digitized and shared data will be used not only by specialists in th life sciences but also by experts in culture, history, and other fields of knowled survey conducted during the project preparation phase showed that not only s but also teachers, students, experts in nature conservation and environmental m ment, officials and specialists in spatial planning, and even legal experts are inter digitized data on biodiversity. The expectations signalled at that time were taken count in the conceptual model of the AMUNATCOLL IT system.

Conceptual Model of Digitized Natural History Collections Open to Science, Education, Public and Practical (SEPP) Use
The mission of the AMUNATCOLL IT system is to develop and promote knowledge about biodiversity by collecting and openly sharing digitized data and providing tools for their analysis for research, education, application and social purposes ( Figure 2). We assume that the digitized and shared data will be used not only by specialists in the field of life sciences but also by experts in culture, history, and other fields of knowledge. The survey conducted during the project preparation phase showed that not only scientists but also teachers, students, experts in nature conservation and environmental management, officials and specialists in spatial planning, and even legal experts are interested in digitized data on biodiversity. The expectations signalled at that time were taken into account in the conceptual model of the AMUNATCOLL IT system.
The system consists of a digital database and tool set made available through the web portal and a mobile application integrated with the portal. The AMUNATCOLL database is fed by four analogue information sources subject to the digitization process: (i) natural collections, i.e., specimens of algae, plants, fungi, and animals, as well as a collection of soil samples from all over the world, deposited at the FBAMU; (ii) published and unpublished floristic, faunal, and mycobiotic data, mainly from Poland; (iii) nature photography and films documenting scientific research in many regions of the world; and (iv) bibliographic sources cited in the AMUNATCOLL database.
The AMUNATCOLL mobile application allows users to document observations directly in the natural environment and send them to the portal for further analysis. The portal is interactive, enabling users to explore the shared data and provide feedback to the system administrator. The system consists of a digital database and tool set made available through the portal and a mobile application integrated with the portal. The AMUNATCOLL dat is fed by four analogue information sources subject to the digitization process: (i) na collections, i.e., specimens of algae, plants, fungi, and animals, as well as a collecti soil samples from all over the world, deposited at the FBAMU; (ii) published an published floristic, faunal, and mycobiotic data, mainly from Poland; (iii) nature ph raphy and films documenting scientific research in many regions of the world; an bibliographic sources cited in the AMUNATCOLL database.
The AMUNATCOLL mobile application allows users to document observation rectly in the natural environment and send them to the portal for further analysis portal is interactive, enabling users to explore the shared data and provide feedback system administrator.
Interactivity with biodiversity data is an important feature of the system becaus AMUNATCOLL mobile application allows users to document observations directly natural environment and send them, including the digitized data, to their portal ac for further analysis. Users can also add their own archive location data directly to the in the portal. Additionally, using the features of creating teams and joining teams, can share field data and comments.
The specimen data along with their occurrence, fully available at the A NATCOLL portal, are successively transferred to the international GBIF database [6 It is worth emphasizing that the AMUNATCOLL IT system is open to the inflow of from other research centres and individual owners of nature collections.
The presented idea goes towards Digital Extended Specimen (DES) [62] conce respect of developed functionality as well as following FAIR principles (findability, a sibility, interoperability, and reuse) of specimen metadata. The DES concept was d oped through an open community consultation led by GBIF [63] and is a result of a gation of the open digital specimen concept developed under DiSSCo initiative [64 the extended specimen concept developed by Lendemer [65] and implemented by B versity Collections Network [66]. Presently, the AMUNATCOLL IT system supports or partially the Extended Specimen Network extensions: Primary, Secondary, and Interactivity with biodiversity data is an important feature of the system because the AMUNATCOLL mobile application allows users to document observations directly in the natural environment and send them, including the digitized data, to their portal account for further analysis. Users can also add their own archive location data directly to the map in the portal. Additionally, using the features of creating teams and joining teams, users can share field data and comments.
The specimen data along with their occurrence, fully available at the AMUNATCOLL portal, are successively transferred to the international GBIF database [60,61]. It is worth emphasizing that the AMUNATCOLL IT system is open to the inflow of data from other research centres and individual owners of nature collections.
The presented idea goes towards Digital Extended Specimen (DES) [62] concept in respect of developed functionality as well as following FAIR principles (findability, accessibility, interoperability, and reuse) of specimen metadata. The DES concept was developed through an open community consultation led by GBIF [63] and is a result of aggregation of the open digital specimen concept developed under DiSSCo initiative [64] and the extended specimen concept developed by Lendemer [65] and implemented by Biodiversity Collections Network [66]. Presently, the AMUNATCOLL IT system supports fully or partially the Extended Specimen Network extensions: Primary, Secondary, and Tertiary by enabling access to data such as images, metadata, ecological data, environmental samples, field images, conservation status, and species distribution. Its future development will aim to extend this set at interoperability for other data groups.

Collecting and Sharing Biodiversity Data in the AMUNATCOLL IT System
The main path from nature collections and information being deposited in the traditional (analogue) way at the AMU to making them available in a modern and fully open format leads through the following stages: digitization, database design, and portal equipped with search and analysis tools ( Figure 3). The AMUNATCOLL database, in a special private area, can also be supplied with digital data sent from a mobile application that has been specially developed for field surveys.
format leads through the following stages: digitization, database design, and p equipped with search and analysis tools (Figure 3). The AMUNATCOLL database special private area, can also be supplied with digital data sent from a mobile applic that has been specially developed for field surveys.

Metadata Structure and Database
The core of the AMUNATCOLL IT system is a database with content that is primarily on collections of biological specimens as well as photos and published or p ously unpublished field observations (Section 1 in Table 1). These analogue biodiv documents are described during digitization with metadata that meet ABCD 2.06 s ards [67] and correspond to the Simple Darwin Core Categories [68]: Record-level T Taxon, Occurrence, Event, and Location. Depending on the type of document, the nu of fields in a single record varies from 68 to 85 (Table 1).
For the purpose of the specimen description, fields for information on the loc and method of storage and conservation are also provided. A specific source of mation on biodiversity is samples containing taxonomically unidentified biological rial (mainly invertebrates). These very valuable samples await scientific analysis, pa larly the determination of their species (Section 2 in Table 1). An extensive descript the samples includes, among other things, information about already elaborated sy atic groups that allows researchers to focus their analyses on groups that have n been identified. The list contains 50 metadata items and is modelled on Section 1 AMUNATCOLL database also collects multimedia documents of landscapes and hab as well as photos of specimens, which, although they do not have all the attributes n sary for inclusion in Section 1, can be used for nonscientific purposes (Section 3). I case, a single record consists of 51 fields. Section 4 (References) contains 28 metada scribing the references and other source documents cited in the descriptions of speci photos, biological samples, and multimedia documents. Two sections (1 and 3) al clude six fields for information on the usefulness of digitized objects for specific groups.
The database implemented in AMUNATCOLL IT provides effective access metadata fields as well as organizational, technical, and auxiliary fields. An open-s PostgreSQL server was deployed in the system. It manages relational databases an phasizes extensibility and SQL compliance. PostgreSQL is designed to handle h

Metadata Structure and Database
The core of the AMUNATCOLL IT system is a database with content that is based primarily on collections of biological specimens as well as photos and published or previously unpublished field observations (Section 1 in Table 1). These analogue biodiversity documents are described during digitization with metadata that meet ABCD 2.06 standards [67] and correspond to the Simple Darwin Core Categories [68]: Record-level Terms, Taxon, Occurrence, Event, and Location. Depending on the type of document, the number of fields in a single record varies from 68 to 85 (Table 1).
For the purpose of the specimen description, fields for information on the location and method of storage and conservation are also provided. A specific source of information on biodiversity is samples containing taxonomically unidentified biological material (mainly invertebrates). These very valuable samples await scientific analysis, particularly the determination of their species (Section 2 in Table 1). An extensive description of the samples includes, among other things, information about already elaborated systematic groups that allows researchers to focus their analyses on groups that have not yet been identified. The list contains 50 metadata items and is modelled on Section 1. The AMUNATCOLL database also collects multimedia documents of landscapes and habitats, as well as photos of specimens, which, although they do not have all the attributes necessary for inclusion in Section 1, can be used for nonscientific purposes (Section 3). In this case, a single record consists of 51 fields. Section 4 (References) contains 28 metadata describing the references and other source documents cited in the descriptions of specimens, photos, biological samples, and multimedia documents. Two sections (1 and 3) also include six fields for information on the usefulness of digitized objects for specific user groups.
The database implemented in AMUNATCOLL IT provides effective access to all metadata fields as well as organizational, technical, and auxiliary fields. An open-source PostgreSQL server was deployed in the system. It manages relational databases and emphasizes extensibility and SQL compliance. PostgreSQL is designed to handle heavy workloads, such as data warehouses or web services with multiple concurrent users. Furthermore, it is often the primary choice for solutions for biodiversity systems, as is evidenced by examples of institutions such as the Belgian Biodiversity Platform [69] and Island Conservation Society [70] or the TriatoKey application [71]. The database is divided into three logical parts: (i) "amunatcoll", responsible for storing digitized textual data on specimens; (ii) "dlibra", which stores metadata on digitized images of specimens and multimedia collected directly in the field; and (iii) "anc_portal", responsible for the storage of data related to the portal and its functions.

Portal
The architecture of the AMUNATCOLL portal [72] is in line with the assumptions of the conceptual model of the IT system (Figure 2). It takes into account both the abundance and complexity of information on biodiversity resources deposited in the database as well as the requirements of potential users. The modular and multilayered structure of the portal offers users quick and easy access to data while enabling the administration panel to manage the system.
The homepage contains basic information about the AMUNATCOLL IT system, including database resources, Geographic Information System for Biodiversity (BioGIS) tools, the mobile application, potential users, and the contractors and owners of the system (Figure 4). as the requirements of potential users. The modular and multilayered structure of the portal offers users quick and easy access to data while enabling the administration panel to manage the system.
The homepage contains basic information about the AMUNATCOLL IT system, including database resources, Geographic Information System for Biodiversity (BioGIS) tools, the mobile application, potential users, and the contractors and owners of the system (Figure 4).  A much more extensive description, including the mission of the system, can be found in the "AMUNATCOLL" module. However, the most important functions for the user are in the Search Engines, Tools, and My AMUNATCOLL ( Figure 5) modules. The system allows users to search for information about specimens, biological samples, multimedia documents, bibliographic sources cited in the database, and educational materials. Specimens and taxa can be viewed using several types of search engine. The general search engine is available to everyone, while the others are available to logged-in users. The three specialized search engines differ in the number of selection criteria and conditions that must be met for the specimens sought. For the simple search engine, a user can choose from one to seven criteria that the specimens must meet (Figure 6a). These are deliberately frequently selected specimen attributes: genus or species name in Latin, superior taxonomic unit, author of the collection, year of collection, country, type of collection, and iconographic documentation. In addition, it is feasible to search for specimens by marking their area of origin on the map. In the case of the extended and advanced search engines, the user can choose from 1 to 37 criteria classified into four groups (Figure 6b,c). Furthermore, it should be mentioned that when many criteria are selected, the possibility of defining logical operators was introduced. In the simple and extended search engines, the "and" condition is used by default. On the other hand, in the advanced search engine, the operators "or" and "no" are also implemented and when used are explicitly mentioned and modified.
Specimens can also be accessed by selecting collections or systematic groups, the lists of which are made available in the panels of relevant search engines. Systematic groups are represented by higher rank taxa, for which information is not included in the metadata specification itself. Data on taxonomically unidentified samples can be obtained using 49 selection criteria classified into four groups (Figure 7a). Eight criteria were entered for the multimedia search, and six criteria were entered for the bibliography (Figure 7b,c). Authorized legitimate users can create educational and training materials from AMUNATCOLL database documents that are available to everyone and can be downloaded from the successively completed list.
by marking their area of origin on the map. In the case of the extended and advanced search engines, the user can choose from 1 to 37 criteria classified into four groups (Figure  6b,c). Furthermore, it should be mentioned that when many criteria are selected, the possibility of defining logical operators was introduced. In the simple and extended search engines, the "and" condition is used by default. On the other hand, in the advanced search engine, the operators "or" and "no" are also implemented and when used are explicitly mentioned and modified.  Specimens can also be accessed by selecting collections or systematic groups, the list of which are made available in the panels of relevant search engines. Systematic group are represented by higher rank taxa, for which information is not included in the metadat specification itself. Data on taxonomically unidentified samples can be obtained using 4 The "Tools" module, which is closely related to the search engines, incorporates tw functions: creating statistical reports and visualizing the results of analyses on charts an maps. A detailed description of these tools is beyond the scope of this paper, so we use a example of an analysis aimed at answering the following questions: (i) How many an which scans of Papaver rhoeas specimens are represented in the AMUNATCOLL botanica mycological database collection? (ii) In which countries were they collected? (iii) How ar these sites distributed?
After three search criteria (genus/species, iconography, and collection type) are se lected, the number of specimens that meet them is displayed (Figure 8a). Adding th grouping factor (country) allows the user to obtain a table with the number and percen age distribution of the specimens and to generate appropriate charts (Figure 8b). Usin BioGIS tools, the obtained results can be presented on various types of maps (e.g., do distribution map, cluster map) (Figure 8c,d). This opens up wide possibilities of analysi built upon the BioGIS tools selected by the user according to his or her needs. The "Tools" module, which is closely related to the search engines, incorporates two functions: creating statistical reports and visualizing the results of analyses on charts and maps. A detailed description of these tools is beyond the scope of this paper, so we use an example of an analysis aimed at answering the following questions: (i) How many and which scans of Papaver rhoeas specimens are represented in the AMUNATCOLL botanicalmycological database collection? (ii) In which countries were they collected? (iii) How are these sites distributed?
After three search criteria (genus/species, iconography, and collection type) are selected, the number of specimens that meet them is displayed (Figure 8a). Adding the grouping factor (country) allows the user to obtain a table with the number and percentage distribution of the specimens and to generate appropriate charts (Figure 8b). Using BioGIS tools, the obtained results can be presented on various types of maps (e.g., dot distribution map, cluster map) (Figure 8c,d). This opens up wide possibilities of analysis built upon the BioGIS tools selected by the user according to his or her needs.
The user can choose the following map types: dot distribution map, area class map, choropleth map, diagram map, cluster map, attribute grouping map, and time-lapse map. Depending on the type of map and the nature of the presented phenomenon, the user can define map properties such as colour, shape and size of signatures, colour and width of borders, colour and transparency of separated areas, and number and width of intervals within the variability range of the analysed feature or phenomenon. The user can define the description properties and design a map legend (including title, author, description length, font sizes, legend location, and width). Diversity 2022, 14, x FOR PEER REVIEW 12 of 18 The user can choose the following map types: dot distribution map, area class map, choropleth map, diagram map, cluster map, attribute grouping map, and time-lapse map. Depending on the type of map and the nature of the presented phenomenon, the user can define map properties such as colour, shape and size of signatures, colour and width of borders, colour and transparency of separated areas, and number and width of intervals within the variability range of the analysed feature or phenomenon. The user can define the description properties and design a map legend (including title, author, description length, font sizes, legend location, and width).
The system includes thematic layers such as the grid of geographical coordinates, the grid of the Universal Transverse Mercator (UTM) [73], the grid of the Atlas of distribution of vascular plants in Poland (ATPOL) [74], countries and voivodships, counties, communes, national parks, and landscape parks in Poland.
In addition, the user has the option to define his or her own vector layers and develop his or her own maps.

Mobile Application
The AMUNATCOLL mobile application is aimed at users observing flora, fauna, mycobiota, and natural habitats in the field for scientific, educational, and professional purposes related to nature conservation [75]. This mobile application enables the documentation of natural observations in the form of text descriptions, photos, and voice recordings (Figure 9). The observation form contains a list of predefined fields and open fields The system includes thematic layers such as the grid of geographical coordinates, the grid of the Universal Transverse Mercator (UTM) [73], the grid of the Atlas of distribution of vascular plants in Poland (ATPOL) [74], countries and voivodships, counties, communes, national parks, and landscape parks in Poland.
In addition, the user has the option to define his or her own vector layers and develop his or her own maps.

Mobile Application
The AMUNATCOLL mobile application is aimed at users observing flora, fauna, mycobiota, and natural habitats in the field for scientific, educational, and professional purposes related to nature conservation [75]. This mobile application enables the documentation of natural observations in the form of text descriptions, photos, and voice recordings ( Figure 9). The observation form contains a list of predefined fields and open fields that the user can define him-or herself. The list of predefined fields includes ordinal data, data identifying the observation (number, date, author), the coordinates of the place of observation, area size, and vegetation cover. A researcher observing species can specify their scientific names from the list provided and assign them characteristics such as quantity and coverage, sex, age, stage of development, and dimensions. An ecologist documenting natural habitats has the option of selecting habitat types from the list provided or entering names according to individual classification. A special form is available for descriptions of zoological samples, such as soil samples, litter, bird nests, and deadwood. The geographic locations of the field observations saved in the application and the collected samples can be displayed at all times on the map so that the user has a constant view of his or her location while working in the field. To deliver even more control over fieldwork, the application can remember the user's course over the route. The mobile application can be linked to an individual user account established on the AMUNATCOLL portal. Data transferred to a personal account can be processed using all analytical tools implemented on the portal. If these data meet the conditions specified by the AMUNATCOLL administrator, they can be included in the general database and made publicly available to all system users. tity and coverage, sex, age, stage of development, and dimensions. An ecologist docu menting natural habitats has the option of selecting habitat types from the list provide or entering names according to individual classification. A special form is available fo descriptions of zoological samples, such as soil samples, litter, bird nests, and deadwood The geographic locations of the field observations saved in the application and the co lected samples can be displayed at all times on the map so that the user has a constan view of his or her location while working in the field. To deliver even more control ove fieldwork, the application can remember the user's course over the route. The mobile ap plication can be linked to an individual user account established on the AMUNATCOL portal. Data transferred to a personal account can be processed using all analytical too implemented on the portal. If these data meet the conditions specified by the AMU NATCOLL administrator, they can be included in the general database and made publicl available to all system users.

API Interface
Access to the AMUNATCOLL IT system takes place through an interface imple mented in REST (REpresentational State Transfer) technology ( Figure 10). The portal an the mobile application use the same programming interface. Most of the offered function alities are available only to the logged-in user; therefore, access to individual interfac methods is secured with JSON Web Token (JWT). Selected information from the taxo nomic database is made available to external entities, primarily the GBIF. For this purpose the "BioCASe Provider Software" (BPS) service, which is compatible with the "Biologica Collection Access Service" [76], was established. This international network of biodive sity repositories combines specimen data from wildlife collections, botanical garden

API Interface
Access to the AMUNATCOLL IT system takes place through an interface implemented in REST (REpresentational State Transfer) technology ( Figure 10). The portal and the mobile application use the same programming interface. Most of the offered functionalities are available only to the logged-in user; therefore, access to individual interface methods is secured with JSON Web Token (JWT). Selected information from the taxonomic database is made available to external entities, primarily the GBIF. For this purpose, the "BioCASe Provider Software" (BPS) service, which is compatible with the "Biological Collection Access Service" [76], was established. This international network of biodiversity repositories combines specimen data from wildlife collections, botanical gardens, zoos, and other research institutions around the world with information from huge observational databases, including the AMUNATCOLL database. zoos, and other research institutions around the world with information from huge observational databases, including the AMUNATCOLL database. Figure 10. Communication between the AMUNATCOLL IT system and external biodiversity databases.

Conclusions
Digital databases based on NHCs are already well established in the science of biodiversity. Large international data aggregators, such as the GBIF or Integrated Digitized Biocollections, are of global importance in facilitating access to biological data. A significant role is also played by small and medium-sized biodiversity collections, which on the one hand enable biodiversity research on a regional and local scale and on the other hand supply information to global databases. Such databases, apart from their scientific role, are also of great importance in the effective protection of biodiversity by developing areas of social activity such as education, citizen science, and environmental management. The social aspect and the related benefits for the wide group of stakeholders should be borne in mind whenever natural resources are being digitized and information systems are being designed to make them available. Digitization of specimens provides a great impetus to undertake new directions of scientific research. Digitization also creates an important opportunity to open access to nature collections to teachers, students, officials responsible for environmental protection, citizens fascinated by nature, and finally everyone who wants to learn to know, investigate, and protect the natural world.
All of the above assumptions drove the design and development of the AMU-NATCOLL IT system, which is based on the NHCs of AMU. The ultimate project goals were achieved, inter alia, by  basing the metadata structure on international standards (ABCD 2.06 and Darwin Core) and enabling the mechanism to transfer data to the GBIF database;  expanding the metadata structure with fields useful in education, popularization of biodiversity knowledge, and practical nature conservation;  considering physical specimens and multimedia documents collected directly under empirical environmental study in the digitization process;  implementing advanced search and analysis tools, especially a rich set of tools for spatial analysis, in the portal;  integrating the portal with the dedicated mobile application, intended for documenting natural observations directly in the field;  facilitating a space on the portal for user observations documented by means of a mobile application to work on NHCs; Figure 10. Communication between the AMUNATCOLL IT system and external biodiversity databases.

Conclusions
Digital databases based on NHCs are already well established in the science of biodiversity. Large international data aggregators, such as the GBIF or Integrated Digitized Biocollections, are of global importance in facilitating access to biological data. A significant role is also played by small and medium-sized biodiversity collections, which on the one hand enable biodiversity research on a regional and local scale and on the other hand supply information to global databases. Such databases, apart from their scientific role, are also of great importance in the effective protection of biodiversity by developing areas of social activity such as education, citizen science, and environmental management. The social aspect and the related benefits for the wide group of stakeholders should be borne in mind whenever natural resources are being digitized and information systems are being designed to make them available. Digitization of specimens provides a great impetus to undertake new directions of scientific research. Digitization also creates an important opportunity to open access to nature collections to teachers, students, officials responsible for environmental protection, citizens fascinated by nature, and finally everyone who wants to learn to know, investigate, and protect the natural world.
All of the above assumptions drove the design and development of the AMUNATCOLL IT system, which is based on the NHCs of AMU. The ultimate project goals were achieved, inter alia, by • basing the metadata structure on international standards (ABCD 2.06 and Darwin Core) and enabling the mechanism to transfer data to the GBIF database; • expanding the metadata structure with fields useful in education, popularization of biodiversity knowledge, and practical nature conservation; • considering physical specimens and multimedia documents collected directly under empirical environmental study in the digitization process; • implementing advanced search and analysis tools, especially a rich set of tools for spatial analysis, in the portal; • integrating the portal with the dedicated mobile application, intended for documenting natural observations directly in the field; • facilitating a space on the portal for user observations documented by means of a mobile application to work on NHCs; • creating an open system for collectors wanting to deposit their digital specimens, documents, or observations in the AMUNATCOLL database.
We believe that the SEPP Model of the IT system presented briefly in this work can be an inspiration for many institutions that are developing and implementing collection digitization processes for the benefit of knowledge and the protection of biodiversity.