Enhancing Trait Thesauri Interoperability Using a Manual and Automated Alignment Approach
Abstract
1. Introduction
2. Materials and Methods
2.1. SKOS Thesauri
2.2. Manual Mapping Approach
2.3. Automatic Matching Approach Within the OAEI
- (1) Preparation: Data conversion, thesauri loading, and matcher configuration;
- (2) Execution: Automatic generation of alignments on pairwise thesaurus comparisons;
- (3) Evaluation: Comparison of system-generated mappings against expert-based reference alignments.
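
The evaluation step can be illustrated with a short sketch. It assumes that each alignment is represented as a set of (source concept, target concept) pairs, so that precision, recall, and F-measure follow from the overlap between the system output and the expert reference. The function name `evaluate` and the concept identifiers are hypothetical and are not taken from the OAEI tooling.

```python
def evaluate(system: set[tuple[str, str]],
             reference: set[tuple[str, str]]) -> dict[str, float]:
    """Score a system alignment against a reference alignment."""
    tp = len(system & reference)   # mappings found by both
    fp = len(system - reference)   # mappings found only by the system
    fn = len(reference - system)   # reference mappings the system missed
    precision = tp / len(system) if system else 0.0
    recall = tp / len(reference) if reference else 0.0
    denominator = precision + recall
    f_measure = 2 * precision * recall / denominator if denominator else 0.0
    return {
        "true_positive": tp,
        "false_positive": fp,
        "false_negative": fn,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Hypothetical identifiers: 20 expert mappings, 19 of them found by the system.
reference = {(f"fish:{i}", f"benthos:{i}") for i in range(20)}
system = {(f"fish:{i}", f"benthos:{i}") for i in range(19)}

scores = evaluate(system, reference)
print({name: round(value, 2) for name, value in scores.items()})
```

The counts in this example were chosen to mirror the FISH–MACROZOOBENTHOS row of the pairwise comparison table (20 manual mappings, 19 detected, 19 true positives), for which the sketch gives a precision of 1, a recall of 0.95, and an F-measure of 0.97.
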
- LogMap [40] is an ontology matching system that constructs an inverted lexical index for each ontology and uses external lexicons to find synonyms and lexical variations. It also exploits the information in the class hierarchy and employs reasoning and repair techniques to minimise logical errors.
- LogMapLt is a lightweight variant of LogMap that applies string matching techniques.
- LogMapKG is a variant of LogMap that returns both instance-level and concept-level correspondences.
- Matcha [41] is an ontology matching system that incorporates the lexical and structural algorithms from AML and a matching algorithm that uses large language models (LLMs). The system relies on the entities being semantically equivalent, either by having the same URI or by being declared as owl:sameAs.
- OLaLa [42,43] is a matching system based on sentence transformers and LLMs. It first generates matching candidates using a Sentence-BERT (SBERT) model applied to text extracted from labels, descriptions, URI fragments, and annotation properties. The candidates are then passed to the LLM, which either judges each candidate independently (deciding whether it is correct or not) or selects the most likely correspondence from a set of possible targets. The output of a high-precision matcher is then added, and finally filters are applied so that only candidates with high confidence are returned.
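
The inverted lexical index mentioned for LogMap can be pictured with a minimal sketch. This is not LogMap's implementation: it only assumes that each label is normalised into a set of lower-cased words and that concepts sharing a normalised label in both thesauri become candidate mappings. All function names, concept identifiers, and labels are hypothetical.

```python
from collections import defaultdict


def normalise(label: str) -> frozenset[str]:
    """Lower-case a label and reduce it to its set of word tokens."""
    cleaned = label.lower().replace("-", " ").replace("_", " ")
    return frozenset(cleaned.split())


def build_inverted_index(labels: dict[str, list[str]]) -> dict[frozenset[str], set[str]]:
    """Map each normalised label to the concept IDs that carry it."""
    index: dict[frozenset[str], set[str]] = defaultdict(set)
    for concept_id, concept_labels in labels.items():
        for label in concept_labels:
            key = normalise(label)
            if key:  # skip empty labels
                index[key].add(concept_id)
    return dict(index)


def candidate_mappings(index_a, index_b) -> set[tuple[str, str]]:
    """Pair concepts whose normalised labels occur in both indexes."""
    pairs = set()
    for key in index_a.keys() & index_b.keys():
        for concept_a in index_a[key]:
            for concept_b in index_b[key]:
                pairs.add((concept_a, concept_b))
    return pairs


# Hypothetical concepts with preferred and alternative labels.
fish = {
    "fish:001": ["Body length", "length of body"],
    "fish:002": ["Trophic level"],
}
zooplankton = {
    "zoo:010": ["body-length"],
    "zoo:011": ["Feeding type"],
}

candidates = candidate_mappings(build_inverted_index(fish),
                                build_inverted_index(zooplankton))
print(sorted(candidates))
```

Indexing by word sets rather than raw strings makes the lookup insensitive to case, hyphenation, and word order. A full matcher would then extend the index with synonyms and filter the candidates using the hierarchy, as the tool descriptions above indicate.
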
3. Results
3.1. Manual Mapping
3.2. Adequacy of Matching Tools: Results Against Manually Created Mappings
3.3. OLaLa Performance
3.4. Thesauri Merging: The Trait Thesaurus
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Kissling, W.D.; Walls, R.; Bowser, A.; Jones, M.O.; Jens, K.; Donat, A.; Josep, A.; Basset, A.; van Bodegom, P.M.; Cornelissen, J.H.C.; et al. Towards global data products of essential biodiversity variables on species traits. Nat. Ecol. Evol. 2018, 2, 1531–1540.
- Flynn, D.F.B.; Mirotchnick, N.; Jain, M.; Palmer, M.I.; Naeem, S. Functional and phylogenetic diversity as predictors of biodiversity–ecosystem-function relationships. Ecology 2011, 92, 1573–1581.
- Cardinale, B.J.; Duffy, J.E.; Gonzalez, A.; Hooper, D.U.; Perrings, C.; Venail, P.; Narwani, A.; Mace, G.; Tilman, D.; Wardle, D.A.; et al. Biodiversity loss and its impact on humanity. Nature 2012, 486, 59–67.
- Krause, S.; Le Roux, X.; Niklaus, P.A.; Van Bodegom, P.M.; Lennon, J.T.; Bertilsson, S.; Grossart, H.P.; Philippot, L.; Bodelier, P.L.E. Trait-based approaches for understanding microbial biodiversity and ecosystem functioning. Front. Microbiol. 2014, 5, 251.
- Pata, P.R.; Hunt, B.P.V. Harmonizing marine zooplankton trait data toward a mechanistic understanding of ecosystem functioning. Limnol. Oceanogr. 2024, 70, S8–S27.
- Laraib, M.; Titocci, J.; Rosati, I.; Basset, A. An integrated individual-level trait-based phytoplankton dataset from transitional waters. Sci. Data 2023, 10, 897.
- Falster, D.; Gallagher, R.; Wenk, E.H.; Wright, I.J.; Indiarto, D.; Andrew, S.C.; Baxter, C.; Lawson, J.; Allen, S.; Fuchs, A.; et al. AusTraits, a curated plant trait database for the Australian flora. Sci. Data 2021, 8, 254.
- Pekár, S.; Wolff, J.O.; Černecká, Ľ.; Birkhofer, K.; Mammola, S.; Lowe, E.C.; Fukushima, C.S.; Herberstein, M.E.; Kučera, A.; Buzzatto, B.A.; et al. The World Spider Trait database: A centralized global open repository for curated data on spider traits. Database 2021, 2021, baab064.
- Gallagher, R.V.; Falster, D.S.; Maitner, B.S.; Salguero-Gómez, R.; Vandvik, V.; Pearse, W.D.; Schneider, F.D.; Kattge, J.; Poelen, J.H.; Madin, J.S.; et al. Open science principles for accelerating trait-based science across the Tree of Life. Nat. Ecol. Evol. 2020, 4, 294–303.
- Kattge, J.; Bönisch, G.; Díaz, S.; Lavorel, S.; Prentice, I.C.; Leadley, P.; Tautenhahn, S.; Werner, G.D.A.; Aakala, T.; Abedi, M.; et al. TRY plant trait database–enhanced coverage and open access. Glob. Change Biol. 2020, 26, 119–188.
- Kattge, J.; Díaz, S.; Lavorel, S.; Prentice, I.C.; Leadley, P.; Bönisch, G.; Garnier, E.; Westoby, A.M.; Reich, P.B.; Wright, I.J.; et al. TRY – A global database of plant traits. Glob. Change Biol. 2011, 17, 2905–2935.
- Jones, K.E.; Bielby, J.; Cardillo, M.; Fritz, S.A.; O’Dell, J.; Orme, C.D.L.; Safi, K.; Sechrest, W.; Boakes, E.H.; Carbone, C.; et al. PanTHERIA: A species-level database of life history, ecology, and geography of extant and recently extinct mammals. Ecology 2009, 90, 2648.
- Dawson, S.K.; Carmona, C.P.; González-Suárez, M.; Jönsson, M.; Chichorro, F.; Mallen-Cooper, M.; Melero, Y.; Moor, H.; Simaika, J.P.; Duthie, A.B.; et al. The traits of “trait ecologists”: An analysis of the use of trait and functional trait terminology. Ecol. Evol. 2021, 11, 16434–16445.
- Wilkinson, M.; Dumontier, M.; Aalbersberg, I.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.W.; Bonino da Silva Santos, L.; Bourne, P.E.; et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data 2016, 3, 160018.
- Wilkinson, S.R.; Aloqalaa, M.; Belhajjame, K.; Crusoe, M.R.; Kinoshita, B.P.; Gadelha, L.; Garijo, D.; Gustafsson, O.J.R.; Juty, N.; Kanwal, S.; et al. Applying the FAIR Principles to computational workflows. Sci. Data 2025, 12, 328.
- Bernabé, C.H.; Queralt-Rosinach, N.; Silva Souza, V.E.; Bonino da Silva Santos, L.O.; Mons, B.; Jacobsen, A.; Roos, M. The use of Foundational Ontologies in Bioinformatics. In Proceedings of the 13th International Conference on Semantic Web Applications and Tools for Health Care and Life Sciences, SWAT4HCLS 2022, Leiden, The Netherlands, 10–14 January 2022.
- Schultes, E.; Magagna, B.; Hettne, K.M.; Pergl, R.; Suchánek, M.; Kuhn, T. Reusable FAIR Implementation Profiles as Accelerators of FAIR Convergence. In Advances in Conceptual Modeling; Grossmann, G., Ram, S., Eds.; ER 2020. Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2020; Volume 12584.
- Wyborn, L.A.; Prent, A.; Croucher, J.; Rees, N.; Farrington, R. Using FAIR Implementation Profiles (FIPs) and FAIR Enabling Resources (FERs) to Accelerate Machine-to-machine Interoperability of Geoscience datasets Within and Across Repositories, Communities and Other Domains. In Proceedings of the AGU Fall Meeting Abstracts, San Francisco, CA, USA, 11–15 December 2023; Volume 2023.
- Magagna, B.; Schultes, E.; Fouilloux, A.; Burger, G.; Devriendt, D.; Bramley, R.; Kuhn, T.; Rebelo Moreira, J.L.; Bonino da Silva Santos, L.O.; Ferreira Pires, L. Ontological Analysis of FAIR Supporting Resources. In Proceedings of the Joint Ontology Workshops-Episode X: The Tukker Zomer of Ontology, and Satellite Events, JOWO 2024, Enschede, The Netherlands, 15–19 July 2024.
- Zeng, M.L. Knowledge organization systems (KOS). KO Knowl. Organ. 2008, 35, 160–182.
- Wieczorek, J.; Bloom, D.; Guralnick, R.; Blum, S.; Döring, M.; Giovanni, R.; Robertson, T.; Vieglais, D. Darwin Core: An evolving community-developed biodiversity data standard. PLoS ONE 2012, 7, e29715.
- Schneider, F.D.; Fichtmueller, D.; Gossner, M.M.; Güntsch, A.; Jochum, M.; König-Ries, B.; Le Provost, G.; Manning, P.; Ostrowski, A.; Penone, C.; et al. Towards an ecological trait-data standard. Methods Ecol. Evol. 2019, 10, 2006–2019.
- Rosati, I.; Bergami, C.; Fiore, N.; Oggioni, A.; Tagliolato, P. LifeWatch Italy Thesauri Documentation; Version 1.0; CNR Edizioni: Roma, Italy, 2017; p. 18. ISBN 978-88-8080-249-5.
- Rosati, I.; Bergami, C.; Stanca, E.; Roselli, L.; Tagliolato, P.; Oggioni, A.; Fiore, N.; Pugnetti, A.; Zingone, A.; Boggero, A.; et al. A thesaurus for phytoplankton trait-based approaches: Development and applicability. Ecol. Inform. 2017, 42, 129–138.
- Pey, B.; Laporte, M.A.; Nahmani, J.; Auclerc, A.; Capowiez, Y.; Caro, G.; Cluzeau, D.; Cortet, J.; Decaëns, T.; Dubs, F.; et al. A thesaurus for soil invertebrate trait-based approaches. PLoS ONE 2014, 9, e108985.
- Faulwetter, S.; Markantonatou, V.; Pavloudi, C.; Papageorgiou, N.; Keklikoglou, K.; Chatzinikolaou, E.; Pafilis, E.; Chatzigeorgiou, G.; Vasileiadou, K.; Dailianis, T.; et al. Polytraits: A database on biological traits of marine polychaetes. Biodivers. Data J. 2014, 2, e1024.
- Vandenbussche, P.-Y.; Atemezing, G.A.; Poveda-Villalón, M.; Vatant, B. Linked Open Vocabularies (LOV): A gateway to reusable semantic vocabularies on the Web. Semant. Web 2016, 8, 437–452.
- Whetzel, P.L.; Noy, N.F.; Shah, N.H.S.; Alexander, P.R.; Nyulas, C.; Tudorache, T.; Musen, M.A. BioPortal: Enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications. Nucleic Acids Res. 2011, 39 (Suppl. S2), W541–W545.
- Jonquet, C.; Toulet, A.; Arnaud, E.; Aubin, S.; Dzalé Yeumo, E.; Emonet, V.; Graybeal, J.; Laporte, M.A.; Musen, M.A.; Pesce, V.; et al. AgroPortal: A vocabulary and ontology repository for agronomy. Comput. Electron. Agric. 2018, 144, 126–143.
- Pierkot, C.; Alviset, G.; Vernet, M. The EarthPortal towards an ontology repository for the Earth System semantic artefacts. In Proceedings of the Onto4FAIR 2023 Workshops, Sherbrooke, QC, Canada, 20 July 2023; pp. 17–22.
- Tarallo, A.; Pulieri, M.; Ramezani, P.; Rosati, I. Advancements in EcoPortal: Enhancing functionalities for the ecological domain semantic artefacts repository. FAIR Connect Empower. Data Steward. 2024, 2, 1–7.
- Jonquet, C.; Graybeal, J.; Bouazzouni, S.; Dorf, M.; Fiore, N.; Kechagioglou, X.; Redmond, T.; Rosati, I.; Skrenchuk, A.; Vendetti, J.L.; et al. Ontology Repositories and Semantic Artefact Catalogues with the OntoPortal Technology. In The Semantic Web–ISWC 2023; Payne, T.R., Presutti, V., Qi, G., Poveda-Villalón, M., Stoilos, G., Hollink, L., Kaoudi, Z., Cheng, G., Li, J., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2023; Volume 14266.
- Yang, S.-Y. OntoPortal: An ontology-supported portal architecture with linguistically enhanced and focused crawler technologies. Expert Syst. Appl. 2009, 36, 10148–10157.
- Karam, N.; Khiat, A.; Algergawy, A.; Sattler, M.; Weiland, C.; Schmidt, M. Matching biodiversity and ecology ontologies: Challenges and evaluation results. Knowl. Eng. Rev. 2020, 35, e9.
- Martínez-González, M.M.; Alvite-Díez, M.L. The support of constructs in thesaurus tools from a Semantic Web perspective: Framework to assess standard conformance. Comput. Stand. Interfaces 2019, 65, 79–91.
- Abd Nikooie Pour, M.; Algergawy, A.; Buche, P.; Castro, L.J.; Chen, J.; Coulet, A.; Cufi, J.; Dong, H.; Fallatah, O.; Faria, D.; et al. Results of the Ontology Alignment Evaluation Initiative 2023. In Proceedings of the 18th International Workshop on Ontology Matching (OM 2023), HAL, Athens, Greece, 6–7 November 2023; Available online: https://hal.archives-ouvertes.fr/hal-04366893 (accessed on 2 October 2025).
- Gonzales-Aguilar, A.; Ramírez-Posada, M.; Ferreyra, D. TemaTres: Software para gestionar tesauros. Prof. Inf. 2012, 21, 319–325.
- Stellato, A.; Fiorelli, M.; Turbati, A.; Lorenzetti, T.; Van Gemert, W.; Dechandon, D.; Laaboudi-Spoiden, C.; Gerencser, A.; Waniart, A.; Costetchi, E.; et al. VocBench 3: A collaborative Semantic Web editor for ontologies, thesauri, and lexicons. Semant. Web 2020, 11, 855–881.
- Faria, D.; Pesquita, C.; Santos, E.; Palmonari, M.; Cruz, I.F.; Couto, F.M. The AgreementMakerLight ontology matching system. In On the Move to Meaningful Internet Systems: OTM 2013 Conferences; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2013; Volume 8185, pp. 527–541.
- Jiménez-Ruiz, E.; Grau, B.C. LogMap: Logic-based and scalable ontology matching. In Proceedings of the 10th International Semantic Web Conference (ISWC ’11), Bonn, Germany, 23–27 October 2011; pp. 273–288.
- Faria, D.; Silva, M.C.; Cotovio, P.; Eugénio, P.; Pesquita, C. Matcha and Matcha-DL results for OAEI 2022. In Proceedings of the 17th International Workshop on Ontology Matching (OM 2022) Co-Located with the 21st International Semantic Web Conference (ISWC 2022), Hangzhou, China, 23 October 2022. CEUR Workshop Proceedings, 3324. CEUR-WS.org.
- Hertling, S.; Paulheim, H. OLaLa: Ontology matching with large language models. In Proceedings of the 12th Knowledge Capture Conference (K-CAP ’23), Pensacola, FL, USA, 2–7 December 2023; pp. 131–139.
- Dhamankar, R.; Lee, Y.; Doan, A.; Halevy, A.; Domingos, P. iMAP: Discovering Complex Semantic Matches between Database Schemas. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Paris, France, 13–18 June 2004.
- Hertling, S.; Portisch, J.; Paulheim, H. Melt-matching evaluation toolkit. In Proceedings of the International Conference on Semantic Systems, Karlsruhe, Germany, 9–12 September 2019; Springer International Publishing: Cham, Switzerland, 2019.
- Di Muri, C.; Pulieri, M.; Raho, D.; Muresan, A.N.; Tarallo, A.; Titocci, J.; Nestola, E.; Basset, A.; Mazzoni, S.; Rosati, I. Assessing semantic interoperability in environmental sciences: Variety of approaches and semantic artefacts. Sci. Data 2024, 11, 1055.
- Kotis, K.; Lanzenberger, M. Ontology matching: Current status, dilemmas and future challenges. In Proceedings of the 2008 International Conference on Complex, Intelligent and Software Intensive Systems, Barcelona, Spain, 4–7 March 2008; IEEE: New York, NY, USA, 2008.
- Shvaiko, P.; Euzenat, J. Ontology matching: State of the art and future challenges. IEEE Trans. Knowl. Data Eng. 2011, 25, 158–176.



| Thesaurus | Short Name | Version | Concepts | Link |
|---|---|---|---|---|
| Phytoplankton Traits Thesaurus | PHYTOTRAITS | 1.5 | 86 | https://ecoportal.lifewatch.eu/ontologies/PHYTOTRAITS, accessed on 2 October 2025 |
| Zooplankton Traits Thesaurus | ZOOPLANKTRAITS | 1.5 | 52 | https://ecoportal.lifewatch.eu/ontologies/ZOOPLANKTRAITS, accessed on 2 October 2025 |
| Fish Traits Thesaurus | FISHTRAITS | 1.5 | 126 | https://ecoportal.lifewatch.eu/ontologies/FISHTRAITS, accessed on 2 October 2025 |
| Macroalgae Traits Thesaurus | MACROALGAETRAITS | 1.5 | 110 | https://ecoportal.lifewatch.eu/ontologies/MACROALGAETRAITS, accessed on 2 October 2025 |
| Macrozoobenthos Traits Thesaurus | MACROZOOBENTHOSTRAITS | 1.5 | 125 | not published |

| System | Time (HH:MM:SS) | N. Mappings Detected | True Positive | False Positive | Precision | Recall | F-Measure |
|---|---|---|---|---|---|---|---|
| MACROALGAE–MACROZOOBENTHOS | | | | | | | |
| OLaLa | 0:08:30 | 10 | 9 | 1 | 0.7 | 0.39 | 0.5 |
| LogMapLt | 0:00:00 | 7 | 7 | 0 | 0.86 | 0.33 | 0.48 |
| LogMap | 0:00:03 | 29 | 8 | 21 | 0.27 | 0.44 | 0.34 |
| LogMapKG | 0:00:04 | 29 | 9 | 20 | 0.27 | 0.44 | 0.34 |
| Matcha | 0:00:07 | 45 | 9 | 36 | 0.2 | 0.5 | 0.28 |
| FISH–ZOOPLANKTON | | | | | | | |
| OLaLa | 0:07:59 | 13 | 13 | 0 | 1 | 0.87 | 0.93 |
| LogMapLt | 0:00:00 | 8 | 8 | 0 | 1 | 0.53 | 0.69 |
| LogMap | 0:00:03 | 32 | 3 | 29 | 0.09 | 0.2 | 0.13 |
| LogMapKG | 0:00:04 | 55 | 11 | 44 | 0.22 | 0.8 | 0.34 |
| Matcha | 0:00:11 | 47 | 13 | 34 | 0.28 | 0.87 | 0.42 |

| Pairwise Comparison | N. Mappings Manually Detected | N. Mappings Automatically Detected | True Positive | False Positive | Precision | Recall | F-Measure |
|---|---|---|---|---|---|---|---|
| FISH–MACROALGAE | 13 | 9 | 7 | 2 | 0.7 | 0.54 | 0.63 |
| FISH–MACROZOOBENTHOS | 20 | 19 | 19 | 0 | 1 | 0.95 | 0.97 |
| ZOOPLANKTON–MACROZOOBENTHOS | 18 | 15 | 15 | 0 | 1 | 0.83 | 0.9 |
| ZOOPLANKTON–MACROALGAE | 11 | 4 | 4 | 0 | 1 | 0.36 | 0.53 |
| PHYTOPLANKTON–FISH | 13 | 11 | 9 | 2 | 0.81 | 0.69 | 0.75 |
| PHYTOPLANKTON–MACROALGAE | 15 | 10 | 10 | 0 | 1 | 0.66 | 0.8 |
| PHYTOPLANKTON–ZOOPLANKTON | 22 | 19 | 19 | 0 | 1 | 0.86 | 0.92 |
| PHYTOPLANKTON–MACROZOOBENTHOS | 15 | 10 | 9 | 1 | 0.9 | 0.64 | 0.75 |

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Titocci, J.; Pulieri, M.; Rosati, I.; Karam, N. Enhancing Trait Thesauri Interoperability Using a Manual and Automated Alignment Approach. Appl. Sci. 2025, 15, 12484. https://doi.org/10.3390/app152312484

