Alignment: A Hybrid, Interactive and Collaborative Ontology and Entity Matching Service
Abstract
1. Introduction
- a section presenting our study on similarity algorithms, explaining our decision on the default configuration of the suggestions engine
- a section presenting two use cases
- a section presenting related work
- updated text within existing sections
2. Related Work
3. Alignment Platform Presentation
3.1. System Architecture
3.2. GUI
3.2.1. Graph Module
3.2.2. Detailed Entity Information
3.2.3. System Suggestions
3.2.4. Link Type Option
3.2.5. Created Links Monitor
3.3. Calculating Similarities
3.3.1. Default Configuration Parameters Choice and Validation
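The Jaro similarity $sim_j$ of two strings $s_1$ and $s_2$ is

$$sim_j = \begin{cases} 0 & \text{if } m = 0 \\ \dfrac{1}{3}\left(\dfrac{m}{|s_1|} + \dfrac{m}{|s_2|} + \dfrac{m - t}{m}\right) & \text{otherwise} \end{cases}$$

where:
- $|s_i|$ is the length of string $s_i$;
- $m$ is the number of matching characters;
- $t$ is the number of transpositions.

The Jaro-Winkler similarity $sim_w$ is

$$sim_w = sim_j + l \, p \, (1 - sim_j)$$

where:
- $sim_j$ is the Jaro similarity of the two strings;
- $l$ is the length of the common prefix at the beginning of the strings (maximum four characters);
- $p$ is a constant scaling factor.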
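The definitions above correspond to the Jaro and Jaro-Winkler string measures. As an illustration only, the following minimal Python sketch computes both from the quantities m, t, l and p. It is not the platform's own implementation. The function names, the matching window of half the longer string minus one, and the default p = 0.1 (the value commonly used with Winkler's variant) are assumptions, not settings taken from the platform.

```python
def jaro(s1: str, s2: str) -> float:
    """Jaro similarity: (m/|s1| + m/|s2| + (m - t)/m) / 3, or 0 when m == 0."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Assumed convention: two equal characters match only if their
    # positions differ by at most this window.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    m = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                m += 1
                break
    if m == 0:
        return 0.0
    # t: half the number of matched characters that appear in a different order.
    seq1 = [c for c, flag in zip(s1, matched1) if flag]
    seq2 = [c for c, flag in zip(s2, matched2) if flag]
    t = sum(a != b for a, b in zip(seq1, seq2)) / 2
    return (m / len1 + m / len2 + (m - t) / m) / 3


def jaro_winkler(s1: str, s2: str, p: float = 0.1, max_prefix: int = 4) -> float:
    """Jaro-Winkler similarity: sim_j + l * p * (1 - sim_j)."""
    sim_j = jaro(s1, s2)
    # l: length of the common prefix, capped at max_prefix characters.
    l = 0
    for a, b in zip(s1, s2):
        if a != b or l == max_prefix:
            break
        l += 1
    return sim_j + l * p * (1 - sim_j)


if __name__ == "__main__":
    print(round(jaro("MARTHA", "MARHTA"), 4))
    print(round(jaro_winkler("MARTHA", "MARHTA"), 4))
```

Because l is capped at four characters and p is kept at or below 0.25, the prefix bonus can never push the score above 1.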
3.3.2. User Configuration
3.4. Integration, Collaboration and Social Features
3.4.1. Integration with Other Services
3.4.2. SPARQL Endpoint
3.4.3. API
3.4.4. Working on the Same Project
3.4.5. Crowdsourcing Link Validation
4. Evaluation
4.1. Link Creation Module Evaluation
4.1.1. Junior Level Ontology Engineers
4.1.2. Domain Experts
4.2. Link Validation Module Evaluation
Trial Set Up
4.3. Besides OpenBudgets.eu—Use Cases
4.3.1. EveryPolitician Project
4.3.2. PhD Hub
5. Summary
Author Contributions
Funding
Conflicts of Interest
Abbreviations
SKOS | Simple Knowledge Organization System |
RDF | Resource Description Framework |
SPARQL | SPARQL Protocol and RDF Query Language |
API | Application Programming Interface |
REST | Representational State Transfer |
SOAP | Simple Object Access Protocol |
OWL | Web Ontology Language |
ACM CCS | Association for Computing Machinery Computing Classification System |
MeSH | Medical Subject Headings |
STW | Standard-Thesaurus Wirtschaft |
GUI | Graphical User Interface |
FOAF | Friend of a Friend Vocabulary |
Silk LSL | Silk Link Specification Language |
CPA | Classification of Products by Activity |
CPC | Central Product Classification |
# | Similarity Measure | Threshold | Max Distance | PR% | RE% | Time (s) |
---|---|---|---|---|---|---|
1 | Dice Coefficient | 0.0 | null | 0.65 | 9.35 | 700 |
2 | Dice Coefficient | 0.1 | null | 39.59 | 23.45 | 14 |
3 | Dice Coefficient | 0.2 | null | 26.33 | 33.34 | 28 |
4 | Dice Coefficient | 0.3 | null | 21.13 | 41.96 | 27 |
5 | Dice Coefficient | 0.4 | null | 13.32 | 54.74 | 40 |
6 | Dice Coefficient | 0.5 | null | 8.77 | 63.78 | 73 |
7 | Jaro Distance | 0.0 | null | 43.54 | 16.18 | 69 |
8 | Jaro Distance | 0.1 | null | 34.92 | 21.71 | 69 |
9 | Jaro Distance | 0.2 | null | 14.02 | 32.85 | 65 |
10 | Jaro Distance | 0.3 | null | 3.46 | 44.25 | 68 |
11 | Jaro-Winkler Distance | 0.0 | null | 40.83 | 17.63 | 536 |
12 | Jaro-Winkler Distance | 0.1 | null | 21.56 | 28.85 | 475 |
13 | Jaro-Winkler Distance | 0.2 | null | 6.59 | 44.12 | 496 |
14 | Jaro-Winkler Distance | 0.3 | null | 3.66 | 49.65 | 569 |
15 | Soft Jaccard Coefficient | 0.0 | 2 | 39.84 | 21.32 | 18 |
16 | Soft Jaccard Coefficient | 0.1 | 2 | 39.64 | 21.92 | 20 |
17 | Soft Jaccard Coefficient | 0.2 | 2 | 36.45 | 26.43 | 52 |
18 | Soft Jaccard Coefficient | 0.3 | 2 | 30.79 | 31.68 | 97 |
19 | Soft Jaccard Coefficient | 0.4 | 2 | 19.78 | 40.28 | 140 |
20 | Soft Jaccard Coefficient | 0.5 | 1 | 14.49 | 50.61 | 184 |
21 | Soft Jaccard Coefficient | 0.5 | 2 | 11.47 | 52.12 | 202 |
22 | Soft Jaccard Coefficient | 0.5 | 3 | 4.93 | 52.69 | 176 |
23 | Soft Jaccard Coefficient | 0.5 | 4 | 2.88 | 40.17 | 237 |
24 | Soft Jaccard Coefficient | 0.5 | 5 | 1.93 | 27.47 | 196 |
25 | All combined | | | 1.11 | 75.31 | NA |
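
The PR% and RE% columns in the table above can be read as precision and recall percentages of the suggested links. That reading is an assumption here, since the column definitions are not included in this text. Under that assumption, a minimal sketch of how such figures could be computed from a set of suggested links and a reference alignment is given below. The representation of a link as a `(source, target)` pair, the function name and the sample identifiers are all hypothetical.

```python
from typing import Hashable, Iterable, Tuple

Link = Tuple[Hashable, Hashable]  # hypothetical (source entity, target entity) pair


def precision_recall(suggested: Iterable[Link], reference: Iterable[Link]) -> Tuple[float, float]:
    """Return (precision, recall) as fractions in [0, 1]."""
    suggested_set = set(suggested)
    reference_set = set(reference)
    # A suggestion counts as correct when it also appears in the reference alignment.
    true_positives = len(suggested_set & reference_set)
    precision = true_positives / len(suggested_set) if suggested_set else 0.0
    recall = true_positives / len(reference_set) if reference_set else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up identifiers, for illustration only.
    suggested = {("a:1", "b:1"), ("a:2", "b:2"), ("a:3", "b:9")}
    reference = {("a:1", "b:1"), ("a:2", "b:2"), ("a:4", "b:4"), ("a:5", "b:5")}
    pr, re = precision_recall(suggested, reference)
    print(f"PR% = {100 * pr:.2f}, RE% = {100 * re:.2f}")
```

Read this way, the table shows the usual trade-off. Within each measure, recall rises with the threshold while precision falls, and the "All combined" row has the highest recall and close to the lowest precision.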
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Karampatakis, S.; Bratsas, C.; Zamazal, O.; Filippidis, P.M.; Antoniou, I. Alignment: A Hybrid, Interactive and Collaborative Ontology and Entity Matching Service. Information 2018, 9, 281. https://doi.org/10.3390/info9110281