Artificial Intelligence for Digital Heritage Innovation: Setting up a R&D Agenda for Europe
Abstract
:1. Introduction
- To highlight the scope of ML in CH and innovation;
- To present the state of the art of ML technologies for CH;
- To identify challenges, risks, and opportunities;
- To draft a mitigation strategy and agenda for AI in CH and innovation.
1.1. Methodology
1.2. Definitions
- Artificial intelligence (AI) refers to the development of computer systems capable of performing tasks that would typically require human intelligence, such as pattern and speech recognition, game playing and decision-making, problem-solving, and learning from data (cf. [21,22]). AI encompasses subfields, including ML, natural language processing, computer vision, and robotics. AI is now being used across all disciplines, including information science, mathematics, medical science, geoscience, physics, and chemistry [23].
- Machine learning (ML) is a subset of AI that focuses on developing algorithms and models that enable computers to learn from experience without being explicitly programmed. ML algorithms learn patterns and relationships from large datasets and use this knowledge to make predictions, classify data, or make decisions (cf. [24]). ML is traditionally divided into three categories: supervised, unsupervised, and reinforcement learning [25]. An algorithm learns from labeled training data to make predictions or decisions in supervised learning. The goal is to learn a mapping function to accurately predict the correct output label for new, unseen input data. Unsupervised learning aims to find structure and regularity in an unlabeled dataset. In reinforcement learning, the algorithm learns a policy for maximizing rewards given as feedback within a dynamic environment [26,27]. While originally algorithmic approaches were used for solving ML problems, the advent of deep learning and neural networks almost completely replaced these traditional methods [28].
- Big data refers to large and complex datasets that cannot be effectively processed or analysed using traditional data processing techniques (cf. [29,30]). In contrast to other approaches, big data processes full-scale data instead of samples to uncover patterns, trends, and insights. Big data often involves using advanced technologies and techniques, such as distributed computing and data mining.
2. Application Fields of AI in CH
- Image analysis and restoration: AI algorithms can analyze and restore old, damaged, or degraded (moving) images, sounds, paintings, and photographs. These algorithms can enhance image quality, remove noise, and even reconstruct missing parts of the artwork, aiding in preserving and restoring cultural artifacts. Examples listed in [27] are the prediction of the painting’s style, genre, and artist, the detection of fake artworks by stroke analysis, and the artistic style transfer using adversarial networks to regularize the generation of stylized images.” Further research deals with the automatic colorization of images [31] and the restoration of ancient mosaics [32].
- Object recognition and classification: AI-powered computer vision techniques enable automatic recognition and classification of cultural objects. By analyzing visual features and patterns, AI algorithms can identify and categorize artifacts, sculptures, and architectural elements [33], facilitating the organization and cataloging of museum collections. Examples are the prediction of color metadata, e.g., for textile objects [34], of technique, timespan, material, and place metadata for European silk fabrics [35], and the recognition and classification of symbols in ancient papyri [36].
- Translation and transcription: AI language models are capable of translating. e.g., ancient texts, inscriptions, and manuscripts into modern languages. They can also be used for modern languages by translating metadata or full-text content of heritage objects and related information, making sharing cultural heritage across languages easier. Other models can transcribe handwritten texts, allowing researchers and historians to access and understand historical documents and perform automated analysis (e.g., [37]).
- Automatic text analysis: This comprises various approaches [38]. An example is the automatic semantic indexing of pre-structured historical texts, which enables historians to mine large amounts of text and data to gain a deeper understanding of the sources (e.g., [39]); for example, tax lists or registers of letters sent to a historical entity [40].
- Virtual Reality (VR) and Augmented Reality (AR): AI technology supports the creation of immersive VR and AR experiences for CH sites and museums. Visitors can virtually explore ancient ruins, historical sites, or museum exhibitions, interacting with AI-generated virtual characters or objects to enhance their understanding and engagement with the cultural context [41,42].
- Recommender systems for personalized experiences: AI algorithms can analyze user preferences, historical data, and contextual information to provide personalized recommendations for CH experiences. Despite the risks of information filtering (e.g., [43]), use is to suggest relevant exhibits, customized tours, or tailored content, AI-powered recommender systems enhance visitor engagement and satisfaction, or—triggered by the advent of large language models (LLMs) such as GPT—dialogue and chatbot systems. Examples are the use of chatbots in museums [44,45] or recommender systems for CH collections (e.g., [46,47]).
- Cultural content analysis and interpretation: AI techniques, such as natural language processing (NLP), are used to analyze large volumes of cultural content, including literature, music, and artwork. This analysis can reveal patterns, themes, and cultural influences, providing valuable insights into historical contexts and artistic movements. Examples are metadata enrichment (e.g., [48,49,50]) and linking to open data sources (e.g., [33]).
- Heritage digitization and preservation: AI can be crucial in digitizing cultural artifacts and archives. By automating digitization processes and extracting knowledge, AI speeds up the preservation of CH, allowing researchers and the public to explore and study rare artifacts remotely. Several articles provide an overview of particular technologies, e.g., for 3D acquisition, such as laser scanning [51] or photogrammetry [52], and quantify their use [53]. AI-powered systems can monitor and analyze CH site environmental conditions, helping with early detection of potential threats such as humidity, temperature fluctuations, and structural damage. This real-time monitoring aids in the proactive conservation and protection of cultural landmarks (e.g., [54,55]).
3. Project Examples
4. AI Technologies for CH State of the Art
- Fiorucci et al. analyzed the current situation on AI for CH in 2020 with regard to both ML approaches and application examples [27].
- The EuropeanaTech AI task force conducted a survey amongst professionals to examine the usage and prospects of AI in that field [66].
- A curated list of policy documents—with only a few links to CH currently–is maintained by the Council of Europe [4].
4.1. AI and Images
- Content-based image retrieval: Efficient retrieval and exploration of historical images based on visual similarity and content-based features. However, traditional ML technologies currently require large-scale training data [27,72,73,74], which are only capable of recognizing well-documented and visually distinctive landmark buildings [62] but fail to deal with less distinctive architecture, such as houses of similar style. Even using more advanced ML approaches or combining different algorithms [75] only allows the realization of prototypic scenarios [76,77].
- Image-based localization: Connecting images with the 3D world relevant for AR/VR applications requires estimating the original six-degree-of-freedom (6DOF) camera pose. While several methods exist for homogeneous image blocks [78,79], the problem becomes increasingly complex for varying radiometric and geometric conditions, especially relevant for historical photographs [80].
- Image recognition and classification: Identifying objects, scenes, or people depicted in historical images using deep learning models, such as CNNs. This field ranges from the detection of WW2 bomb craters in historical aerial images [81], via historical photo content analysis [82] to historical map segmentation [83,84,85].
4.2. AI and Text
- NLP techniques: Named entity recognition, part-of-speech tagging, sentiment analysis, and topic modeling. The most recent applications of CNNs and Transformer [93] are consistently successful in accurately extracting and reducing the number of errors even with unsupervised pre-training.
- Text classification algorithms: Naive Bayes, Support Vector Machines, and Random Forests.
- Sequence models: Hidden Markov models, conditional random fields, and recurrent neural networks.
4.3. AI and Virtual 3D Objects
- Object recognition and classification and semantic segmentation: In 3D/4D reconstruction of CH, ML-based technologies are currently used primarily for specific tasks. This involves AI models to identify specific architectural elements, artifacts, or decorative motifs, to recognize specific objects [72,73,74,96], and to preselect imagery [97,98]. Other tasks include AI-based semantic segmentation techniques to partition 3D models into meaningful regions or components [99].
- 3D model creation: Research has focused on developing AI-based algorithms for efficient and accurate 3D reconstruction of CH objects, buildings, and sites. Traditional algebraic approaches, as in photogrammetry, employ algorithms within equations, e.g., to detect, describe, and match geometric features in images [100] and to create 3D models. ML approaches are currently heavily researched and used for image and 3D point cloud analytics in CH (recent overview: [27]), but increasingly for 3D modeling tasks. Generative adversarial networks (GAN), a combination of the proposal and assessment components of ML, are frequently employed as approximative techniques in 3D modeling, e.g., for single photo digitization [101], completion of incomplete 3D digitized models [102,103] or photo-based reconstructions [104]. Recent approaches include neural radiance fields (NeRF) [105,106,107,108], which have shown strength in creating 3D geometries from sparse and heterogeneous imagery and short processing time [109,110].
- Image to visualization approaches: Approaches bypass the modeling stage to generate visualizations directly from imagery [72,111,112], e.g., by transforming or assembling image content (recent image generators like DALL-E [113], Stable Diffusion or Midjourney). Other approaches based on NeRF to predict shifting spatial perspectives even from single images [114] can predict 3D geometries.
- Use of ML algorithms to detect patterns, anomalies, or changes over time within 3D models (e.g., [54]). The analysis involves assessing the effectiveness of AI in extracting meaningful information from large-scale 3D datasets, supporting archaeological research, conservation efforts, or architectural analysis.
4.4. AI and Maps
4.5. AI and Music
- Automated music classification utilizes computer algorithms and ML techniques to automatically categorize music into classes or genres based on features extracted from the music data. Automated music classification has various applications, such as organizing music libraries and archives, and assisting in music research. Music-related classification tasks include mood classification, artist identification, instrument recognition, music annotation, and genre classification. For instance, one study investigates automatic music genre classification model creation using ML [135].
- Optical Music Recognition (OMR) research investigates how to computationally read music notation in documents [136]. OMR is a challenging process that differs in difficulty from OCR and handwritten text recognition because of the properties of music notation as a contextual writing system. First, the visual expression of music is very diverse. For instance, the Standard Music Font Layout [137] lists over 2440 recommended characters and several hundred optional glyphs. Second, it is only their configuration—how they are placed and arranged on the staves and with respect to each other—that specifies what notes should be played. The two main goals of OMR are:
- 1.
- 2.
- Recovering musical semantics (i.e., the notes, represented by their pitches, velocities, onsets, and durations). MIDI [140] would be an appropriate output representation for this goal.
- Automatic Music Transcription (AMT) is the process of automatically converting audio recordings of music into symbolic representations, such as sheet music (e.g., MusicXML or MEI) or MIDI files. AMT is a very useful tool for music analysis. AMT comprises several subtasks: (multi-)pitch estimation, onset and offset detection, instrument recognition, beat and rhythm tracking, interpretation of expressive timing and dynamics, and score typesetting. Due to the very nature of music signals, which often contain several sound sources that produce one or more concurrent sound events that are meant to be highly correlated over both time and frequency, AMT is still considered a challenging and open problem [141].
4.6. AI and Audiovisual Material
- Digitization and restoration: AI assists in digitizing and restoring deteriorating audiovisual materials, improving their quality and preserving their historical significance.
- Video summaries: Can speed up the process of finding content in audiovisual archives [142].
- Content analysis and knowledge extraction: AI algorithms analyze audio and visual elements within content to identify patterns, objects, scenes, speakers, and other relevant information. It can also help to spot biases and contentious terms and track semantic drift in metadata, supporting curators, cataloguers, and others in deciding on potentially updating catalog records [143].
- Metadata enhancement: AI enriches metadata for better content organization, search, and context by extracting keywords or using LLMs to organize and enrich metadata records at scale.
- Transcription and translation: AI-powered speech-to-text transcription and translation services make audiovisual content more accessible and understandable to a wider audience [144].
- Partial audio matching: Supports framing analysis in identifying segments in one source audio file that are identical to segments in another target audio file. Framing analysis can reveal patterns and biases in the way content is being recontextualized in the media to shape public discourse [145].
- Cross-modal analysis: AI techniques analyze both audio and visual components of content, facilitating holistic interpretation and understanding.
- Interactive storytelling and content-generation interfaces: AI-powered interactive narratives and documentaries engage users with historical events and cultural context. AI can further enhance access by using fine-grained and time-based data extracted by AI systems as a basis for creating “generous interfaces” that allow for the rich exploration of CH collections [146,147] and using conversational speech to provide new ways of interacting with audiovisual collections [148].
5. Challenges and Opportunities for AI and CH
5.1. Quality
5.2. Quantity and Historical Singularity
5.3. Time and Temporal Transition
5.4. Transparency and Explainable Artificial Intelligence for History and Heritage
5.5. Ethical Considerations and Bias
5.6. Data Availability, Accessibility and Quality
5.7. Interdisciplinary Collaboration
5.8. Education
5.9. Customization
5.10. AI for CH as a Business Sector
6. Strategy and Agenda for Digital Heritage Innovation
- The FUTURES4EUROPE, conducted on behalf of the European Commission DG RTD, was a Delfi-like expert review to identify and scope future AI directions [165]
- The Millennium Project developed ideas, strategies, and global governance models for Artificial General Intelligence (AGI) [177].
- AI for archives [178] provide views and demands of this particular subdomain of the heritage sector.
- The Time Machine FET-Flagship CSA conducted various workshops, surveys and scoping activities in 2019 and 2020 to develop a roadmap for large-scale research initiatives [179].
- The ARCHE project reviewed future-oriented literature spanning the environment, economics, health, education, arts and culture, and heritage to identify megatrends, cross-cutting themes and possible opportunities for action for the heritage sector [180]
- ELISE’s 2021 Strategic Research Agenda set out the research challenges that needed to be addressed to strengthen the technical capabilities of AI, improve its performance in deployment, and align AI development with societal interests [181]
7. Summary
7.1. Discussion
7.2. Limitations and Implications
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Acknowledgments
Conflicts of Interest
References
- Europeana. Report on ENUMERATE Core Survey 4. 2017. Available online: https://pro.europeana.eu/files/Europeana_Professional/Projects/Project_list/ENUMERATE/deliverables/DSI-2_Deliverable%20D4.4_Europeana_Report%20on%20ENUMERATE%20Core%20Survey%204.pdf (accessed on 1 December 2023).
- EIF. Market Analysis of The Cultural And Creative Sectors in Europe; EIF: Brussels, Belgium, 2018. [Google Scholar]
- AI4LAM. AI4LAM Resources. Available online: https://sites.google.com/view/ai4lam/ai-registry/resources?authuser=0 (accessed on 1 December 2023).
- Council of Europe. Artificial Intelligence—Council of Europe’s Work in Progress. Available online: https://www.coe.int/en/web/artificial-intelligence/work-in-progress (accessed on 21 August 2023).
- Heritage Research Hub. Library. Available online: https://www.heritageresearch-hub.eu/library/ (accessed on 1 December 2023).
- Braun, V.; Clarke, V. Using thematic analysis in psychology. Qual. Res. Psychol. 2006, 3, 77. [Google Scholar] [CrossRef]
- Braun, V.; Clarke, V. Reflecting on reflexive thematic analysis. Qual. Res. Sport Exerc. Health 2019, 11, 589–597. [Google Scholar] [CrossRef]
- Morgan, D.L.; Nica, A. Iterative Thematic Inquiry: A New Method for Analyzing Qualitative Data. Int. J. Qual. Methods 2020, 19, 1609406920955118. [Google Scholar] [CrossRef]
- UNESCO. Draft Medium Term Plan 1990–1995; UNESCO: London, UK, 1989. [Google Scholar]
- UNESCO. Concept of Digital Heritage; UNESCO: London, UK, 2018. [Google Scholar]
- Georgopoulos, A. CIPA’s Perspectives on Cultural Heritage. In Digital Research and Education in Architectural Heritage. In Proceedings of the 5th Conference, DECH 2017, and First Workshop, UHDL 2017, Dresden, Germany, 30–31 March 2017; Revised Selected Papers. Münster, S., Friedrichs, K., Niebling, F., Seidel-Grzinska, A., Eds.; Springer: Cham, Switzerland, 2018; pp. 215–245. [Google Scholar]
- Ch’ng, E.; Gaffney, V.; Chapman, H. Visual Heritage in the Digital Age; Springer: London, UK, 2013. [Google Scholar]
- European Commission. National/Regional Innovation Strategies for Smart Specialisation; Cohesion Policy 2014–2020; European Commission: Brussels, Belgium, 2014.
- CHCfE Consortium. Cultural Heritage Counts for Europe. Available online: https://www.europanostra.org/tag/cultural-heritage-counts-for-europe/ (accessed on 21 August 2023).
- Baregheh, A.; Rowley, J.; Sambrook, S. Towards a multidisciplinary definition of innovation. Manag. Decis. 2009, 47, 1323–1339. [Google Scholar] [CrossRef]
- Granstrand, O.; Holgersson, M. Innovation ecosystems: A conceptual review and a new definition. Technovation 2020, 90, 102098. [Google Scholar] [CrossRef]
- Mignosa, A. Theory and Practice of Cultural Heritage Policy. In The Artful Economist: A New Look at Cultural Economics; Rizzo, I., Towse, R., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 227–244. [Google Scholar] [CrossRef]
- European Parliamant. Cultural Heritage in EU Policies; Briefing; European Parliamant: Brussels, Belgium, 2019.
- Joshi, A.V. Introduction to AI and ML. In Machine Learning and Artificial Intelligence; Joshi, A.V., Ed.; Springer International Publishing: Cham, Switzerland, 2020; pp. 3–7. [Google Scholar] [CrossRef]
- Helm, J.M.; Swiergosz, A.M.; Haeberle, H.S.; Karnuta, J.M.; Schaffer, J.L.; Krebs, V.E.; Spitzer, A.I.; Ramkumar, P.N. Machine learning and artificial intelligence: Definitions, applications, and future directions. Curr. Rev. Musculoskelet. Med. 2020, 13, 69–76. [Google Scholar] [CrossRef] [PubMed]
- Russell, S.J. Artificial Intelligence a Modern Approach; Pearson Education, Inc.: Indianapolis, IN, USA, 2010. [Google Scholar]
- Hunt, E.B. Artificial Intelligence; Academic Press: Cambridge, MA, USA, 1975. [Google Scholar]
- Xu, Y.; Liu, X.; Cao, X.; Cai, Z.; Wang, F.; Zhang, J. Artificial intelligence: A powerful paradigm for scientific research. Innov. 2021, 2, 100179. [Google Scholar] [CrossRef] [PubMed]
- Bishop, C.M. Pattern Recognition and Machine Learning; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
- Murphy, K.P. Probabilistic Machine Learning: An Introduction; MIT Press: Cambridge, MA, USA, 2022. [Google Scholar]
- Wang, H.-n.; Liu, N.; Zhang, Y.-y.; Feng, D.-w.; Huang, F.; Li, D.-s.; Zhang, Y.-m. Deep reinforcement learning: A survey. Front. Inf. Technol. Electron. Eng. 2020, 21, 1726–1744. [Google Scholar] [CrossRef]
- Fiorucci, M.; Khoroshiltseva, M.; Pontil, M.; Traviglia, A.; Del Bue, A.; James, S. Machine Learning for Cultural Heritage: A Survey. Pattern Recognit. Lett. 2020, 133, 102–108. [Google Scholar] [CrossRef]
- Alpaydin, E. Introduction to Machine Learning; MIT: Cambridge, MA, USA, 2020. [Google Scholar]
- Sagiroglu, S.; Sinanc, D. Big data: A review. In Proceedings of the 2013 International Conference on Collaboration Technologies and Systems (CTS), San Diego, CA, USA, 20–24 May 2013; pp. 42–47. [Google Scholar]
- Wu, X.; Zhu, X.; Wu, G.-Q.; Ding, W. Data mining with big data. IEEE Trans. Knowl. Data Eng. 2013, 26, 97–107. [Google Scholar]
- Farella, E.M.; Malek, S.; Remondino, F. Colorizing the Past: Deep Learning for the Automatic Colorization of Historical Aerial Images. J. Imaging 2022, 8, 269. [Google Scholar] [CrossRef]
- Moral-Andrés, F.; Merino-Gómez, E.; Reviriego, P.; Lombardi, F. Can Artificial Intelligence Reconstruct Ancient Mosaics? Stud. Conserv. 2022, 1–14. [Google Scholar] [CrossRef]
- Bassier, M.; Bonduel, M.; Derdaele, J.; Vergauwen, M. Processing existing building geometry for reuse as Linked Data. Autom. Constr. 2020, 115, 103180. [Google Scholar] [CrossRef]
- Europeana pro. CRAFTED: Enrich and Promote Traditional and Contemporary Crafts. Available online: https://pro.europeana.eu/project/crafted (accessed on 1 December 2023).
- Rei, L.; Mladenic, D.; Dorozynski, M.; Rottensteiner, F.; Schleider, T.; Troncy, R.; Lozano, J.S.; Salvatella, M.G. Multimodal metadata assignment for cultural heritage artifacts. Multimed. Syst. 2023, 29, 847–869. [Google Scholar] [CrossRef]
- Haliassos, A.; Barmpoutis, P.; Stathaki, T.; Quirke, S.; Constantinides, A. Classification and Detection of Symbols in Ancient Papyri. In Visual Computing for Cultural Heritage; Liarokapis, F., Voulodimos, A., Doulamis, N., Doulamis, A., Eds.; Springer International Publishing: Cham, Switzerlamd, 2020; pp. 121–140. [Google Scholar] [CrossRef]
- Nockels, J.; Gooding, P.; Ames, S.; Terras, M. Understanding the application of handwritten text recognition technology in heritage contexts: A systematic review of Transkribus in published research. Arch. Sci. 2022, 22, 367–392. [Google Scholar] [CrossRef]
- Elsevier. Automatic Text Analysis (Compendium). Available online: https://www.sciencedirect.com/topics/social-sciences/automatic-text-analysis (accessed on 1 December 2023).
- He, S.; Samara, P.; Burgers, J.; Schomaker, L. A multiple-label guided clustering algorithm for historical document dating and localization. IEEE Trans. Image Process. 2016, 25, 5252–5265. [Google Scholar] [CrossRef] [PubMed]
- Beckstein, C.; Gramsch-Stehfest, R.; Beck, C.; Engelhardt, J.; Knüpfer, C.; Jauch, O. Digitale Prosopographie. Automatisierte Auswertung und Netzwerkanalyse eines Quellenkorpus zur Geschichte gelehrter deutscher Eliten des 15. Jahrhunderts. In Digital History. Konzepte. Methoden und Kritiken digitaler Geschichtswissenschaften; Döring, K.D., Haar, S., König, M., Wettlaufer, J., Eds.; De Gruyter: Oldenbourg, Germany, 2022; pp. 151–170. [Google Scholar]
- Muenster, S. Digital 3D Technologies for Humanities Research and Education: An Overview. Appl. Sci. 2022, 12, 2426. [Google Scholar] [CrossRef]
- Russo, M. AR in the Architecture Domain: State of the Art. Appl. Sci. 2021, 11, 6800. [Google Scholar] [CrossRef]
- Nechushtai, E.; Lewis, S.C. What kind of news gatekeepers do we want machines to be? Filter bubbles, fragmentation, and the normative dimensions of algorithmic recommendations. Comput. Hum. Behav. 2019, 90, 298–307. [Google Scholar] [CrossRef]
- Schaffer, S.; Ruß, A.; Sasse, M.L.; Schubotz, L.; Gustke, O. Questions and answers: Important steps to let AI chatbots answer questions in the museum. In Proceedings of the International Conference on ArtsIT, Interactivity and Game Creation; Springer: Berlin/Heidelberg, Germany, 2021; pp. 346–358. [Google Scholar]
- Bongini, P.; Becattini, F.; Del Bimbo, A. Is GPT-3 All You Need for Visual Question Answering in Cultural Heritage? In Proceedings of the European Conference on Computer Vision; Springer: Cham, Switzerland, 2022; pp. 268–281. [Google Scholar]
- Casillo, M.; Colace, F.; Conte, D.; Lombardi, M.; Santaniello, D.; Valentino, C. Context-aware recommender systems and cultural heritage: A survey. J. Ambient Intell. Humaniz. Comput. 2023, 14, 3109–3127. [Google Scholar] [CrossRef]
- Pavlidis, G. Recommender systems, cultural heritage applications, and the way forward. J. Cult. Herit. 2019, 35, 183–196. [Google Scholar] [CrossRef]
- Bai, Z.; Nakashima, Y.; Garcia, N. Explain me the painting: Multi-topic knowledgeable art description generation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 5422–5432. [Google Scholar]
- Cetinic, E. Iconographic Image Captioning for Artworks. In Proceedings of the ICPR International Workshops and Challenges, Virtual Event, 10–15 January 2021; Springer: Cham, Switzerland, 2021; pp. 502–516. [Google Scholar]
- Münster, S. Advancements in 3D Heritage Data Aggregation and Enrichment in Europe: Implications for Designing the Jena Experimental Repository for the DFG 3D Viewer. Appl. Sci. 2023, 13, 9781. [Google Scholar] [CrossRef]
- Di Stefano, F.; Chiappini, S.; Gorreja, A.; Balestra, M.; Pierdicca, R. Mobile 3D scan LiDAR: A literature review. Geomat. Nat. Hazards Risk 2021, 12, 2387–2429. [Google Scholar] [CrossRef]
- Remondino, F.; Nocerino, E.; Toschi, I.; Menna, F. A Critical Review of Automated Photogrammetric Processing of Large Datasets. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, XLII-2/W5, 591–599. [Google Scholar] [CrossRef]
- European Commission. Study on Quality in 3D Digitisation of Tangible Cultural Heritage: Mapping Parameters, Formats, Standards, Benchmarks, Methodologies, and Guidelines; VIGIE 2020/654 Final Study Report; European Commission: Brussels, Belgium, 2022.
- Mishra, M. Machine learning techniques for structural health monitoring of heritage buildings: A state-of-the-art review and case studies. J. Cult. Herit. 2021, 47, 227–245. [Google Scholar] [CrossRef]
- Tejedor, B.; Lucchi, E.; Bienvenido-Huertas, D.; Nardi, I. Non-destructive techniques (NDT) for the diagnosis of heritage buildings: Traditional procedures and futures perspectives. Energy Build. 2022, 263, 112029. [Google Scholar] [CrossRef]
- Münster, S.; Bruschke, J.; Hoppe, S.; Maiwald, F.; Niebling, F.; Pattee, A.; Utescher, R.; Zarriess, S. Multimodal AI Support of Source Criticism in the Humanities. In Proceedings of the ADHO DH 2022, Tokyo, Japan, 25–29 July 2022. [Google Scholar]
- Ukolov, D. Reviving the Sounds of Sacral Environments: Personalized Real-Time Auralization and Visualization of Location-Based Virtual Acoustic Objects on Mobile Devices; Springer: Cham, Switzerland, 2023; pp. 165–186. [Google Scholar]
- Dimitropoulos, K.; Tsalakanidou, F.; Nikolopoulos, S.; Kompatsiaris, I.; Grammalidis, N.; Manitsaris, S.; Denby, B.; Crevier-Buchman, L.; Dupont, S.; Charisis, V.; et al. A Multimodal Approach for the Safeguarding and Transmission of Intangible Cultural Heritage: The Case of i-Treasures. IEEE Intell. Syst. 2018, 33, 3–16. [Google Scholar] [CrossRef]
- Bocyte, R.; Oomen, J. Content Adaptation, Personalisation and Fine-grained Retrieval: Applying AI to Support Engagement with and Reuse of Archival Content at Scale. In Proceedings of the 12th International Conference on Agents and Artificial Intelligence (ICAART 2020), Valletta, Malta, 22–24 February 2020. [Google Scholar] [CrossRef]
- Cetinic, E.; She, J. Understanding and Creating Art with AI: Review and Outlook. ACM Trans. Multimedia Comput. Commun. Appl. 2022, 18, 1–22. [Google Scholar] [CrossRef]
- Mikalonytė, E.S.; Kneer, M. Can Artificial Intelligence Make Art?: Folk Intuitions as to whether AI-driven Robots Can Be Viewed as Artists and Produce Art. J. Hum.-Robot Interact. 2022, 11, 43. [Google Scholar] [CrossRef]
- Münster, S.; Lehmann, C.; Lazariv, T.; Maiwald, F.; Karsten, S. Toward an Automated Pipeline for a Browser-Based, City-Scale Mobile 4D VR Application Based on Historical Images. In Proceedings of the Research and Education in Urban History in the Age of Digital Libraries; Springer: Cham, Switzerland, 2019; pp. 106–128. [Google Scholar]
- Gros, A.; Guillem, A.; De Luca, L.; Baillieul, É.; Duvocelle, B.; Malavergne, O.; Leroux, L.; Zimmer, T. Faceting the post-disaster built heritage reconstruction process within the digital twin framework for Notre-Dame de Paris. Sci. Rep. 2023, 13, 5981. [Google Scholar] [CrossRef]
- Directorate-General for Communications Networks; Content and Technology; Izsak, K.; Terrier, A.; Kreutzer, S.; Strähle, T.; Roche, C.; Moretto, M.; Sorensen, S.; Hartung, M.; et al. Opportunities and Challenges of Artificial Intelligence Technologies for the Cultural and Creative Sectors; Publications Office of the European Union: Luxembourg, 2022. [Google Scholar] [CrossRef]
- Pasikowska-Schnass, M. Artificial Intelligence in the Context of Cultural Heritage and Museums: Complex Challenges and New Opportunities; Briefing; European Parliamentary Research Service: Brussels, Belgium, 2023. [Google Scholar]
- EuropeanaTech. AI in relation to GLAMs Task Force. Report and recommendations; Europeana Network ASsociation: The Hague, The Netherlands, 2023. [Google Scholar]
- Gasparini, A.A.; Kautonen, H. Understanding Artificial Intelligence in Research Libraries: An Extensive Literature Review. Liber Q. Te J. Eur. Res. Libr. 2022, 32, 1–36. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
- Jahrer, M.; Grabner, M.; Bischof, H. Learned local descriptors for recognition and matching. Proc. Comput. Vis. Winter Workshop 2008, 2008, 39–46. [Google Scholar]
- Lang, S.; Ommer, B. Attesting similarity: Supporting the organization and study of art image collections with computer vision. Digit. Scholarsh. Humanit. 2018, 33, 845–856. [Google Scholar] [CrossRef]
- Rodríguez-Ortega, N. Image processing and computer vision in the field of art history. In The Routledge Companion to Digital Humanities and Art History; Routledge: London, UK, 2020; pp. 338–357. [Google Scholar]
- n.b. ArchiMediaL. Enriching and Linking Historical Architectural and Urban Image Collections. Available online: http://archimedial.net/ (accessed on 1 December 2023).
- Radovic, M.; Adarkwa, O.; Wang, Q.S. Object Recognition in Aerial Images Using Convolutional Neural Networks. J. Imaging 2017, 3, 21. [Google Scholar] [CrossRef]
- Aiger, D.; Allen, B.; Golovinskiy, A. Large-Scale 3D Scene Classification With Multi-View Volumetric CNN. arXiv 2017, preprint. arXiv:1712.09216. [Google Scholar]
- Maiwald, F.; Lehmann, C.; Lazariv, T. Fully Automated Pose Estimation of Historical Images in the Context of 4D Geographic Information Systems Utilizing Machine Learning Methods. ISPRS Int. J. Geo-Inf. 2021, 10, 748. [Google Scholar] [CrossRef]
- Gominski, D.; Poreba, M.; Gouet-Brunet, V.; Chen, L. Challenging Deep Image Descriptors for Retrieval in Heterogeneous Iconographic Collections. In Proceedings of the 1st Workshop on Structuring and Understanding of Multimedia heritAge Contents, Nice, France, 21 October 2019; pp. 31–38. [Google Scholar]
- Morelli, L.; Bellavia, F.; Menna, F.; Remondino, F. Photogrammetry now and then—From hand-crafted to deep-learning tie points. ISPRS—Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, XLVIII-2/W1-2022, 163–170. [Google Scholar] [CrossRef]
- Sattler, T.; Maddern, W.; Toft, C.; Torii, A.; Hammarstrand, L.; Stenborg, E.; Safari, D.; Okutomi, M.; Pollefeys, M.; Sivic, J.; et al. Benchmarking 6DOF Outdoor Visual Localization in Changing Conditions. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 8601–8610. [Google Scholar]
- Sarlin, P.-E.; Cadena, C.; Siegwart, R.; Dymczyk, M. From Coarse to Fine: Robust Hierarchical Localization at Large Scale. arXiv 2019, arXiv:1812.03506. [Google Scholar]
- Maiwald, F. A Window to the Past through Modern Urban Environments—Developing a Photogrammetric Workflow for the Orientation Parameter Estimation of Historical Images. Ph.D. Thesis, Technische Universität Dresden, Dresden, Germany, 2022. [Google Scholar]
- Kruse, C.; Wittich, D.; Rottensteiner, F.; Heipke, C. Generating impact maps from bomb craters automatically detected in aerial wartime images using marked point processes. ISPRS Open J. Photogramm. Remote Sens. 2022, 5, 100017. [Google Scholar] [CrossRef]
- Chumachenko, K.; Mannisto, A.; Iosifidis, A.; Raitoharju, J. Machine Learning Based Analysis of Finnish World War II Photographers. IEEE Access 2020, 8, 144184–144196. [Google Scholar] [CrossRef]
- Chazalon, J.; Carlinet, E.; Chen, Y.; Perret, J.; Duménieu, B.; Mallet, C.; Géraud, T.; Nguyen, V.; Nguyen, N.; Baloun, J.; et al. ICDAR 2021 Competition on Historical Map Segmentation; Springer: Cham, Switzerland, 2021; pp. 693–707. [Google Scholar]
- Maiwald, F.; Komorowicz, D.; Munir, I.; Beck, C.; Münster, S. Semi-Automatic Generation of Historical Urban 3D Models at a Larger Scale Using Structure-from-Motion, Neural Rendering and Historical Maps; Münster, S., Pattee, A., Kröber, C., Niebling, F., Eds.; Research and Education in Urban History in the Age of Digital Libraries; Springer: Cham, Switzerland, 2023; pp. 107–127. [Google Scholar]
- Vaienti, B.; Petitpierre, R.; di Lenardo, I.; Kaplan, F. Machine-Learning-Enhanced Procedural Modeling for 4D Historical Cities Reconstruction. Remote Sens. 2023, 15, 3352. [Google Scholar] [CrossRef]
- Martinovic, A.; Knopp, J.; Riemenschneider, H.; Van Gool, L. 3d all the way: Semantic segmentation of urban scenes from start to end in 3d. In Proceedings of the IEEE Computer Vision & Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 4456–4465. [Google Scholar]
- Hackel, T.; Wegner, J.D.; Schindler, K. Fast semantic segmentation of 3D point clouds with strongly varying density. ISPRS Ann. 2016, 3, 177–184. [Google Scholar]
- Poterek, Q.; Herrault, P.A.; Skupinski, G.; Sheeren, D. Deep Learning for Automatic Colorization of Legacy Grayscale Aerial Photographs. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 2899–2915. [Google Scholar] [CrossRef]
- Huang, S.; Jin, X.; Jiang, Q.; Liu, L. Deep learning for image colorization: Current and future prospects. Eng. Appl. Artif. Intell. 2022, 114, 105006. [Google Scholar] [CrossRef]
- Michel, J.-B.; Shen, Y.K.; Aiden, A.P.; Veres, A.; Gray, M.K.; Team, T.G.B.; Pickett, J.P.; Hoiberg, D.; Clancy, D.; Norvig, P.; et al. Quantitative Analysis of Culture Using Millions of Digitized Books. Science 2011, 331, 176–182. [Google Scholar] [CrossRef]
- Vidhya, K.A. Text Mining Process, Techniques and Tools: An Overview. Int. J. Inf. Technol. Manag. 2010, 613–622. [Google Scholar]
- Ehrmann, M.; Hamdi, A.; Pontes, E.L.; Romanello, M.; Doucet, A. Named entity recognition and classification in historical documents: A survey. ACM Comput. Surv. 2021, 56, 1–47. [Google Scholar] [CrossRef]
- Rouhou, A.C.; Dhiaf, M.; Kessentini, Y.; Salem, S.B. Transformer-based approach for joint handwriting and named entity recognition in historical document. Pattern Recognit. Lett. 2022, 155, 128–134. [Google Scholar] [CrossRef]
- Utescher, R.; Patee, A.; Maiwald, F.; Bruschke, J.; Hoppe, S.; Münster, S.; Niebling, F.; Zarrieß, S. Exploring Naming Inventories for Architectural Elements for Use in Multimodal Machine Learning Applications. In Proceedings of the Workshop on Computational Methods in the Humanities 2022, Lausanne, Switzerland, 9–10 June 2022. [Google Scholar]
- Drobac, S.; Lindén, K. Optical character recognition with neural networks and post-correction with finite state methods. Int. J. Doc. Anal. Recognit. (IJDAR) 2020, 23, 279–295. [Google Scholar] [CrossRef]
- Khademi, S.; Mager, T.; Siebes, R. Deep Learning from History. In Proceedings of the Research and Education in Urban History in the Age of Digital Libraries, Dresden, Germany, 10–11 October 2021; Springer: Cham, Switzerland, 2021; pp. 213–233. [Google Scholar]
- Münster, S.; Apollonio, F.I.; Bell, P.; Kuroczynski, P.; Di Lenardo, I.; Rinaudo, F.; Tamborrino, R. Digital Cultural Heritage Meets Digital Humanities. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W15, 813–820. [Google Scholar] [CrossRef]
- Bell, P.; Ommer, B. Computer Vision und Kunstgeschichte—Dialog zweier Bildwissenschaften. In Digital Art History; Kuroczynski, P., Bell, P., Dieckmann, L., Eds.; Heidelberg University Press: Heidelberg, Germany, 2019; pp. 61–78. [Google Scholar]
- Russo, M.; Grilli, E.; Remondino, F.; Teruggi, S.; Fassi, F. Machine Learning for Cultural Heritage Classification. In Augmented Reality and Artificial Intelligence in Cultural Heritage and Innovative Design Domain; Franco Angeli: Milan, Italy, 2021. [Google Scholar] [CrossRef]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Kniaz, V.V.; Remondino, F.; Knyaz, V.A. Generative Adversarial Networks for Single Photo 3d Reconstruction. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W9, 403–408. [Google Scholar] [CrossRef]
- Hermoza, R.; Sipiran, I. 3D Reconstruction of incomplete Archaeological Objects using a Generative Adversarial Network. In Proceedings of the Computer Graphics International 2018, Bintan Island, Indonesia, 11–14 June 2018; pp. 5–11. [Google Scholar]
- Nogales Moyano, A.; Delgado Martos, E.; Melchor, Á.; García Tejedor, Á.J. ARQGAN: An evaluation of Generative Adversarial Networks’ approaches for automatic virtual restoration of Greek temples. Expert Syst. Appl. 2021, 180, 115092. [Google Scholar] [CrossRef]
- Microsoft In Culture. See Ancient Olympia brought to life. 2021. Available online: https://unlocked.microsoft.com/ancient-olympia-common-grounds (accessed on 1 December 2023).
- Mildenhall, B.; Srinivasan, P.P.; Tancik, M.; Barron, J.T.; Ramamoorthi, R.; Ng, R. NeRf: Representing scenes as neural radiance fields for view synthesis. Commun. ACM 2021, 65, 99–106. [Google Scholar] [CrossRef]
- Srinivasan, P.P.; Deng, B.; Zhang, X.; Tancik, M.; Mildenhall, B.; Barron, J.T. Nerf: Neural reflectance and visibility fields for relighting and view synthesis. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2021, Virtue, 19–25 June 2021; pp. 7495–7504. [Google Scholar]
- Croce, V.; Caroti, G.; Luca, L.; Piemonte, A.; Véron, P. neural radiance fields (nerf): Review and potential applications to digital cultural heritage. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, XLVIII-M-2-2023, 453–460. [Google Scholar] [CrossRef]
- Kaya, B.; Kumar, S.; Sarno, F.; Ferrari, V.; Gool, L. Neural Radiance Fields Approach to Deep Multi-View Photometric Stereo. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision 2022, Waikoloa, HI, USA, 3–8 January 2022. [Google Scholar]
- Murtiyoso, A.; Grussenmeyer, P. initial assessment on the use of state-of-the-art nerf neural network 3d reconstruction for heritage documentation. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, XLVIII-M-2-2023, 1113–1118. [Google Scholar] [CrossRef]
- Vandenabeele, L.; Häcki, M.; Pfister, M. crowd-sourced surveying for building archaeology: The potential of structure from motion (sfm) and neural radiance fields (nerf). Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, XLVIII-M-2-2023, 1599–1605. [Google Scholar] [CrossRef]
- 4dReply. Closing the 4D Real World Reconstruction Loop. Available online: https://cordis.europa.eu/project/id/770784 (accessed on 8 February 2022).
- Martin-Brualla, R.; Radwan, N.; Sajjadi, M.S.M.; Barron, J.T.; Dosovitskiy, A.; Duckworth, D. NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections. arXiv 2021. preprint. [Google Scholar]
- Cho, J.; Zala, A.; Bansal, M. DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers. CoRR 2022, abs/2202.04053. [Google Scholar]
- Li, Z.; Wang, Q.; Cole, F.; Tucker, R.; Snavely, N. DynIBaR: Neural Dynamic Image-Based Rendering. arXiv 2022, arXiv:2211.11082. [Google Scholar]
- Uhl, J.H.; Leyk, S.; Chiang, Y.-Y.; Knoblock, C.A. Towards the automated large-scale reconstruction of past road networks from historical maps. Comput. Environ. Urban Syst. 2022, 94, 101794. [Google Scholar] [CrossRef]
- Liu, C.; Wu, J.; Kohli, P.; Furukawa, Y. Raster-To-Vector: Revisiting Floorplan Transformation. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2195–2203. [Google Scholar]
- Oliveira, S.A.; Seguin, B.; Kaplan, F. dhSegment: A Generic Deep-Learning Approach for Document Segmentation. In Proceedings of the 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), Niagara Falls, NY, USA, 5–8 August 2018; pp. 7–12. [Google Scholar]
- Ignjatić, J.; Bajic, B.; Rikalovic, A.; Culibrk, D. Deep Learning for Historical Cadastral Maps Digitization: Overview, Challenges and Potential. In Proceedings of the 26th International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision in co-operation with EUROGRAPHICS Association 2018, Delft, The Netherlands, 16–20 April 2018. [Google Scholar] [CrossRef]
- Kartta Labs. 2023. Available online: https://github.com/kartta-labs (accessed on 1 December 2023).
- Petitpierre, R.; Kaplan, F.; di Lenardo, I. Generic Semantic Segmentation of Historical Maps. In CEUR Workshop Proceedings; CEUR-WS: Aachen, Germany, 2021. [Google Scholar]
- Petitpierre, R. Neural networks for semantic segmentation of historical city maps: Cross-cultural performance and the impact of figurative diversity. arXiv 2020, preprint. arXiv:2101.12478. [Google Scholar]
- Tran, A.; Zonoozi, A.; Varadarajan, J.; Kruppa, H. PP-LinkNet: Improving Semantic Segmentation of High Resolution Satellite Imagery with Multi-stage Training. In Proceedings of the 2nd Workshop on Structuring and Understanding of Multimedia heritAge Contents, Seattle, WA, USA, 12 October 2020; pp. 57–64. [Google Scholar]
- Crommelinck, S.; Höfle, B.; Koeva, M.; Yang, M.Y.; Vosselman, G. Interactive Boundary Delineation from UAV data. In Proceedings of the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Riva del Garda, Italy, 4–7 June 2018; pp. 81–88. [Google Scholar]
- Chen, Q.; Wang, L.; Wu, Y.; Wu, G.; Guo, Z.; Waslander, S.L. Aerial imagery for roof segmentation: A largescale dataset towards automatic mapping of buildings. ISPRS J. Photogramm. Remote Sens. 2018, 147, 42–55. [Google Scholar] [CrossRef]
- Crommelinck, S.; Koeva, M.; Yang, M.Y.; Vosselman, G. Application of Deep Learning for Delineation of Visible Cadastral Boundaries from Remote Sensing Imagery. Remote Sens. 2019, 11, 2505. [Google Scholar] [CrossRef]
- Hecht, R.; Meinel, G.; Buchroithner, M.F. Automatic identification of building types based on topographic databases—A comparison of different data sources. Int. J. Cartogr. 2015, 1, 18–31. [Google Scholar] [CrossRef]
- Betsas, T.; Georgopoulos, A. 3D edge detection and comparison using four-channel images. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, XLVIII-2/W2-2022, 9–15. [Google Scholar] [CrossRef]
- Oliveira, S.A.; Lenardo, I.d.; Kaplan, F. Machine Vision Algorithms on Cadaster Plans. In Proceedings of the Conference of the International Alliance of Digital Humanities Organizations (DH 2017), Montreal, QC, Canada, 8–11 August 2017. [Google Scholar]
- Herold, H.; Hecht, R. 3D Reconstruction of Urban History Based on Old Maps; Springer: Cham, Switzerland, 2018; pp. 63–79. [Google Scholar]
- Ares Oliveira, S.; di Lenardo, I.; Tourenc, B.; Kaplan, F. A deep learning approach to Cadastral Computing. In Proceedings of the Digital Humanities Conference, Utrecht, The Netherlands, 8–12 July 2019. [Google Scholar]
- Heitzler, M.; Hurni, L. Cartographic reconstruction of building footprints from historical maps: A study on the Swiss Siegfried map. Trans. GIS 2020, 24, 442–461. [Google Scholar] [CrossRef]
- Available online: https://ismir.net (accessed on 1 December 2023).
- MIRtoolbox. Available online: https://www.jyu.fi/hytk/fi/laitokset/mutku/en/research/materials/mirtoolbox (accessed on 1 December 2023).
- Mignot, R.; Peeters, G. An Analysis of the Effect of Data Augmentation Methods: Experiments for a Musical Genre Classification Task. Trans. Int. Soc. Music. Inf. Retr. 2019, 2, 97–110. [Google Scholar] [CrossRef]
- Tulisalmi-Eskola, J. Automatic Music Genre Classification—Supervised Learning Approach. Master’s Thesis, Metropolia University of Applied Sciences, Helsinki, Finland, 2022. [Google Scholar]
- Calvo-Zaragoza, J.; Jr, J.H.; Pacha, A. Understanding Optical Music Recognition. ACM Comput. Surv. 2020, 53, 1–35. [Google Scholar] [CrossRef]
- Standard Music Font Layout. Available online: https://w3c.github.io/smufl/latest/index.html (accessed on 1 December 2023).
- Music XML. Available online: https://www.musicxml.com (accessed on 1 December 2023).
- Music Encoding Initiative. Available online: https://music-encoding.org (accessed on 1 December 2023).
- Official Midi Specifications. Available online: https://www.midi.org/specifications (accessed on 1 December 2023).
- Benetos, E.; Dixon, S.; Duan, Z.; Ewert, S. Automatic Music Transcription: An Overview. IEEE Signal Process. Mag. 2019, 36, 20–30. [Google Scholar] [CrossRef]
- Tsalakanidou, F. Deliverable 2.3—AI Technologies and Applications in Media: State of Play, Foresight, and Research Directions; AI4Media Project (Grant Agreement No 951911); 2022. Available online: https://www.ai4media.eu/wp-content/uploads/2022/03/AI4Media_D2.3_Roadmap_final.pdf (accessed on 1 December 2023).
- Ferrara, A.; Montanelli, S.; Ruskov, M. Detecting the Semantic Shift of Values in Cultural Heritage Document Collections. Ceur Workshop Proc. 2022, 3286, 35–43. [Google Scholar]
- Van Noord, N.; Olesen, C.; Ordelman, R.; Noordegraaf, J. Automatic Annotations and Enrichments for Audiovisual Archives. In Proceedings of the 13th International Conference on Agents and Artificial Intelligence—Volume 1: ARTIDIGH, Online, 4–6 February 2021; SciTePress: Setúbal, Portugal, 2021; pp. 633–640, ISBN 978-989-758-484-8; ISSN 2184-433X. [Google Scholar] [CrossRef]
- Kemenade, P.v.; Bocyte, R.; Oomen, J. You’ve Been Framed—Partial Audio Matching Functionality to Support Framing Analysis. In DARIAH Annual Event 2023: Cultural Heritage Data as Humanities Research Data? Zenodo: Budapest, Hungary, 2023; Available online: https://zenodo.org/communities/dariahannualevent2023chdata-hrdata/ (accessed on 1 December 2023).
- Wigham, M.; Melgar Estrada, L.; Ordelman, R.J.F. Jupyter Notebooks for Generous Archive Interfaces. In Proceedings of the 2018 IEEE International Conference on Big Data (Big Data), Seattle, WA, USA, 10–13 December 2018; Song, Y., Liu, B., Lee, K., Abe, N., Pu, C., Qiao, M., Ahmed, N., Kossmann, D., Saltz, J., Tang, J., et al., Eds.; pp. 2766–2774. [Google Scholar] [CrossRef]
- Piet, N. Beyond Search; Netherlands Institute for Sound & Vision: Hilversum, The Netherlands, 2023. [Google Scholar] [CrossRef]
- Beelen, T.; Velner, E.; Ordelman, R.; Truong, K.; Evers, V.; Huibers, T. Designing conversational robots with children during the pandemic. arXiv 2022, preprint. arXiv:2205.11300. [Google Scholar]
- Tan, C.; Sun, F.; Kong, T.; Zhang, W.; Yang, C.; Liu, C. A survey on deep transfer learning. In Proceedings of the Artificial Neural Networks and Machine Learning–ICANN 2018: 27th International Conference on Artificial Neural Networks, Rhodes, Greece, 4–7 October 2018; Proceedings, Part III 27. pp. 270–279. [Google Scholar]
- Wang, T.; Gan, V.J. Automated joint 3D reconstruction and visual inspection for buildings using computer vision and transfer learning. Autom. Constr. 2023, 149, 104810. [Google Scholar] [CrossRef]
- Goodarzi, P.; Ansari, M.; Pour Rahimian, F.; Mahdavinejad, M.; Park, C. Incorporating sparse model machine learning in designing cultural heritage landscapes. Autom. Constr. 2023, 155, 105058. [Google Scholar] [CrossRef]
- Salahuddin, Z.; Woodruff, H.C.; Chatterjee, A.; Lambin, P. Transparency of deep neural networks for medical image analysis: A review of interpretability methods. Comput. Biol. Med. 2022, 140, 105111. [Google Scholar] [CrossRef]
- Farella, E.M.; Ozdemir, E.; Remondino, F. 4D Building Reconstruction with Machine Learning and Historical Maps. Appl. Sci. 2021, 11, 1445. [Google Scholar] [CrossRef]
- Rao, R.; Fung, G. On the Dangers of Cross-Validation. An Experimental Evaluation; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2008; pp. 588–596. [Google Scholar] [CrossRef]
- Liu, S.; Bovolo, F.; Bruzzone, L.; Du, Q.; Tong, X. Unsupervised Change Detection in Multitemporal Remote Sensing Images. In Change Detection and Image Time Series Analysis 1; John Wiley & Sons: Hoboken, NJ, USA, 2021; pp. 1–34. [Google Scholar] [CrossRef]
- Zhu, Z. Change detection using landsat time series: A review of frequencies, preprocessing, algorithms, and applications. ISPRS J. Photogramm. Remote Sens. 2017, 130, 370–384. [Google Scholar] [CrossRef]
- Goswami, A.; Sharma, D.; Mathuku, H.; Gangadharan, S.M.P.; Yadav, C.S.; Sahu, S.K.; Pradhan, M.K.; Singh, J.; Imran, H. Change Detection in Remote Sensing Image Data Comparing Algebraic and Machine Learning Methods. Electronics 2022, 11, 431. [Google Scholar] [CrossRef]
- Nebiker, S.; Lack, N.; Deuber, M. Building Change Detection from Historical Aerial Photographs Using Dense Image Matching and Object-Based Image Analysis. Remote Sens. 2014, 6, 8310–8336. [Google Scholar] [CrossRef]
- Henze, F.; Lehmann, H.; Bruschke, B. Nutzung historischer Pläne und Bilder für die Stadtforschungen in Baalbek/Libanon. Photogramm.—Fernerkund.—Geoinf. 2009, 3, 221–234. [Google Scholar] [CrossRef]
- Wang, Y. Change Detection from Photographs. Image Processing; Université Paul Sabatier: Toulouse, France, 2016. [Google Scholar]
- Zhang, T.; Nefs, H.; Heynderickx, I. Change detection in pictorial and solid scenes: The role of depth of field. PLoS ONE 2017, 12, e0188432. [Google Scholar] [CrossRef]
- Noh, H.; Ju, J.; Seo, M.; Park, J.; Choi, D.G. Unsupervised Change Detection Based on Image Reconstruction Loss. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA, 19–20 June 2022; pp. 1351–1360. [Google Scholar]
- Kharroubi, A.; Poux, F.; Ballouch, Z.; Hajji, R.; Billen, R. Three Dimensional Change Detection Using Point Clouds: A Review. Geomatics 2022, 2, 457–485. [Google Scholar] [CrossRef]
- Zielke, T. Is Artificial Intelligence Ready for Standardization? In Proceedings of the Systems, Software and Services Process Improvement: 27th European Conference, EuroSPI 2020, Düsseldorf, Germany, 9–11 September 2020. [Google Scholar]
- FUTURES4EUROPE. General AI. 2021. Available online: https://www.futures4europe.eu/general-ai (accessed on 1 December 2023).
- Amnesty International. The Toronto Declaration: Protecting the Right to Equality and Non-Discrimination in Machine Learning Systems; Amnesty International: London, UK, 2018. [Google Scholar]
- European Commission. Ethics Guidelines for Trustworthy AI; Publications Office: Brussels, Belgium, 2019. Available online: https://data.europa.eu/doi/10.2759/346720 (accessed on 1 December 2023).
- Jakesch, M.; Buçinca, Z.; Amershi, S.; Olteanu, A. How Different Groups Prioritize Ethical Values for Responsible AI. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, Seoul, Republic of Korea, 21–24 June 2022; pp. 310–323. [Google Scholar]
- Madaio, M.; Egede, L.; Subramonyam, H.; Vaughan, J.; Wallach, H. Assessing the Fairness of AI Systems: AI Practitioners’ Processes, Challenges, and Needs for Support. Proc. ACM Hum.-Comput. Interact. 2022, 6, 1–26. [Google Scholar] [CrossRef]
- Pansoni, S.; Tiribelli, S.; Paolanti, M.; Di Stefano, F.; Frontoni, E.; Malinverni, E.S.; Giovanola, B. Artificial intelligence and cultural heritage: Design and assessment of an ethical framework. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2023, XLVIII-M-2-2023, 1149–1155. [Google Scholar] [CrossRef]
- Beaudoin, J.E. An Investigation of Image Users across Professions: A Framework of Their Image Needs, Retrieval and Use. Ph.D. Thesis, Drexel University: Philadelphia, PA, USA, 2009. [Google Scholar]
- Münster, S.; Kamposiori, C.; Friedrichs, K.; Kröber, C. Image libraries and their scholarly use in the field of art and architectural history. Int. J. Digit. Libr. 2018, 19, 367–383. [Google Scholar] [CrossRef]
- Münster, S.; Terras, M. The visual side of digital humanities: A survey on topics, researchers, and epistemic cultures. Digit. Scholarsh. Humanit. 2020, 35, 366–389. [Google Scholar] [CrossRef]
- Münster, S.; Utescher, R.; Ulutas-Aydogan, S. Digital Topics on Cultural Heritage quantified. Built Herit. 2021, 5, 25. [Google Scholar] [CrossRef]
- Muenster, S.; Fritsche, K.; Richards-Risetto, H.; Apollonio, F.; Schwartze, V.; Aehnlich, B.; Smolarski, R. Teaching Digital Heritage and Digital Humanities—A current state and prospects. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2021, XLVI-M-1-2021, 471–478. [Google Scholar] [CrossRef]
- European Commission. Access to Finance for the Cultural and Creative Sectors. Available online: https://digital-strategy.ec.europa.eu/en/policies/finance-cultural-creative-sectors (accessed on 1 December 2023).
- Glenn, J.C. Artificial General Intelligence Issues and Opportunities. 2022. Available online: https://www.millennium-project.org/wp-content/uploads/2023/05/EC-AGI-paper.pdf (accessed on 1 December 2023).
- Colavizza, G.; Blanke, T.; Jeurgens, C.; Noordegraaf, J. Archives and AI: An overview of current debates and future perspectives. ACM J. Comput. Cult. Herit. (JOCCH) 2021, 15, 1–15. [Google Scholar] [CrossRef]
- Time Machine FET-FLAGSHIP-CSA. Time Machine: Big Data of the Past for the Future of Europe. A Proposal to the European Commission for a Large-Scale Research Initiative; Time Machine Organisation: Vienna, Austria, 2020. [Google Scholar]
- Wollentz, G.; Heritage, A.; Morel, H.; Forgesson, S.; Iwasaki, A.; Cadena–Irizar, A. Foresight for Heritage: A Review of Future Change to Shape Research, Policy and Practice; ICCROM: Rome, Italy, 2023. [Google Scholar]
- European Learning and Intelligent Systems Excellence (ELISE) Consortium. Creating a European AI Powerhouse. A Strategic Research Agenda from the European Learning and Intelligent Systems Excellence (ELISE) consortium; ELISE Consortium: Brussels, Belgium, 2021. [Google Scholar]
- Croce, V.; Caroti, G.; De Luca, L.; Jacquot, K.; Piemonte, A.; Véron, P. From the semantic point cloud to heritage-building information modeling: A semiautomatic approach exploiting machine learning. Remote Sens. 2021, 13, 461. [Google Scholar] [CrossRef]
- Zhitomirsky-Geffet, M.; Kizhner, I.; Minster, S. What do they make us see: A comparative study of cultural bias in online databases of two large museums. J. Doc. 2023, 79, 320–340. [Google Scholar] [CrossRef]
- Osoba, O.A.; Welser IV, W. An Intelligence in Our Image: The Risks of Bias and Errors in Artificial Intelligence; Rand Corporation: St. Monica, CA, USA, 2017. [Google Scholar]
- Ulutas Aydogan, S.; Münster, S.; Girardi, D.; Palmirani, M.; Vitali, F. A Framework to Support Digital Humanities and Cultural Heritage Studies Research. In Proceedings of the Workshop on Research and Education in Urban History in the Age of Digital Libraries, Dresden, Germany, 10–11 October 2019; Springer: Cham, Switzerland, 2019; pp. 237–267. [Google Scholar]
- Stamatoudi, I. Research Handbook on Intellectual Property and Cultural Heritage; Edward Elgar Publishing Limited: Cheltenham, UK, 2022. [Google Scholar]
- Münster, S. Digital Cultural Heritage as Scholarly Field—Topics, Researchers and Perspectives from a bibliometric point of view. J. Comput. Cult. Herit. 2019, 12, 22–49. [Google Scholar] [CrossRef]
- Xu, F.; Uszkoreit, H.; Du, Y.; Fan, W.; Zhao, D.; Zhu, J. Explainable AI: A brief survey on history, research areas, approaches and challenges. In Proceedings of the Natural Language Processing and Chinese Computing: 8th CCF International Conference, NLPCC 2019, Dunhuang, China, 9–14 October 2019; Proceedings, Part II 8. pp. 563–574. [Google Scholar]
- Mosqueira-Rey, E.; Hernández-Pereira, E.; Alonso-Ríos, D.; Bobes-Bascarán, J.; Fernández-Leal, Á. Human-in-the-loop machine learning: A state of the art. Artif. Intell. Rev. 2023, 56, 3005–3054. [Google Scholar] [CrossRef]
- Wu, X.; Xiao, L.; Sun, Y.; Zhang, J.; Ma, T.; He, L. A survey of human-in-the-loop for machine learning. Future Gener. Comput. Syst. 2022, 135, 364–381. [Google Scholar] [CrossRef]
- Field, A.P.; Gillett, R. How to do a meta-analysis. Br. J. Math. Stat. Psychol. 2010, 63, 665–694. [Google Scholar] [CrossRef]
Art Transfer by Google Arts & Culture Using AI algorithms, Art Transfer allows users to transform their photos into the style of famous artists such as Van Gogh or Picasso. Link: https://artsandculture.google.com/camera/art-transfer | |
MicroPasts by the British Museum MicroPasts is a project that combines crowd-sourced data with AI technology. Volunteers contribute by digitizing and tagging images while AI algorithms analyze the data. Link: https://micropasts.org/ | |
4Dcity by the University of Jena This application uses AI to automatically 4D reconstruct past cityscapes from historical cadastre plans and photographs. This 4D model is world-scale and enriched by links to texts and information, e.g., from Wikipedia, and accessible as mobile 4D websites [62]. Link: https://4dcity.org/ | |
SCAN4RECO This EU-funded project combines 3D scanning, robotics, and AI to create digital reconstructions of damaged or destroyed CH objects. Link: https://scan4reco.iti.gr/ | |
AI-DA by Aidan Meller Gallery AI-DA is an AI-powered robot artist developed by Aidan Meller Gallery in the United Kingdom. The robot uses AI algorithms to analyze and interpret human facial expressions, creating drawings and paintings inspired by the emotions it perceives. AI-DA’s artworks have been exhibited in galleries across Europe. Link: https://www.ai-darobot.com/ | |
Transkribus by Read Coop SCE Transkribus is a comprehensive solution for digitization, AI-powered text recognition, transcription, and searching historical documents. A specific emphasis is on handwritten text recognition. https://readcoop.eu/transkribus/ | |
Transcribathon The Transcribathon platform is an online crowd-sourcing platform for enriching digitized material from Europeana. It applies the Transkribus handwriting recognition technology to input documents, performs some automatic enrichments (including translation) on the obtained text and metadata, and lets volunteers validate the results. https://transcribathon.eu/ | |
The Next Rembrandt by ING Bank and Microsoft This project employed AI algorithms to analyze Rembrandt’s works and create a new painting in his style. https://www.nextrembrandt.com/ | |
Rekrei (formerly Project Mosul) Rekrei is a crowd-sourcing and AI project aimed at reconstructing CH sites that have been destroyed or damaged. Users can contribute photographs and other data, and AI algorithms help in reconstructing the lost heritage digitally. https://rekrei.org/ | |
Notre Dame reconstruction After a fire destroyed parts of the Notre Dame Cathedral in Paris in 2019, a digital twin model was created to experiment—physical anastylosis, reverse engineering, spatiotemporal tracking assets, and operational research—and create a reconstruction hypothesis. The results demonstrate that the proposed modeling method facilitates the formalization and validation of the reconstruction problem and increases solution performance [63]. https://news.cnrs.fr/articles/a-digital-twin-for-notre-dame | |
Finto AI by the National Library of Finland Finto AI is a service for automated subject indexing. It can be used to suggest subjects for text in Finnish, Swedish, and English. It currently gives suggestions based on concepts of the General Finnish Ontology, YSO. Link: https://ai.finto.fi | |
Europeana Translate This project has trained translation engines on metadata from the common European data space on cultural heritage in order to obtain a service that can translate CH metadata from 22 official EU languages to English, improving the multilingual experience provided to its users. It has been applied to 29 million metadata records so far. Link: https://pro.europeana.eu/post/europeana-translate-project-brings-together-multilingualism-and-cultural-heritage | |
MuseNet by OpenAI MuseNet composes original music in a wide range of styles and genres. It can create music inspired by different cultural traditions and historical periods, demonstrating the potential of AI in generating new compositions that reflect CH. Link: https://openai.com/research/musenet | |
The Hidden Florence by the University of Exeter The Hidden Florence is an AI-enhanced mobile app that guides visitors through the streets of Florence, Italy, offering insights into the city’s rich CH in an engaging way. The app utilizes AI algorithms to provide location-based narratives, AR experiences, and interactive storytelling. Link: https://hiddenflorence.org/ | |
Smartify App by Smartify Smartify utilizes AI to provide interactive experiences with artworks in museums and galleries. The mobile app uses image recognition to identify artworks, delivering detailed information, audio guides, and curated tours. It is compatible with numerous cultural institutions across Europe and beyond. Link: https://smartify.org/ | |
Second Canvas App by Madpixel and the Prado Museum The app uses AI technology to enhance the visitor experience. It provides high-resolution images of artworks, along with interactive features that allow users to explore the details and stories behind the paintings. Link: https://www.secondcanvas.net/ | |
WAIVE WAIVE is a smart DJ system utilizing AI to create unique music samples, beats, and loops from the digitized audio archives of the Netherlands Institute for Sound & Vision. Link: https://www.thunderboomrecords.com/waive |
R&D AGENDA FOR AI FOR CH Understanding the challenges and opportunities of AI and CH Despite much research, a full understanding of how AI and CH could contribute to each other is still limited. The challenge is to understand the specific challenges and opportunities within the field and identify key research questions and problems that AI can address, such as artifact analysis, preservation, restoration, historical context understanding, and public engagement. Vice versa, CH could contribute to the development of AI regarding specific data and problems, problem authoring, and results interpretation. |
Data collection and curation Since data collection and suitable training data is an all-time challenge of AI, CH applications increase the complexity of gathering, annotating, and curating the data to create training sets for AI models taking into account specific CH aspects, e.g., time variance, digitized analog material, or heterogeneous media sets. |
Domain-specific AI challenges CH poses some unique challenges to the development of AI applications:
|
Domain-specific AI applications Develop and fine-tune AI models tailored to CH tasks such as:
|
Cross-domain opportunities CH comprises a wide variety of AI usage scenarios— from tourism to research and education. A cross-cutting demand and prerequisite for employing AI is to make data connectible and, therefore, employ metadata schemes and vocabularies capable of dealing with different data types and domains. |
Context understanding and information enrichment There is an increasing move towards multimodality to include images, texts, and audio into a joint frame of reference, mixed methods combining AI with algebraic approaches, and information enrichment using domain and object-specific understanding to enhance the quality of information (e.g., [182]). Together, these can be used to build AI systems that can contextualize historical artifacts. |
Ethical considerations and transparency Biased collections and dominating cultural narratives have been flagged as a major challenge of CH [183]. AI intensifies this challenge by the tendency to replicate dominant features and create limited explainable results [184]. A resultant challenge is to ensure that AI systems respect cultural sensitivities and do not perpetuate biases [167]. |
Interdisciplinary collaboration CH as a field is marked by high complexity and “fuzzy” problems, which are challenging to transpose into computable approaches [185]. A resultant challenge is to foster collaboration between AI researchers, CH experts, computer scientists, and ethicists to ensure appropriate, high-quality, and meaningful results. |
Human in the loop Dealing with CH is still highly influenced by personal expertise and tacit knowledge [173]. It is therefore important to rigorously evaluate AI models’ performance against established benchmarks and human expertise and continuously improve models based on feedback from domain experts. |
Long-term sustainability Currently, most heritage data, AI models, and resources are held by companies outside Europe [50]. It is a major challenge to ensure the long-term maintenance, availability, and sustainability of AI tools, data, and platforms and foster open-source and open-data initiatives to not lose control and access to heritage and culture. |
Legal and intellectual property considerations CH in Europe is faced with a currently heterogeneous and highly complex legal situation (recently: [185,186]); thus, it is also challenging for AI technologies [4]. A resultant demand is to create and maintain an appropriate legal framework when working with AI for CH. |
AI for heritage education Adequate skills have been named as the most important challenge for heritage institutions in the digital realm [187]. Currently, qualifications and skills are mainly taught within academic programs [175]. Against the background of rapid technological developments CH stakeholders need continuous professional development and lifelong learning to be skilled to assess, apply, and reflect on AI. |
Heritage innovation support Due to the specifics of the heritage sector, most extant programs to support AI implementation in the European innovation landscape are limited and only applicable to this domain [176]. Intermediaries and tailoring of support offers are needed to successfully connect AI infrastructures, technology providers, financers and the CH sector. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Münster, S.; Maiwald, F.; di Lenardo, I.; Henriksson, J.; Isaac, A.; Graf, M.M.; Beck, C.; Oomen, J. Artificial Intelligence for Digital Heritage Innovation: Setting up a R&D Agenda for Europe. Heritage 2024, 7, 794-816. https://doi.org/10.3390/heritage7020038
Münster S, Maiwald F, di Lenardo I, Henriksson J, Isaac A, Graf MM, Beck C, Oomen J. Artificial Intelligence for Digital Heritage Innovation: Setting up a R&D Agenda for Europe. Heritage. 2024; 7(2):794-816. https://doi.org/10.3390/heritage7020038
Chicago/Turabian StyleMünster, Sander, Ferdinand Maiwald, Isabella di Lenardo, Juha Henriksson, Antoine Isaac, Manuela Milica Graf, Clemens Beck, and Johan Oomen. 2024. "Artificial Intelligence for Digital Heritage Innovation: Setting up a R&D Agenda for Europe" Heritage 7, no. 2: 794-816. https://doi.org/10.3390/heritage7020038
APA StyleMünster, S., Maiwald, F., di Lenardo, I., Henriksson, J., Isaac, A., Graf, M. M., Beck, C., & Oomen, J. (2024). Artificial Intelligence for Digital Heritage Innovation: Setting up a R&D Agenda for Europe. Heritage, 7(2), 794-816. https://doi.org/10.3390/heritage7020038