1. Introduction
Archives and museums, as cultural institutions, serve as vital repositories of historical information, preserving and safeguarding invaluable records that illuminate the past and enrich understanding of the present. In providing access to records, these cultural institutions facilitate research, education, and community engagement, promoting a deeper understanding of historical events and cultural heritage [
1]. Moreover, these institutions play a critical role in promoting transparency, accountability, and social justice by preserving records that document the experiences and struggles of marginalised communities [
2]. However, despite their crucial role in society, many archives and museums face significant challenges in reaching and engaging the public effectively [
3]. Lack of resources, including financial, technological, infrastructural, human, and policy constraints, combined with a rapidly changing digital environment, often hinders their ability to communicate their value proposition and make their collections accessible to a wide audience. [
4]. A primary concern is the low public awareness of resources and the services they offer. Many individuals are unaware of the wealth of information resources available within their local archives and museums, resulting in the underutilisation of these valuable resources [
3]. This lack of awareness can be attributed to various factors, including limited outreach efforts, inadequate communication strategies, and a perception of these cultural institutions as inaccessible or irrelevant to contemporary society [
4].
In addition, these cultural institutions are often perceived as dusty repositories of documents, inaccessible to the general public, and primarily serving the needs of professional historians. This perception hinders public engagement and limits the potential of these institutions to serve as vibrant centres for community learning and cultural exchange [
3]. To address these challenges, various scholars [
1,
3,
4,
5,
6] have advocated for efficacious public programming activities. Public programming encompasses a range of planned activities undertaken by cultural institutions to engage the public and foster meaningful connections [
6,
7]. These initiatives provide crucial platforms for community building, knowledge dissemination, and fostering critical discourse [
4,
8]. In line with this focus on public engagement, archives and museums implement a diverse array of tailored initiatives to connect with their respective communities, each designed to meet the specific needs and objectives of diverse audiences. For instance, traditional methods, such as curated exhibitions showcasing their materials, often themed around historical events or individuals, serve as foundational approaches [
9]. Furthermore, expert lectures and presentations facilitate the dissemination of in-depth knowledge and promote intellectual discourse [
10]. Complementing these approaches, hands-on workshops and classes, such as those focused on genealogy or document preservation, provide opportunities for direct interaction with archival and museum materials, thereby enhancing participant engagement [
11]. In addition, guided tours offer personalised explorations of cultural institutions’ collections, led by knowledgeable staff or volunteers [
12]. Moreover, outreach programmes extend beyond these institution’s walls, engaging the community through school visits, public demonstrations, and collaborative partnerships with local organisations, fostering a broader reach [
13,
14].
In the digital age, archives and museums have embraced innovative approaches. For example, online exhibitions utilise interactive technologies and multimedia elements to showcase their materials virtually, reaching a wider audience [
15]. Digital archives also provide online access to digitised materials, often with search functions, transcriptions, and contextual information [
16]. For instance, leveraging the power of social media, platforms like Twitter/X, Facebook, and Instagram serve as valuable tools for sharing archival and museum content, promoting events, and engaging with the public through interactive posts and discussions [
17,
18]. Also, webinars and online lectures offer virtual presentations that enhance accessibility and broaden audience participation [
19]. Moving beyond digital initiatives, community-based approaches foster strong connections between these institutions and their local community [
13,
20]. For example, oral history projects collect and preserve valuable oral testimonies, creating a living record of community experiences [
21]. Moreover, family history research days offer assistance and resources to individuals tracing their ancestry through archival materials [
22]. Further fostering community engagement, collaborative projects, such as community archives or museum initiatives, involve local residents in the identification, collection, and preservation of local historical materials, thereby cultivating a sense of ownership and investment [
12,
13,
23].
Beyond these core initiatives, these cultural institutions utilise a range of additional strategies, including publications such as newsletters and brochures, to disseminate information about their institutions and collections [
7]. Involving the community further, volunteer programmes empower community members to contribute to the mission by assisting with research, outreach, and event planning [
13]. Public programming varies greatly among archives and museums. This depends on their resources, mission, and community needs. However, data analytics can greatly improve how these programs are implemented and evaluated [
24,
25].
Data analytics entails the process of examining large datasets to uncover hidden patterns, trends, and insights that can be used to inform decision-making [
26]. As such, by collecting, analysing, and interpreting data on audience demographics, programme attendance, engagement levels, and feedback, these cultural institutions can gain valuable insights into the effectiveness of their outreach efforts [
27]. Arguably, public programming is essential for bridging the gap between archives, museums, and the public, fostering a deeper understanding of history, and promoting the value of archival or museum collections [
4,
6,
7,
8]. As such, data analytics can play a crucial role in supporting these objectives by providing evidence-based insights that inform programme development, resource allocation, and audience engagement strategies [
26,
27,
28]. For example, by analysing visitor data, archives and museums can identify key audience segments, tailor programmes to specific interests, and measure the impact of different outreach initiatives [
25]. This data-driven approach can help these cultural institutions to optimise their public programming efforts, maximise their impact, and ensure that their valuable resources remain accessible and relevant to diverse communities [
15,
25,
29].
As such, data analytics in archival and museums’ public programming involves using data analysis techniques to enhance outreach and engagement strategies, promoting collections and services to the community by identifying trends and patterns in user behaviour [
7,
30]. This understanding is crucial, given the acknowledged importance of public programming in archives and museums [
4,
6,
7,
8]. However, a significant gap remains in the collective and systematic understanding of how data analytics can be effectively leveraged to support and enhance these efforts. Although research has highlighted the potential benefits of using data analytics [
31], there is a lack of systematic understanding of the current state of knowledge in this area of study. The existing literature provides fragmented insights into the application of data analytics in archival and museum public programming, with studies focusing on specific aspects such as visitor data analysis [
32], or the use of digital technologies to enhance audience engagement [
15,
29].
A review is, therefore, needed to synthesise scholarship, identify gaps, and establish a foundation for future research on data analytics and public programming in archival and museum settings. This scoping review addresses this knowledge gap by examining the extant literature concerning the application of data analytics in these cultural institutions’ public programming. This review synthesises knowledge to help archives and museums develop effective, data-driven public programming for better community engagement and collection promotion. To this end, the following research objectives guided this review:
To describe the current state of knowledge and research on the use of data analytics in archival and museum public programming.
To examine the challenges and limitations of using data analytics in archival and museum public programming.
These objectives guided the direction of the current review. Firstly, describing the current state of knowledge and research provides a foundational understanding of existing work, identifying gaps and opportunities for further investigation. This ensures the research builds upon existing knowledge and avoids unnecessary duplication of effort. Furthermore, this objective allows exploration of the diverse range of data collected and analysed by these cultural institutions. This is crucial for comprehending the full scope and potential impact of data analytics within these institutions’ domain. Investigating the challenges and limitations of using data analytics in these cultural institutions’ public programming is crucial for addressing practical concerns. Thus, in identifying and mitigating potential risks, individuals can develop practical solutions and best practices for overcoming these challenges, eventually enabling these institutions to make informed decisions about the effective and ethical use of data analytics in their public programming initiatives.
3. Results
This Section presents the findings of the study, structured in accordance with the core objectives. The analysis addresses two primary research aims: first, to delineate the current state of knowledge and research concerning the application of data analytics within archival and museum public programming; and second, to investigate the inherent challenges and limitations associated with the implementation of data analytics in this context. The findings presented offer an overview of the existing literature and practical considerations surrounding data-driven approaches to public engagement in these institutions.
To ascertain the current state of knowledge and research regarding the application of data analytics within archival and museum public programming, a chronological analysis of the included studies was conducted, focusing on publication trends over time. This temporal analysis, the results of which are presented in
Figure 2, provided valuable insights into the evolution of scholarly attention to this emerging field. Examining publication trends allowed for the identification of key periods of growth, stagnation, or shifting research priorities, potentially reflecting broader trends in the archival and museum sectors, technological advancements, or evolving research agendas. Furthermore, this analysis helped to contextualise the current body of knowledge by highlighting the historical development of research on data-driven public programming in these institutions. A detailed examination of
Figure 2 revealed the specific patterns and trends observed, contributing to a more nuanced understanding of the current research landscape.
The analysis of publication trends (
Figure 2) reveals a growing interest in the application of data analytics within archival and museum contexts.
Beyond the chronological analysis of publication trends, this study also undertook an examination of the diverse data analytics techniques and their focus within archival and museum public programming.
Table 2 shows the results of the analysis.
Table 2 reveals a diverse field of data analytics applications within archives and museums. These applications vary considerably; for instance, they range from enhancing access using semantic web technologies to analysing visitor behaviour patterns. The analytical techniques employed are equally varied. Alongside established methods like web analytics and statistical analysis, institutions are increasingly utilising advanced tools such as machine learning and deep learning. Crucially, these techniques are not applied in isolation. Instead, they serve several interconnected purposes, including improving online access, optimising archive management, facilitating a deeper understanding of visitors, and promoting effective public engagement.
Complementing the analysis of data analytics types, this study further investigated the specific applications of these diverse techniques within archival and museum public programming. The findings of this analysis are presented in
Table 3.
The table illustrates the significant impact of data analytics across a spectrum of archival and museum functions, moving beyond traditional practices to embrace innovative approaches. Notably, the application of data analytics not only focuses on internal improvements like archival management but also extends to enhancing external interactions through improved access, user experiences, and public engagement. Furthermore, the emergence of new research methodologies signifies a transformative shift towards data-driven insights within the cultural heritage sector.
In addition to examining the application of data analytics within archival and museum public programming, this study also considered the diverse data sources employed in these data-driven initiatives. Understanding the origin and nature of the data used was crucial for evaluating the validity and reliability of analytical insights. The results of this analysis are shown in
Table 4.
Table 4 highlights the diverse ways in which data analytics is employed, ranging from understanding textual content and user behaviour to enhancing metadata and analysing multimedia. As such, the table underscores the data-driven approaches being adopted to improve access, engagement, research, and operational efficiency within cultural heritage institutions. The second core objective of this study addressed the challenges and limitations inherent in the application of data analytics within archival and museum public programming. The findings of this analysis, presented in
Table 5, provide a synopsis of the multifaceted challenges encountered by researchers and practitioners seeking to implement data analytics in public programming archival and museum initiatives.
Table 5 outlines key challenges associated with the increasing adoption of data analytics within archival and museum contexts. These challenges span ethical considerations around data privacy, issues of data quality and potential biases, the complexities of long-term digital preservation, and the critical need to ensure equitable accessibility for all users. Therefore, the table underscores the multifaceted hurdles that cultural heritage institutions must address to responsibly and effectively leverage the power of data analytics.
4. Discussion
The analysis of publication trends, as depicted in
Figure 2, reveals an increasing scholarly and practical interest in the application of data analytics within archival and museum contexts. This trend is not occurring in a vacuum; rather, it is significantly propelled by technological advancements [
4,
77,
78]. The proliferation of digitisation initiatives, the decreasing cost of data storage, and the development of increasingly sophisticated yet user-friendly analytical tools have made it feasible for archival and museum professionals and researchers to engage with large datasets in unprecedented ways [
50,
78]. This expanding availability of digital data [
79], mirroring Lapszynski’s observations [
15] regarding the opportunities it presents, fuels a concurrent desire to leverage its potential for enhanced research, analysis, and even public engagement. Furthermore, changes in funding priorities have likely played a crucial role in shaping this publication trend [
44]. As funding bodies increasingly recognise the potential of digital technologies and data-driven approaches to enhance access [
80], preservation, and interpretation of cultural heritage, resources have been directed towards projects that explicitly incorporate data analytics [
51,
81]. This shift in emphasis has incentivised researchers and institutions to explore and publish findings related to these applications [
82,
83].
The positive trajectory observed also signifies a growing recognition amongst researchers and practitioners of data analytics’ capacity to inform and potentially transform work in these fields [
25,
28,
31]. This recognition is intertwined with shifts in research agendas within archival and museum studies [
83]. As these fields grapple with the challenges and opportunities of the digital age [
4], there is a growing impetus to move beyond traditional descriptive approaches towards more quantitative and analytical approaches [
15,
82]. The desire to uncover hidden patterns, understand user behaviour, and demonstrate the impact of cultural heritage through data-driven insights is driving scholarly inquiry in this area [
4,
81]. However, the modest rate of increase observed suggests the scope of research concerning data analytics applications within archival and museum contexts remains limited, indicating its nascent stage as a distinct area of scholarly inquiry [
77,
78].
The study findings presented in
Table 2 furnish an overview of the diverse applications of data analytics within archival and museum contexts. Specifically, a key finding underscores the breadth of data analytics techniques currently employed [
83]. Web analytics was also found to be a prominent technique (
Table 2), emphasising the growing importance of online presence for archives and museums [
48]. These studies employed analyses of user behaviour on websites and online platforms to enhance accessibility and engagement, thus demonstrating the critical role of user needs assessment within archival contexts [
7,
11]. Social media analysis was found to be another significant area of focus, indicating the importance of understanding how archives and museums interact with the public on these platforms [
73,
76]. The inclusion of studies using qualitative data analysis was crucial (
Table 2), as it provided a deeper understanding of user experiences, perspectives, and the social context of archival and museum work [
58,
65]. Whilst quantitative methods provide valuable insights, qualitative approaches complement these findings and provide a more comprehensive understanding of the research questions [
4,
58,
65].
Table 2 also showcases a diverse range of other techniques, including linked data, text mining, 3D modelling, digital forensics, and data envelopment analysis. This variety demonstrates the adaptability of data analytics to different research questions and archival/museum challenges [
25]. The general picture presented by the study (
Table 3) points towards a move towards data-driven decision-making within archives and museums [
25]. These institutions are increasingly recognising the value of data in informing strategies, improving services, and demonstrating impact [
31].
The implementation of machine learning and artificial intelligence signifies a trend towards advanced data analysis, demonstrating the emergent potential of these technologies within archival and museum domains [
80]. These techniques can be used for tasks like image recognition, personalised recommendations, and automating archival processes [
50,
66,
67]. However, while advanced technological applications like deep learning and AI are undeniably critical in offering varied benefits to archival and museum practices, it is imperative to acknowledge that they also present various limitations and challenges. These include, notably, data quality issues, algorithmic biases, ethical concerns surrounding data privacy and security, as well as the necessity for significant investment in infrastructure and expertise to ensure effective implementation, as detailed in
Table 5. Conversely, the heterogeneity of these data-driven approaches, as illustrated in
Table 2, signifies a field actively exploring the multifaceted potential of these tools, thereby affirming the increasing relevance and integration of data analytics within archival and museum settings [
32,
80].
The study’s findings, as synthesised in
Table 3, demonstrated the transformative impact of data analytics not only on the internal operations of archival and museum institutions but also on their external interactions and public engagement strategies. Consequently, data analytics emerges as a pivotal force in revolutionising archival practice, offering unprecedented capabilities for understanding user behaviour, collection engagement patterns, and the broader impact of cultural heritage resources [
29,
74]. This analysis reveals five key dimensions through which this transformative influence is manifested (
Table 3). Firstly, data analytics significantly facilitate enhanced access and discoverability of archival and museum holdings. This is achieved through mechanisms such as the strategic implementation of linked data principles and the development of refined search functionalities [
40]. Linked data transcend traditional keyword-based retrieval by establishing semantic relationships between disparate data points, thereby unlocking the potential for novel and more intuitive modes of discovery and access to cultural heritage information [
40,
48]. Secondly, data analytics demonstrably facilitates substantial enhancements in archival management. This is realised through the strategic implementation of automation technologies and artificial intelligence (AI) systems. Specifically, AI possesses the inherent capacity to automate a range of routine archival processes, including metadata creation and enrichment, and to significantly elevate the overall quality and consistency of metadata [
53]. This, in turn, directly contributes to improved operational efficiency, enhanced data integrity, and more effective long-term preservation strategies [
43,
52].
Thirdly, data analytics plays a crucial role in enhancing user experience within archival and museum settings. This is achieved through the provision of personalised content recommendations and the development of interactive and engaging exhibits, underscoring the increasing importance of user-centric design principles [
48,
49]. Empirical research consistently demonstrates the efficacy of data analytics in optimising visitor experiences, informing the design of impactful exhibits, enabling data-driven personalisation of content delivery, and fostering more meaningful and impactful engagement with cultural heritage resources [
51,
67]. Fourthly, data analytics significantly enhances public engagement by enabling the development and implementation of data-driven outreach strategies. In providing deeper insights into audience demographics, preferences, and engagement patterns, data analytics facilitates a more nuanced understanding of the public [
46]. This, in turn, leads to the creation of more targeted, effective, and impactful engagement initiatives, ultimately broadening the reach and relevance of archival and museum collections [
72,
75]. Fifthly, and finally, data analytics enables the adoption of novel research methodologies within the humanities and cultural heritage fields. These innovative methodologies have the potential to reveal latent patterns, uncover hidden connections, and generate deeper insights into historical and cultural information contained within vast datasets [
42,
45]. Therefore, these multifaceted findings underscore the increasing necessity of cultivating data literacy amongst professionals within cultural heritage institutions and making strategic investments in robust data analytics infrastructure and specialised expertise. Only through such proactive measures can these institutions fully realise the transformative potential of data-driven approaches for the long-term preservation, enhanced access, and meaningful public engagement with our shared cultural heritage [
11,
13].
This study identified a diverse range of data sources that serve as crucial inputs for informing data-driven initiatives within archival and museum contexts (
Table 4). Specifically, the analysis revealed that textual data, derived from sources such as web archives and collection descriptions, offer valuable insights into public perceptions, prevailing attitudes, and evolving discourse surrounding cultural heritage, thereby significantly enhancing narrative comprehension and contextual understanding of archival and museum collections [
41]. Furthermore, meticulously curated metadata plays a pivotal role in augmenting the discoverability of archival materials and fostering enhanced resource access through improved interoperability across digital platforms [
40,
51]. User-generated data, particularly web analytics capturing user behaviour on institutional websites, empowers institutions to strategically optimise website design, refine information architecture, and enhance the overall user experience, ultimately leading to demonstrably increased user engagement and satisfaction [
49].
Similarly, visitor data, encompassing the tracking of movements, interactions within physical spaces, and expressed preferences, facilitate the creation of personalised, deeply engaging, and interactive experiences within museum environments, catering to individual interests and learning styles [
51,
57]. The analysis of social media data furnishes invaluable, real-time insights into public opinions, prevailing attitudes towards collections and exhibitions, and broader trends in public discourse related to cultural heritage, enabling institutions to tailor outreach efforts and gauge public sentiment effectively [
56,
76]. Moreover, the integration and analysis of image and multimedia data significantly enrich digital content and substantially elevate the user experience by providing detailed visual information, contextual depth, and alternative modes of engagement with collections [
68]. Consequently, these comprehensive findings underscore the critical imperative for archives and museums to strategically leverage this diverse spectrum of data sources. Thus, through effectively harnessing the analytical potential of textual data, metadata, user data, visitor data, social media data, and multimedia data, cultural heritage institutions can significantly enhance operational efficiencies, demonstrably improve public engagement strategies, and substantially enrich user experiences within the increasingly vital digital domain [
25,
32].
Despite the significant opportunities presented by data analytics for enhancing the operations and public engagement of archives and museums, this study identifies several key challenges that necessitate careful consideration for its effective and responsible implementation (
Table 5). Paramount among these are ethical considerations surrounding data privacy and security [
51,
67]. These demand attention to the principles of informed consent, robust data security protocols, and a proactive mitigation of the potential for data misuse or discriminatory outcomes, particularly given the often-sensitive personal information held within these institutions [
79]. Consequently, archives and museums bear a profound ethical and legal responsibility to rigorously safeguard the privacy and confidentiality of individuals represented, both directly and indirectly, within their collections [
44]. Furthermore, issues pertaining to data quality and the presence of inherent biases represent another substantial challenge in the application of data analytics, including sophisticated techniques leveraging machine learning, deep learning, and artificial intelligence (AI) [
52,
66]. Data imperfections, such as inaccuracies, incompleteness, and inconsistencies, can arise from a multitude of factors, including initial data collection methodologies, sampling techniques, and the inherent biases that can be embedded within analytical algorithms, including those underpinning AI systems [
41]. These biases, if left unaddressed, can have significant ramifications on the validity and fairness of analytical outcomes, underscoring the necessity for data cleaning, validation procedures, and evaluation of the outputs generated by machine learning, deep learning, and other analytical approaches [
41,
72].
Moreover, the long-term preservation of digital data constitutes a critical challenge for cultural heritage institutions increasingly reliant on digital assets and data-driven insights [
43]. Given the inherent fragility and susceptibility of digital information to data loss, format obsolescence, and media degradation, the development and implementation of robust digital preservation strategies are imperative for ensuring the longevity and accessibility of valuable digital resources [
44]. Thus, guaranteeing continued data access for future generations necessitates proactive and adaptive measures, encompassing systematic data migration, ongoing format conversion, and the establishment of resilient and well-managed digital archives capable of accommodating evolving technological landscapes [
43]. Furthermore, ensuring accessibility for all users, including individuals with diverse abilities and disabilities, is an ethical and practical imperative in the deployment of data analytics [
50]. Effective accessibility requires diligent adherence to established data analytics accessibility guidelines, the proactive implementation of inclusive design principles from the outset of project development, and a sustained commitment to the principles of universal design to ensure that data-driven services and resources are usable and equitable for everyone [
79]. Therefore, addressing these multifaceted challenges is paramount for archives and museums to ethically and effectively harness the transformative potential of data analytics, including advanced techniques like AI and deep learning, to enhance their operations, accessibility, and engagement with the public.