Next Article in Journal
A Comprehensive Evaluation of Machine Learning and Deep Learning Models for Churn Prediction
Next Article in Special Issue
The Impact of AI-Driven Application Programming Interfaces (APIs) on Educational Information Management
Previous Article in Journal
Biasing Rule-Based Explanations Towards User Preferences
Previous Article in Special Issue
Behavioral Drivers of AI Adoption in Banking in a Semi-Mature Digital Economy: A TAM and UTAUT-2 Analysis of Stakeholder Perspectives
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Crowdsourcing and Digital Information: Looking for a Future Research Agenda

by
Fernando J. Garrigos-Simon
1 and
Yeamduan Narangajavana-Kaosiri
2,*
1
Departamento de Organización de Empresas, Universitat Politècnica de València, Camino Vera S/N, 46022 Valencia, Spain
2
Departamento de Comercialización e Investigación de Mercados, Universitat de València, 46022 Valencia, Spain
*
Author to whom correspondence should be addressed.
Information 2025, 16(7), 536; https://doi.org/10.3390/info16070536
Submission received: 30 May 2025 / Revised: 17 June 2025 / Accepted: 18 June 2025 / Published: 25 June 2025
(This article belongs to the Special Issue New Information Communication Technologies in the Digital Era)

Abstract

Crowdsourcing has become increasingly relevant in academic research due to its role in the evolving digital landscape, where information is a key driver of organizational performance. In a context dominated by emerging technologies and digital platforms, organizations are turning to external sources for data and idea generation. This paper offers a comprehensive review of the literature on crowdsourcing and digital information, using bibliometric techniques and qualitative analysis to identify major trends. The findings reveal several shifts in focus: from conceptual frameworks to practical applications; from customer participation to broader stakeholder involvement; and from general technological and managerial approaches to specific technologies and emerging perspectives in entrepreneurship and finance. The primary contributing disciplines are Computer Science, Engineering, and Information Science. Recent research (post 2023) emphasizes constructs such as “crowdfunding”, “digital platforms”, and “machine learning”, moving beyond earlier focuses like “citizen science” and “social media.” This review also reveals growing interest in managerial, medical, and cultural heritage applications, alongside a decline in research related to geography and crisis management over the past two years. This study enhances our understanding of current research directions and practical implications in crowdsourcing and digital information, offering valuable insights for both academics and practitioners.

1. Introduction

Crowdsourcing has emerged as a critical phenomenon in the digital information era, playing a key role in idea generation, problem-solving, and the transformation of organizational models. Since Howe [1] introduced the concept, it has gained increasing attention from both scholars and practitioners, driven by rapid technological advancements.
Crowdsourcing—also referred to as “massive outsourcing” or “voluntary outsourcing” [2,3]—was originally defined by Howe [1] (p. 1) as “taking a function once performed by employees and outsourcing it to an undefined (and generally large) network of people in the form of an open call”. Garrigos-Simon & Narangajavana-Kaosiri [4] (p. 2) define crowdsourcing as “the action of taking a specific task or job, whether or not previously performed by an employee of an organization or by a designated agent (such as a contractor, an external worker or a supplier), and subcontracting it, through an “open call” to a large group of people (inside or outside the organization), a community or the general public via the Internet, for compensation that does not have to be financial”.
The concept of crowdsourcing encompasses several defining characteristics. First, it relies on collective intelligence and co-creation, drawing on the contributions of individuals rather than organizations as primary sources of innovation and resources. This redefines organizational boundaries, evolving traditional outsourcing into more open and flexible structures.
The “crowd” is typically large, diverse, and composed of both external and internal stakeholders. Participation may be broad or selectively targeted and can involve humans as well as non-human agents. Contributors provide various resources—such as labor, funding, knowledge, or expertise—often mediated through digital platforms and new technologies [5].
Crowdsourcing applies across tasks of varying complexity and modularity within the value chain, including activities not previously undertaken by organizations or formerly handled by employees [6]. It can be initiated by individuals, firms, nonprofits, or public institutions and is relevant across sectors and disciplines.
Importantly, the model generates mutual benefits: while organizations gain valuable input, participants may receive compensation not only in monetary form but also through social recognition, personal development, or skill acquisition—signaling a shift in traditional labor dynamics. Ethical considerations, particularly regarding fair treatment and recognition of contributors, are essential to its responsible implementation [7,8].
Crowdsourcing has become a prominent and expanding topic in academic research, attracting attention across a wide range of disciplines. Its widespread adoption across various domains reflects its growing significance [9]. In particular, managing information across digital platforms has become a central aspect of crowdsourcing in the current technological context [10]. From a practical perspective, it enhances firm competitiveness by offering cost-effective, high-quality, and rapid solutions [1,11]. It also generates positive network externalities, reduces dependence on traditional suppliers, mitigates information asymmetries, accelerates data collection, and fosters innovation through voluntary participation and broad public engagement [12,13,14].
Lebraty and Lobre-Lebraty [15] identify three primary sources of value creation in crowdsourcing: cost reduction, innovation, and authenticity. Furthermore, crowd participation can promote market orientation, organizational change, and increased technological exposure, which may positively influence performance [3]. Moreover, crowdsourcing can indirectly contribute by developing distinctive capabilities, driving innovation, and reshaping organizational and customer behaviors [4,16].
However, despite its exponential growth and practical relevance, research on crowdsourcing remains limited [17]. Although Garrigós-Simón and Narangajavana [4] provide an in-depth review of the relevant literature, the existing literature reviews in this area are reduced, require updating, and have not specifically addressed particular aspects related to crowdsourcing and other specific issues.
Even more scarce is the literature exploring its intersection with digital information (the aim of this paper), with a notable lack of comprehensive reviews on the current state and future research directions. This constitutes a significant gap, especially considering the pivotal role of information—commonly recognized as the most critical asset for modern organizations. In the context of an increasingly complex and digitalized technological environment, the ability to manage digital information effectively is not merely beneficial but essential to organizational resilience and innovation. Within this framework, crowdsourcing emerges as a vital tool, offering a dynamic and participatory approach to information management that enables organizations to harness collective intelligence, enhance data processing capabilities, and adapt more effectively to evolving informational demands.
This article presents a bibliographic and integrative review of the literature on crowdsourcing and digital information (CDI). It identifies emerging trends, key applications, and future research directions in the field. This study contributes to the academic literature in four key ways. First, it presents the first comprehensive bibliometric analysis of crowdsourcing and digital information (CDI) research, offering novel insights into the field. Second, it provides a detailed examination of the literature’s development by analyzing core references, sources, authors, and especially keywords, thereby identifying key patterns, networks, and emerging trends. Third, this study introduces a methodological innovation that facilitates the comparison of different periods and enhances the visualization of research evolution. Finally, the findings offer practical insights for professionals and propose solid foundations and directions for future scholarly work.

2. Materials and Methods

To examine this research landscape, we employ bibliometric analysis, complemented by visual mapping techniques. Bibliometrics is an interdisciplinary method that applies mathematical and statistical tools to quantitatively assess scholarly knowledge [17]. Originating in 1917 [18], it is now widely used to trace the development of academic fields. Its capacity to generate objective, concise, and interpretable insights into specific research areas highlights its methodological value [17].
The data for this study were obtained from the Web of Science (WoS) Core Collection, the most widely used and recognized source in bibliometric research [19]. While both the WoS and Scopus are commonly employed, this study focuses exclusively on WoS due to its higher selectivity and its inclusion of only the most reputable journals [20]. The WoS is also regarded as the most neutral and representative database in the field [21].
The study retrieved all publications related to the keywords “crowdsourcing” (or “crowd sourcing”), “digital,” and “information” up to 31 March 2025, with data collection conducted in April 2025. The dataset includes full records and cited references. From an initial set of 963 documents, only articles, reviews, letters, and notes were selected to focus on the core literature, following the criteria of Garrigós-Simón et al. [21]. This resulted in a final sample of 679 documents, representing 70.5% of the original dataset.
For the co-occurrence analysis, two distinct periods were considered, 2009–2022 and 2023–2025, with a focus on author keywords. Although the post-2023 sample accounts for approximately one-third of the total and the unequal period lengths may influence the results, this approach was adopted to highlight the most recent trends in the literature.
The bibliometric indicators used are the most common in the literature: the number of papers published in the area to analyze productivity; the number of citations to measure the most influential papers and authors [20]; the h-index to determine the quality of the papers [22]; and the WoS impact factor to see the dissemination power of the journals [23].
The paper also utilized the VOSviewer 1.6.20 software [24] for data mapping, providing a clearer illustration of structures, networks, and clusters in the analysis. The analyses conducted included a co-occurrence analysis of authors’ keywords and a co-citation analysis of references, journals and authors [25]. The work also develops a hierarchical analysis of clusters or groups of related works and develops a bibliographic coupling of authors [26]. These analyses are the most used in the bibliometric literature [21,23].
Our research introduces novel visual innovations by employing color coding to facilitate the comparative analysis of authors’ keyword evolution across different periods. This approach aims to enhance the visualization of thematic networks and their development, contributing new methodological tools to academic research.

3. Results

The results are structured around five main analyses. First, this section outlines the current state, trends, and developments in the literature on crowdsourcing and digital information, including an examination of citation patterns. Second, it identifies and reviews the most frequently cited and co-cited papers in the field. Third, the paper analyzes the primary publication sources through citation and co-citation analyses. Fourth, the study explores the co-occurrence of author keywords, tracing their evolution over distinct phases. Finally, it examines the most influential authors using co-citation and bibliographic coupling techniques.

3.1. The Status and Evolution of Crowdsourcing and Digital Information in the Literature

The first article on crowdsourcing in a digital context indexed in Web of Science was published by Trainor in 2009 [27], highlighting the role of open-source software and crowdsourcing in enhancing library services. Shortly thereafter, academic interest in the topic surged. By 2012, the number of publications (articles, reviews, and letters) had grown to 12, reaching 62 in 2017 and peaking at 89 in 2023. This growth in publications was mirrored by a sharp rise in citations: over 100 annual citations were recorded by 2014, 576 in 2017, and more than 1000 citations annually since 2019. In 2024, citations peaked at 2680. Figure 1 presents the annual trends in publications and citations.
Table 1 presents the citation structure in the field of CDI. According to the Web of Science, one article has received over 500 citations, and eight have surpassed 200 (1.18% of the total). Additionally, 40.8% of the publications have received at least 10 citations. The h-index for the field is 56, indicating that 56 papers have received at least 56 citations each, which reflects the overall impact of the area [23]. In total, the field has accumulated 13,937 citations, with an average of 20.53 citations per article.

3.2. Analysis of Citation and Co-Citation of Papers

3.2.1. Most Cited Papers in CDI

Table 2 lists the most frequently cited articles on CDI within our sample, based on citation counts from the Web of Science. Citation count serves as a key indicator of an article’s quality, influence, and academic relevance [17,23].
The most cited article in the sample is Pennycook and Rand [28], with 550 citations and the highest average citations per year (137.5). This work, situated in psychology and behavioral sciences, explores why individuals believe and share misinformation online, highlighting the role of crowdsourced veracity ratings.
Wu et al. [29] ranks second with 456 citations and third in citations per year (45.6). Positioned at the intersection of computer science and management, it analyzes cloud-based design and manufacturing within a digital framework.
Steelman et al. [30], published in MIS Quarterly, ranks third in total citations (307), though it lags in annual citation rate. The study examines the use of digital crowdsourcing platforms, such as Amazon Mechanical Turk, for data collection in the digital age.
Tangcharoensathien et al. [31], published in the Journal of Medical Internet Research, is fourth in total citations (296) but ranks second in citations per year (59.2). The article proposes crowdsourced strategies for managing the COVID-19 infodemic through stakeholder collaboration

3.2.2. References Co-Citation Analysis

The co-citation analysis—based on the network of documents co-cited by the 679 selected publications—reveals the intellectual structure, thematic connections, and evolution of the field of CDI. In this network, nodes represent co-cited documents, highlighting core research themes.
Figure 2 shows that Howe [1], Goodchild [32], and Estellés-Arolas & González-Ladrón-De-Guevara [33] lead in both total link strength and citation frequency. Howe (2006), cited by 49 of the 679 documents (after merging duplicates), is foundational for introducing and defining the term “crowdsourcing.” Although highly cited, this article is not part of the sample itself as it was published in a magazine not indexed by the Web of Science.
Goodchild [32], with 40 citations from the sample, is the second most co-cited work. It examines volunteered geographic information (VGI), compares it with citizen science, and explores issues such as motivation, privacy, and data accuracy.
Estellés-Arolas and González-Ladrón-De-Guevara [33] complete the top three. Their study offers a comprehensive review of crowdsourcing definitions and proposes a validated, integrated definition widely adopted in the field.
Figure 2 displays six co-citation clusters, each representing thematic concentrations within the CDI literature.
The largest cluster (red, 30 items) is led by Goodchild [32] and centers on volunteered geographic information and citizen science. It includes highly cited works such as Zook [34], Bonney et al. [35], Goodchild et al. [36], and Haklay [37].
The second-largest cluster (green, 29 items) is led by Afuah and Tucci [38] and includes Fornell and Larcker [39] and Brabham [40]. This group focuses on crowdsourcing from a managerial and methodological perspective.
The third cluster (dark blue, 23 items), led by Howe [1] and Estellés-Arolas & González-Ladrón-De-Guevara [33], emphasizes conceptual and theoretical analyses of crowdsourcing. It also includes Brabham [12,41], Howe [42], and Braun & Clarke [43].
The fourth cluster (yellow, 15 items) focuses on crowdfunding, a financial dimension of crowdsourcing. It includes Mollick [44] and Belleflamme et al. [45], ranked fifth and seventh in citations, respectively.
The fifth cluster (violet, three items), led by Gao et al. [46], approaches crowdsourcing from a digital and computing perspective.
The sixth and smallest cluster (light blue, three items) is marginal, with no articles among the top 50 most cited.

3.3. Analysis of Citation and Co-Citation of Sources

3.3.1. Leading CDI Areas and Journals

The 679 documents on CDI were published across 577 journals and span 128 subject categories. The dominant fields were Computer Science (453 articles, 66.62%), Engineering (26.07%), Information Science & Library Science (22.98%), Communication (21.94%), and Business & Economics (20.47%).
Two journals—ISPRS International Journal of Geo-Information and Sensors—each published 10 papers on the topic. Thirteen journals, each contributing five or more articles, accounted for ninety documents (13.25% of the total) (see Table 3). Among them, the highest h-index values (6) were reported by ISPRS International Journal of Geo-Information, Sensors, Technological Forecasting and Social Change, and PLOS One.
In terms of journal specialization, ACM Journal on Computing and Cultural Heritage had the highest share of its total output (1.26%) dedicated to the field, followed by JMIR Public Health and Surveillance (0.35%) and Journal of the Association for Information Science and Technology (0.28%).
Regarding impact, measured by average citations per article, Journal of Medical Internet Research led with 65 citations per article, followed by Remote Sensing (51.2) and JMIR Public Health and Surveillance (36.2)

3.3.2. Journal Co-Citation Analysis

This co-citation analysis examines the frequency with which two journals are cited together in a third source, revealing intellectual linkages across disciplines [21]. As shown in Figure 3, three primary clusters emerge.
The first cluster (red, 44 items) is led by Lecture Notes in Computer Science—the most cited journal (2307 citations) but 28th in link strength (42,641). It also includes PLOS One (third in citations, 1865) and Science (eighth). This group is primarily associated with computer science, engineering, and related fields such as medicine, geography, and environmental sciences.
The second cluster (green, 24 items) is led by MIS Quarterly (second in citations with 245, but first in link strength: 8835), Computers in Human Behavior (144 citations, 2818 link strength), and Technological Forecasting and Social Change (137 citations, 4395 link strength). Journals in this group focus on information science and the application of computing in management and marketing.
The third cluster (blue, 17 items) includes Academy of Management Journal (151 citations; second in link strength: 6304), Management Science (151 citations, 5302), Research Policy (147 citations, 5440), and Organization Science (143 citations; third in link strength: 5736). This cluster is primarily oriented toward management and organizational studies.

3.4. Co-Occurrence Analysis of “Author Keywords”

This section analyzes publications on CDI across two periods: 2009–2022 and 2023–2025.
Figure 4a focuses on the first period (2009–2022). Excluding crowdsourcing (unifying variants like crowd-sourcing and crow sourcing), the most frequent keyword is citizen science, followed by social media.
The red cluster (11 items), the largest, includes keywords such as digital humanities, innovation (6th and 7th in frequency), sustainability (13th), and collaboration (16th). The green cluster, also with 11 items and led by social media (3rd), includes emergency management and digital humanitarianism (both ranked 13th and 16th).
The dark blue cluster (10 items) includes COVID-19 and internet (both 8th), as well as surveillance (16th). The yellow cluster (nine items) features Internet of Things and privacy (8th and 16th).
The violet cluster (seven items) is led by OpenStreetMap and cloud computing, though it lacks terms among the top 10. The light blue cluster (six items), is led by citizen science and VGI (Volunteered Geographic Information, fourth; we consolidated multiple variants of this term).
The orange cluster (five items) includes big data and machine learning (fourth and eighth). The brown cluster (four items) contains crowdsourcing, and open innovation (eighth).
The pink cluster (four items) is led by smartphone (16th), while the remaining four clusters contain only one or two keywords each, all in marginal positions.
Figure 4b presents the keyword co-occurrence network for the 2023–2025 period. While crowdsourcing remains the dominant term, new leading keywords include crowdfunding, digital platforms, machine learning, and blockchain.
The red cluster (nine items) is led by crowdfunding and digital platforms—now the second and third most frequent terms—and includes digital transformation (15th). The green cluster (eight items), headed by blockchain and digital twin (both ranked 5th), also features task analysis and privacy (8th and 15th, respectively).
The dark blue cluster (seven items) is led by citizen science (9th) and includes digital health (11th), artificial intelligence, and photogrammetry (both 14th). The yellow cluster, also with seven items, is centered on crowdsourcing and includes machine learning (4th), deep learning (9th), and crowdwork (14th).
The violet cluster (five items) is led by cultural heritage (7th) and includes online communities and social media (both 11th). Lastly, smartphone, though outside the top 19 keywords, is the sole member of the light blue cluster
The results reveal a significant shift in keyword rankings, highlighting evolving research trends. As shown in Table 4, crowdsourcing remains the only consistent term, while the rest of the top keywords undergo substantial changes.
First, the new crowdsourcing cluster incorporates emerging terms such as deep learning, crowdwork, authentication, misinformation, and MTurk, with machine learning rising to fourth place. This reflects both a technological evolution and a shift in theoretical focus—from client-centered co-creation to the broader involvement of diverse stakeholders through crowdwork.
Second, all components of the main post-2023 cluster (red) are newly ranked, led by crowdfunding and digital platforms (second and third, respectively). This underscores the increasing emphasis on the financial and managerial applications of crowdsourcing and digital information. Similarly, the second cluster (green), now focused on blockchain, digital twin, and task analysis—all within the top eight—signals a move from general innovation-focused terms (e.g., innovation, open innovation) to more specific entrepreneurship-related concepts (digital entrepreneurship, entrepreneurship).
This trend is also reflected in the replacement of broad technological terms such as big data, internet, IoT, cloud computing, and data collection with more targeted tools and concerns such as privacy and fairness. The prominence of smartphone also declines.
Moreover, the importance of citizen science diminishes, dropping from second to ninth place, and the literature related to geographic and emergency applications—volunteered geographic information, emergency management, OpenStreetMap, and critical cartography—disappears from the top rankings. In contrast, new technologies like artificial intelligence and photogrammetry, as well as digital health, gain prominence.
Heritage-related research also rises, with cultural heritage and digital heritage entering the ranking at seventh and nineteenth, respectively. Meanwhile, online communities and social media occupy the eleventh position, though the latter has declined from its previous third-place standing.
Finally, several previously prominent keywords, including digital humanities and digital humanitarianism, have vanished from the current ranking, underscoring a broader thematic reorientation in the field.
These changes reflect a broader evolution in the literature—from an initial phase focused on defining crowdsourcing to a new phase emphasizing practical and methodological concerns. This shift explains the emergence of keywords such as task analysis (now ranked eighth) and case study, as well as the growing role of advanced technological tools that increasingly serve as research methodologies.
A detailed analysis reveals shifting disciplinary emphases within the literature. While perspectives from Economics (e.g., sharing economy) and Planning/Environmental Sciences (e.g., built environment) show emerging relevance—despite the disappearance of sustainability—there is a notable decline in contributions from Documentation, Psychology, and Marketing, as evidenced by the absence of keywords such as digital libraries, research libraries, and gamification. In contrast, terms associated with Law and Ethics (privacy) and, particularly, Finance (crowdfunding, now the second most prominent keyword) have gained prominence.
The findings also underscore a paradigm shift in the management and information systems literature, transitioning toward the vision of Web 3.0, where digital technologies enable enhanced human collaboration and adaptive web services [2]. This evolution aligns with the rise in ubiquitous computing and pervasive digital environments and the increasing automation of data-driven decision-making in business intelligence. It also reflects the emergence of Marketing 5.0 and 6.0, grounded in next-generation technologies.
This transformation is particularly evident in the post-2023 ranking, with the ascent of machine learning, the introduction of blockchain and deep learning, and the continued rise in artificial intelligence—all of them signifying the growing centrality of cutting-edge technologies in current research

3.5. Authors Co-Citation Network and Bibliographic Coupling of Authors

Figure 5 presents the author co-citation network, revealing five distinct clusters. The largest, in red (23 items), is led by Howe and Brabham—the most and third-most cited authors with 97 and 68 citations, respectively—alongside Estellés-Arolas (34) and Deng (24). This group is primarily oriented toward management and organizational studies.
The second cluster, in green (also 23 items), is headed by Zhang, the World Health Organization, Liu, and Wang (with 33, 24, 23, and 22 citations, respectively). This cluster reflects an interdisciplinary focus, integrating computational, health-related, and social network research.
The blue cluster (29 items) is led by Nambisan (42 citations), Majchrzak (24), and Boudreau (22), and centers on digital and strategic innovation and entrepreneurship.
The yellow cluster (16 items) focuses on crowdfunding from a financial perspective and is led by Mollick (37 citations), Belleflamme (28), and Burtch (22).
Finally, the violet cluster (14 items) is dominated by Goodchild, the second-most cited author (88 citations), along with Haklay (65) and Elwood (36). This group specializes in volunteered geographic information, citizen science, and environmental data.
Figure 6 illustrates the bibliographic coupling of authors, highlighting five main clusters with a total of eighteen interconnected authors. The ranking is led by Tucker (361 link strength, five documents), followed by Tang (327 l.s., four doc.), Clifford (287 l.s., four doc.), and Joshi and Taylor (both with 259 l.s., 3 doc.).
The largest cluster, in red (seven authors), includes Joshi, Taylor, Steinke, Van Etten, and the five lowest-ranked authors. This group focuses on management information systems and citizen science.
The green cluster (four authors) includes Clifford, Behar, Zhu, and Sameni (ranked seventh to ninth), with a shared emphasis on physiological and biomedical computing and medical applications.
The light blue cluster (11 authors) encompasses the top-ranked authors—Tucker, Tang, and Wu (sixth)—and is oriented toward health services and medical research.
The yellow cluster (two authors), Liu and Zhang (ranked 10th), is centered on computer science topics such as the Internet of Things and intelligent big data management.
The violet cluster (two authors), López-Pérez and Molina (ranked 14th), is linked to computational methods and software development in biomedical contexts.

4. Discussion and Conclusions

This study has examined the scholarly literature on crowdsourcing and digital information using bibliometric analysis. Through this methodology, this research traced the evolution of the field, offering a structured overview of its intellectual development. The findings enabled the identification, organization, and visualization of key knowledge domains; the assessment of the field’s current state and historical trajectory; the exploration of prevailing theoretical perspectives; and the recognition of practical applications and emerging trends. Furthermore, this study provides insights to enhance implementation strategies and mitigate potential challenges associated with the use of crowdsourcing in digital information management
This work offers several key findings on the evolving field of crowdsourcing and its intersection with digital information. Crowdsourcing is identified as a multifaceted and interdisciplinary phenomenon, characterized by ongoing debates, evolving discourses, and critical perspectives since its inception [12,33]. In this work, crowdsourcing is considered as “the action of taking a specific task or job, whether or not previously performed by an employee of an organization or by a designated agent (such as a contractor, an external worker or a supplier), and subcontracting it, through an “open call” to a large group of people (inside or outside the organization), a community or the general public via the Internet, for compensation that does not have to be financial” [1,24,33] (p. 2). Our analysis underscores the complexity of the crowdsourcing process and elaborates on its key characteristics and interdisciplinary nature by focusing on its intersection with digital information. Furthermore, the analysis highlights crowdsourcing’s strategic value in enhancing organizational competitiveness and efficiency.
Crowdsourcing has emerged as a pivotal tool for organizations and a dynamic area of scholarly inquiry. As a form of digital innovation, it facilitates the decentralization of knowledge production and task execution, enabling organizations to leverage dispersed expertise, creativity, and labor across global networks [47]. Its transformative impact is particularly salient in the reconfiguration of e-commerce models and organizational processes, representing a paradigm shift in how tasks are conceptualized and operationalized in the digital age [48].
Moreover, and specifically, the study underscores the centrality of information as the most strategic resource for contemporary organizations. It argues that the effective generation, dissemination, and management of digital information—facilitated by crowdsourcing platforms—is critical for organizational adaptability and competitiveness in increasingly complex technological environments [49,50].
Although academic research on crowdsourcing remains relatively nascent, especially at the intersection with digital information, the convergence of globalization, digitalization, and rapid technological evolution has created a fertile context for its development and institutionalization [51]. The bibliometric analysis conducted in this study reveals a marked increase in scholarly output and influence, as evidenced by publication volume, citation rates, and the emergence of leading contributors to the field. These trends reinforce both the theoretical relevance and practical viability of crowdsourcing as a cornerstone of digital strategy.
In addition to mapping the evolution of the field on CDI, in a pioneering way, this study offers critical insights for future research trajectories and provides actionable guidance for practitioners seeking to harness crowdsourcing for innovation, efficiency, and strategic growth in the digital economy.

4.1. Theorical Implications and Research Agenda

The rapid emergence and exponential growth of the literature on crowdsourcing and digital information underscore its significant implications and substantial potential for future research. However, advancing this research requires a comprehensive understanding of the current state of the literature, its primary domains, key themes, and emerging opportunities.
Our findings suggest that CDI research is undergoing a notable transformation.
There is a shift from broad conceptual and theoretical discussions—often centered on defining crowdsourcing—toward more focused analyses of its concrete applications, organizational implementation, and management practices.
The prominence of conceptual research is evident, as the first and third most co-cited papers—Howe [1] and Estellés-Arolas & González-Ladrón-De-Guevara [33]—are foundational works focused on defining crowdsourcing. Similarly, the third-largest co-citation cluster centers on theoretical analyses.
Empirical research is primarily focused on case studies and investigations related to emerging technological developments, which increasingly serve as methodological tools (see Table 4). Methodological contributions also play a significant role; notably, the second-largest co-citation cluster encompasses both managerial and methodological perspectives.
Despite these advances, there remains a notable gap in the application of classical quantitative methodologies, such as structural equation modeling and other statistical techniques, which continue to be secondary (see Table 4 and Figure 2). This gap reflects both the scarcity and limitations of available data, which constrain more rigorous, data-driven investigations of CDI processes. Moreover, the findings highlight the need to develop novel indicators to effectively assess the benefits, challenges, and organizational impacts of crowdsourcing
The results identify the main fields engaged with CDI, trace their evolution, and reveal connections across areas of study and application. This highlights emerging analytical perspectives that may open new avenues for research. While traditional disciplines such as Computer Science, Engineering, and Information Science remain central, fields like Communication and Business–Economics are increasingly relevant and offer significant potential for future exploration. Let us concentrate on the main areas observed.

4.1.1. Computer Science

The dominant role of Computer Science in the CDI field is evident, as two of the top three most-cited papers focus on cloud-based design [29] and computer-aided design [30]. Additionally, the fifth-largest cluster in the reference co-citation analysis reflects a strong digital and computing orientation. Sensors emerges as the most prolific and influential journal—sharing the highest h-index with Technological Forecasting and Social Change—while Lecture Notes in Computer Science leads the largest journal co-citation cluster, a network closely tied to computer science and engineering disciplines.
The second-largest author co-citation cluster shows an interdisciplinary orientation, combining computational approaches with social network analysis and other domains. The bibliographic coupling of authors reveals two key clusters within this area: one focused on computer science topics like the Internet of Things and intelligent big data management and another on computational methods and software development within biomedical contexts.
An analysis of the evolution of author keyword co-occurrence networks indicates a shift from broad, traditional technological terms (e.g., big data, IoT, cloud computing, data collection) toward more specialized constructs aligned with advances in ubiquitous computing and artificial intelligence (e.g., machine learning, deep learning, photogrammetry). Meanwhile, terms such as digital humanities and digital humanitarianism have declined in prominence, suggesting the growing influence of “non-human” contributors [52] and a shift toward technology-driven innovation in organizational management.
Future research should further explore the expanding roles of image and multimedia analysis [53,54], sensor technologies, and mobile applications in the CDI landscape.

4.1.2. Engineering and Information Science and Library Science

Engineering, the second most prominent field in CDI research, is closely intertwined with Computer Science. This overlap is evident in shared publication venues and citation patterns—Sensors and PLOS One, for example, are among the most prolific and highly cited sources in both areas. The largest cluster in the journal co-citation analysis reflects this integration, encompassing journals from computer science, engineering, and related disciplines, including PLOS One and Science.
Information Science and Library Science also hold a significant position in CDI research. The Journal of the Association for Information Science and Technology stands out for having a high concentration of publications in the field. However, the evolution of the author keyword co-occurrence network reveals a decline in this area’s influence, reflected in the decreasing prominence of constructs such as digital libraries and research libraries.
Further research could focus on the individual evolution of these disciplines.

4.1.3. Management

The managerial perspective—particularly within the field of Information Systems—emerges as a key dimension in CDI research, despite being formally ranked in the fourth most prominent area. A deeper analysis reveals strong connections: the second most-cited paper [29] addresses manufacturing, while the third [30], published in MIS Quarterly, focuses on digital platforms. The second-largest reference co-citation cluster adopts a managerial orientation, as do the second and third journal co-citation clusters, led by MIS Quarterly and Academy of Management Journal, respectively. Similarly, the largest author co-citation cluster includes scholars with a managerial focus.
Several important trends emerge from the analysis. First, a theoretical shift is observed—from a client-centered co-creation model to broader stakeholder engagement through crowdwork, as reflected in the co-occurrence of author keywords. Second, growing interest in specific managerial applications is evident, particularly in areas such as blockchain, digital twins, and task analysis (second largest cluster). Third, there is a transition from general innovation-related terms (e.g., open innovation, social media) (or the focus on the social media or smartphones) toward concepts linked to digital entrepreneurship and the implementation of crowdsourcing in organizational processes (keywords such as digital entrepreneurship, entrepreneurship, and task analysis, or the third author co-citation cluster).
A notable development is the rise in the financial dimension of crowdsourcing, especially crowdfunding. This is evident in the fourth-largest reference co-citation cluster, centered on crowdfunding, as well as in the keyword co-occurrence analysis, where crowdfunding ranks as the second most important term for the 2023–2025 period, leading the largest keyword cluster alongside digital platforms. The author co-citation network also identifies a distinct cluster with a financial focus.
Future research should further investigate the application of CDI across various value chain activities [16] and explore incentive mechanisms beyond extrinsic motivations. Additionally, emphasis should be placed on ‘fair play’ principles and the effective management of human capital.

4.1.4. Psychology, Law, Ethics Marketing, Economics

Additional emerging trends can be identified in the CDI field. One notable development is the growing emphasis on human capital, both in the development and organizational implementation of CDI. This is reflected in the rising relevance of keywords related to human resource management, digital transformation, and the broader management of the ubiquitous web. Related disciplines such as Psychology, Law, Ethics, Marketing and Economics are also increasingly represented.
Psychological and behavioral sciences show particular promise. The most cited paper in the field [28] originates from this area, and journals like Computers in Human Behavior play a prominent role. Although the evolution of author keyword co-occurrence does not indicate a significant rise in contributions from psychology overall, notable activity is observed in studies using Amazon Mechanical Turk (MTurk), which appears frequently as a keyword.
Future research could explore underdeveloped yet promising directions, including cognitive psychology (e.g., learning processes such as reinforcement learning) and reputation analysis—both of which are emerging as potential areas of growth for CDI.
The growing importance of the individual perspective in CDI is reflected in the increased prominence of terms related to Law and Ethics (e.g., privacy, fairness, affordances), and Marketing (e.g., authentication, misinformation) (observe the keywords co-occurrence analysis)—area represented in the journal co-citation analysis. Ethical concerns, particularly around privacy and fairness, remain essential for ensuring crowd participation and trust [11,55]. Marketing also presents significant potential, as well as integrating interdisciplinary approaches to crowdsourcing management.
In addition, perspectives from Economics (e.g., sharing economy) are gaining relevance. Keyword co-occurrence analysis highlights its emerging role, indicating its growing contribution to CDI research. Further research should build upon these trends, with a particular focus on the co-creation process and the circular economy facilitated by CDI.

4.1.5. Geography and Citizen Science

Geography and Citizen Science play a prominent role in CDI research. Goodchild [32], a foundational work on volunteered geographic information (VGI), is the second most co-cited paper, and this field represents the largest reference co-citation clusters. The ISPRS International Journal of Geo-Information stands out as the leading publication in terms of volume and h-index. Geographic sources also appear prominently in the largest journal co-citation cluster.
The author co-citation network reveals a distinct cluster centered on VGI, citizen science, and environmental data, while the main cluster in the bibliographic coupling of authors reflects a blend of management information systems and citizen science. Keyword co-occurrence analysis shows that until 2023, citizen science and VGI ranked among the top four keywords in the CDI literature. However, recent trends indicate a decline in their prominence, with geographic and emergency-related constructs (e.g., emergency management) disappearing from top rankings (although some new resources such as digital twins are extensively used nowadays in related areas).
Conversely, cultural heritage is gaining importance. The ACM Journal on Computing and Cultural Heritage has the highest proportion of its output dedicated to CDI, and cultural heritage leads the fifth-largest keyword co-occurrence cluster for the 2023–2025 period. It now ranks seventh overall, alongside the emergence of digital heritage among the top keywords.
Future research could explore related geographical domains such as transport and tourism or digital twins of the cities [53], which show potential as emerging areas within the CDI field.

4.1.6. Medicine

Medical perspectives are increasingly significant in CDI research. The fourth most-cited paper—Tangcharoensathien et al. [31], ranked second in citations per year—focuses on crowdsourcing strategies for managing the COVID-19 infodemic. JMIR Public Health and Surveillance has the second-highest share of its total output dedicated to CDI, while it and the Journal of Medical Internet Research rank third and first in average citations per article.
Medicine also features prominently in the largest journal co-citation cluster. The keyword digital health rose to 11th place in the 2023–2025 co-occurrence ranking, indicating increased relevance. The second-largest author co-citation cluster includes scholars integrating computational and health-related research. Similarly, in the bibliographic coupling of authors, the second cluster focuses on biomedical computing and medical applications, the third on health services and medical research, and the fifth on computing in medical contexts.
Future research should explore the integrated use of crowdsourcing, computing, and medicine to advance physiological computing and enhance health services and applications

4.1.7. Others

CDI research spans a wide range of disciplines and should be examined through diverse methodological and disciplinary lenses. Environmental sciences stand out, with constructs like built environment gaining prominence in keyword co-occurrence analysis and environmental journals appearing in the main journal co-citation cluster. Qualitative evidence also highlights CDI’s relevance in communication, biology, telecommunications, mathematics, and arts humanities, as well as in emerging or underexplored fields such as education, physics, linguistics, or chemistry—some of them supported by publication and co-citation patterns in multidisciplinary journals like PLOS One, IEEE journals, and Science.
Overall, the dispersion of CDI-related work across scientific domains, coupled with the expansion of the literature in computing, management, and geography, and the growing diversity of relevant keywords, points to strong potential for future development. Additionally, the presence of “trendy” topics may shape the field’s trajectory, as citation-driven publication incentives influence both authors and journals.

4.2. Practical Implications

Professionals across sectors face the challenge of integrating new policies and processes within their organizations. The diverse disciplinary perspectives on CDI contribute to a more comprehensive understanding, fostering innovation. Crowdsourcing is applicable across industries and domains [11], offering opportunities both for developing new business models [12,56] or information models [53] and for enhancing organizational management.
CDI encompasses various crowdsourcing schemes—such as crowdvoting, creative crowdsourcing, microwork, crowd workforce management [16], and crowdfunding [44,57]—and can be employed throughout multiple stages of the value chain [2,3,6,16,58], including in tasks traditionally overlooked by firms [52], or for marketing purposes.
However, the success of CDI initiatives depends on effective task analysis and design, contributor selection, and attention to considerations such as privacy. Engagement and workers’ motivation also depends on the evolution of online communities [59] and social media. Poorly structured processes or inadequate crowd management (crowdwork) can significantly limit the potential impact of crowdsourcing efforts.
The implementation of CDI especially raises critical ethical considerations, particularly regarding fair remuneration [7], fairness, affordance, misinformation, and the risks associated with the “dark side” of crowdsourcing [8]. Managers must ensure the sustainability of initiatives [21], promote social value, and integrate ethical practices across social, environmental, cultural, and economic dimensions [60] or consider heritage and the digital health.
Emerging CDI frameworks should also account for non-human participation [52], enabled by Web 3.0 technologies and the pervasive web [2], or for the development of the Industry 4.0 [58]. Practitioners need to assess the evolving role of digital platforms [57], new planning and management tools, open innovation solutions [59,61], and the digital transformation around artificial intelligence [54,56], blockchain [10,62], digital twin [53,54,56,62] and especially machine learning and deep learning [54].
Future CDI should be conceived as a dynamic, multilevel process shaped by advances in computer science, management, psychology, geography, and medicine. This evolution must continuously redefine its challenges and applications to address complex economic, social, environmental, technological, and ethical issues—particularly those linked to participant motivation and fair engagement [55,59].

4.3. Limitations and Future Research

The scope of this study highlights certain limitations that offer opportunities for further research. The emerging relevance and rapid expansion of CDI, alongside limited scholarly output in key scientific domains, underscore important research gaps. Future studies could build on the trends identified here, adopting more specialized disciplinary lenses—such as Computer Science, Management, or Geography.
In particular, inside the management domain, greater attention is needed on areas such as marketing, finance, social media management, intelligent technologies, knowledge management, intellectual capital, open innovation, entrepreneurship, ethics, and corporate social responsibility.
Additionally, the literature shows signs of increasing maturity, with a shift toward empirical research. However, the development of robust measurement indicators remains an important challenge for advancing the field.
The bibliometric approach employed in this study presents inherent limitations if not complemented by qualitative analysis, which we have sought to address through detailed data interpretation. The exclusive use of the WoS Core Collection, focusing only on articles, reviews, letters, and notes, also constrains the scope. Future research could expand on this by incorporating additional document types, alternative databases (e.g., Scopus, Google Scholar), gray literature (e.g., theses), or non-English sources. Moreover, the use of diverse analytical tools and software could enrich the findings. Further refinement through an in-depth examination of specific clusters or thematic areas is also recommended.

Author Contributions

Conceptualization, F.J.G.-S. and Y.N.-K.; Methodology, F.J.G.-S. and Y.N.-K.; Software, F.J.G.-S.; Validation, F.J.G.-S.; Formal analysis, F.J.G.-S. and Y.N.-K.; Investigation, F.J.G.-S. and Y.N.-K.; Resources, F.J.G.-S.; Data curation, F.J.G.-S. and Y.N.-K.; Writing—original draft, F.J.G.-S.; Writing—review & editing, Y.N.-K.; Project administration, Y.N.-K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Acknowledgments

The authors would like to thank Universitat Politècnica de València, and Universitat de València for supporting this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Howe, J. The rise of crowdsourcing. Wired Mag. 2006, 14, 176–183. [Google Scholar]
  2. Garrigos-Simon, F.J.; Lapiedra Alcami, R.; Barbera Ribera, T. Social Networks and Web 3.0: Their impact on the management and marketing of organizations. Manag. Decis. 2012, 50, 1880–1890. [Google Scholar] [CrossRef]
  3. Garrigos-Simon, F.J.; Narangajavana, Y. From Crowdsourcing to the Use of Masscapital. The Common Perspective of the Success of Apple, Facebook, Google, Lego, TripAdvisor, and Zara. In Advances in Crowdsourcing; Garrigos, F., Gil, I., Estelles, S., Eds.; Springer: Cham, Switzerland, 2015; pp. 1–13. [Google Scholar]
  4. Garrigos-Simon, F.J.; Narangajavana-Kaosiri, Y. Crowdsourcing: A Systematic Literature Review and Future Research Agenda. Eur. J. Stud. Manag. Bus. 2024, 30, 1–26. [Google Scholar] [CrossRef]
  5. Boudreau, K.J.; Lakhani, K.R. Using the crowd as an innovation partner. Harv. Bus. Rev. 2013, 91, 60–69. [Google Scholar]
  6. Garrigos-Simon, F.J.; Narangajavana, Y.; Galdón-Salvador, J.L. Crowdsourcing as a competitive advantage for new business models. In Strategies in E-Business; Gil, I., Palacios, D., Peris, M., Vendrell, E., Ramirez, C., Eds.; Springer: Boston, MA, USA, 2014; pp. 29–37. [Google Scholar]
  7. Bañón-Gomis, A.J.; Martínez-Cañas, R.; Ruiz-Palomino, P. Humanizing Internal Crowdsourcing Best Practices. In Advances in Crowdsourcing; Garrigos, F., Gil, I., Estelles, S., Eds.; Springer: Cham, Switzerland, 2015; pp. 105–117. [Google Scholar]
  8. Wilson, M.; Robson, K.; Botha, E. Crowdsourcing in a time of empowered stakeholders: Lessons from crowdsourcing campaigns. Bus. Horiz. 2017, 60, 247–253. [Google Scholar] [CrossRef]
  9. Nikooghadam, M.; Shahriari, H.R. Verifiable and secure data sharing in crowdsourcing-based healthcare. J. Supercomput. 2025, 81, 685. [Google Scholar] [CrossRef]
  10. Song, M.; Gong, X.; Jiao, R.J.; Moore, R. A blockchain-enabled information as a service (IaaS) system for crowdsourced manufacturing: A crowdsourcing case study of tank trailer manufacturing. J. Ind. Inf. Integr. 2025, 45, 100844. [Google Scholar] [CrossRef]
  11. Blohm, I.; Zogaj, S.; Bretschneider, U.; Leimeister, J.M. How to manage crowdsourcing platforms effectively? Calif. Manag. Rev. 2018, 60, 122–149. [Google Scholar] [CrossRef]
  12. Brabham, D.C. Crowdsourcing as a model for problem solving: An introduction and cases. Convergence 2008, 14, 75–90. [Google Scholar] [CrossRef]
  13. Kleemann, F.; Voß, G.G.; Rieder, K. Un(der)paid innovators: The commercial utilization of consumer work through crowdsourcing. Sci. Technol. Innov. Stud. 2008, 4, 5–26. [Google Scholar]
  14. Wang, L.; Xia, E.; Li, H.; Wang, W. A Bibliometric analysis of crowdsourcing in the field of public health. Int. J. Environ. Res. Public Health 2019, 16, 3825. [Google Scholar] [CrossRef]
  15. Lebraty, J.F.; Lobre-Lebraty, K. Crowdsourcing: One Step Beyond; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
  16. Garrigos-Simon, F.J.; Galdon, J.L.; Sanz-Blas, S. Effects of crowdvoting on hotels: The Booking.com case. Int. J. Contemp. Hosp. Manag. 2017, 29, 419–437. [Google Scholar] [CrossRef]
  17. Liao, H.; Tang, M.; Luo, L.; Li, C.; Chiclana, F.; Zeng, X.J. A bibliometric analysis and visualization of medical big data research. Sustainability 2018, 10, 166. [Google Scholar] [CrossRef]
  18. Osareh, F. Bibliometrics, Citation Analysis and co-citation analysis: A review of literature I. Libri 1996, 46, 221–225. [Google Scholar] [CrossRef]
  19. López-Meneses, E.; Vázquez-Cano, E.; Román, P. Analysis and Implications of the Impact of MOOC Movement in the Scientific Community: JCR and Scopus (2010–13). Comunicar 2015, 22, 73–80. [Google Scholar] [CrossRef]
  20. Merigó, J.M.; Yang, J.B. Accounting research: A bibliometric analysis. Aust. Account. Rev. 2017, 27, 71–100. [Google Scholar] [CrossRef]
  21. Garrigos-Simon, F.J.; Narangajavana-Kaosiri, Y.; Lengua-Lengua, I. Tourism and Sustainability: A Bibliometric and Visualization Analysis. Sustainability 2018, 10, 1976. [Google Scholar] [CrossRef]
  22. Hirsch, J.E. An index to quantify an individual’s scientific research output. Proc. Natl. Acad. Sci. USA 2005, 102, 16569–16572. [Google Scholar] [CrossRef]
  23. Blanco-Mesa, F.; Merigó, J.M.; Gil-Lafuente, A.M. Fuzzy decision making: A bibliometric-based review. J. Intell. Fuzzy Syst. 2017, 32, 2033–2050. [Google Scholar] [CrossRef]
  24. Van Eck, N.J.; Waltman, L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics 2010, 84, 523–538. [Google Scholar] [CrossRef]
  25. Small, H. Co-citation in the scientific literature: A new measure of the relationship between two documents. J. Am. Soc. Inf. Sci. 1973, 24, 265–269. [Google Scholar] [CrossRef]
  26. Kessler, M.M. Bibliographic coupling between scientific papers. J. Assoc. Inf. Sci. Technol. 1963, 14, 10–25. [Google Scholar] [CrossRef]
  27. Trainor, C. Open source, crowd source: Harnessing the power of the people behind our libraries. Program 2009, 43, 288–298. [Google Scholar] [CrossRef]
  28. Pennycook, G.; Rand, D.G. The psychology of fake news. Trends Cogn. Sci. 2021, 25, 388–402. [Google Scholar] [CrossRef]
  29. Wu, D.; Rosen, D.W.; Wang, L.; Schaefer, D. Cloud-Based design and manufacturing: A new paradigm in digital manufacturing and design innovation. Comput.-Aided Des. 2015, 59, 1–14. [Google Scholar] [CrossRef]
  30. Steelman, Z.R.; Hammer, B.I.; Limayem, M. Data collection in the digital age. MIS Q. 2014, 38, 355–378. [Google Scholar] [CrossRef]
  31. Tangcharoensathien, V.; Calleja, N.; Nguyen, T.; Purnat, T.; D’Agostino, M.; Garcia-Saiso, S.; Landry, M.; Rashidian, A.; Hamilton, C.; AbdAllah, A.; et al. Framework for managing the COVID-19 Infodemic: Methods and results of an online, crowdsourced WHO technical consultation. J. Med. Internet Res. 2020, 22, e19659. [Google Scholar] [CrossRef]
  32. Goodchild, M.F. Citizens as Sensors: The world of volunteered geography. GeoJournal 2007, 69, 211–221. [Google Scholar] [CrossRef]
  33. Estellés-Arolas, E.; González-Ladrón-De-Guevara, F. Towards an integrated crowdsourcing definition. J. Inf. Sci. 2012, 38, 189–200. [Google Scholar] [CrossRef]
  34. Zook, M.; Graham, M.; Shelton, T.; Gorman, S. Volunteered geographic information and crowdsourcing disaster relief: A case study of the Haitian earthquake. World Med. Health Policy 2010, 2, 7–33. [Google Scholar] [CrossRef]
  35. Bonney, R.; Cooper, C.B.; Dickinson, J.; Kelling, S.; Phillips, T.; Rosenberg, K.V.; Shirk, J. Citizen science: A developing tool for expanding science knowledge and scientific literacy. BioScience 2009, 59, 977–984. [Google Scholar] [CrossRef]
  36. Goodchild, M.F.; Li, L. Assuring the quality of volunteered geographic information. Spat. Stat. 2012, 1, 110–120. [Google Scholar] [CrossRef]
  37. Haklay, M. How good is volunteered geographical information? A comparative study of OpenStreetMap and Ordnance Survey datasets. Environ. Plan. B Plan. Des. 2010, 37, 682–703. [Google Scholar] [CrossRef]
  38. Afuah, A.; Tucci, C.L. Crowdsourcing as a solution to distant search. Acad. Manag. Rev. 2012, 37, 355–375. [Google Scholar] [CrossRef]
  39. Fornell, C.; Larcker, D.F. Evaluating structural equation models with unobservable variables and measurement error. J. Mark. Res. 1981, 18, 39–50. [Google Scholar] [CrossRef]
  40. Brabham, D.C. Moving the crowd at threadless: Motivations for participation in a crowdsourcing application. Inf. Commun. Soc. 2010, 13, 1122–1145. [Google Scholar] [CrossRef]
  41. Brabham, D.C. Crowdsourcing; MIT Press: Cambridge, MA, USA, 2013. [Google Scholar]
  42. Howe, J. Crowdsourcing: Why the Power of the Crowd Is Driving the Future of Business; Crown Business: New York, NY, USA, 2009. [Google Scholar]
  43. Braun, V.; Clarke, V. One Size Fits All? What counts as quality practice in (reflexive) thematic analysis? Qual. Res. Psychol. 2021, 18, 328–352. [Google Scholar] [CrossRef]
  44. Mollick, E. The Dynamics of crowdfunding: An exploratory study. J. Bus. Ventur. 2014, 29, 1–16. [Google Scholar] [CrossRef]
  45. Belleflamme, P.; Lambert, T.; Schwienbacher, A. crowdfunding: Tapping the right crowd. J. Bus. Ventur. 2014, 29, 585–609. [Google Scholar] [CrossRef]
  46. Gao, H.; Barbier, G.; Goolsby, R. Harnessing the crowdsourcing power of social media for disaster relief. IEEE Intell. Syst. 2011, 26, 10–14. [Google Scholar] [CrossRef]
  47. Majchrzak, A.; Malhotra, A. Towards an information systems perspective and research agenda on crowdsourcing for innovation. J. Strateg. Inf. Syst. 2013, 22, 257–268. [Google Scholar] [CrossRef]
  48. Zhao, Y.; Zhu, Q. Evaluation on crowdsourcing research: Current status and future direction. Inf. Syst. Front. 2014, 16, 417–434. [Google Scholar] [CrossRef]
  49. Nonaka, I.; Takeuchi, H. The Knowledge-Creating Company: How Japanese Companies Create the Dynamics of Innovation; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
  50. Davenport, T.H.; Prusak, L. Working Knowledge: How Organizations Manage What They Know; Harvard Business Press: Boston, MA, USA, 1998. [Google Scholar]
  51. Ghezzi, A.; Gabelloni, D.; Martini, A.; Natalicchio, A. Crowdsourcing: A review and suggestions for future research. Int. J. Manag. Rev. 2018, 20, 343–363. [Google Scholar] [CrossRef]
  52. Kietzmann, J.H. Crowdsourcing: A revised definition and an introduction to new research. Bus. Horiz. 2017, 60, 151–153. [Google Scholar] [CrossRef]
  53. Gkeli, M.; Potsiou, C. 3D Crowdsourced parametric cadastral mapping: Pathways integrating BIM/IFC, crowdsourced data and LADM. Land Use Policy 2023, 131, 106713. [Google Scholar] [CrossRef]
  54. Liu, P.; Zhao, T.; Luo, J.; Lei, B.; Frei, M.; Miller, C.; Biljecki, F. Towards human-centric digital twins: Leveraging computer vision and graph models to predict outdoor comfort. Sustain. Cities Soc. 2023, 93, 104480. [Google Scholar] [CrossRef]
  55. Fieseler, C.; Bucher, E.; Hoffmann, C.P. Unfairness by Design? the perceived fairness of digital labor on crowdworking platforms. J. Bus. Ethics 2019, 156, 987–1005. [Google Scholar] [CrossRef]
  56. Zhang, L.; Wang, X.; Lin, H.; Piran, M.J. A Crowdsourcing logistics solution based on digital twin and four-party evolutionary game. Eng. Appl. Artif. Intell. 2024, 130, 107797. [Google Scholar] [CrossRef]
  57. Hajiheydari, N.; Delgosha, M.S. Citizens’ support in social mission platforms: Unravelling configurations for participating in civic crowdfunding platforms. Technol. Forecast. Soc. Change 2023, 189, 122366. [Google Scholar] [CrossRef]
  58. Gong, X.; Song, M.; Wang, S.; Jiao, R.J.; Helo, P. Crowdsourced Manufacturing in Industry 4.0: Implications and Prospects. Systems 2025, 13, 183. [Google Scholar] [CrossRef]
  59. Taylor, J.; Joshi, K.D. Joining the crowd: The career anchors of information technology workers participating in crowdsourcing. Inf. Syst. J. 2019, 29, 641–673. [Google Scholar] [CrossRef]
  60. Narangajavana, Y.; Gonzalez-Cruz, T.; Garrigos-Simon, F.J.; Cruz-Ros, S. Measuring social entrepreneurship and social value with leakage. definition, analysis and policies for the hospitality industry. Int. Entrep. Manag. J. 2016, 12, 911–934. [Google Scholar] [CrossRef]
  61. Certoma, C.; Corsini, F.; Rizzi, F. Crowdsourcing urban sustainability. data, people and technologies in participatory governance. Futures 2015, 74, 93–106. [Google Scholar] [CrossRef]
  62. Elmay, F.; Kadadha, M.; Mizouni, R.; Singh, S.; Otrok, H.; Mourad, A. Digital twins and dynamic NFTs for blockchain-based crowdsourced last-mile delivery. Inf. Process. Manag. 2024, 61, 103756. [Google Scholar] [CrossRef]
Figure 1. The annual Web of Science (WoS) publications in crowdsourcing and digital information. The dark blue column shows the number of publications per year in WoS crowdsourcing and digital information research. The orange line shows the number of citations per year. The WoS data.
Figure 1. The annual Web of Science (WoS) publications in crowdsourcing and digital information. The dark blue column shows the number of publications per year in WoS crowdsourcing and digital information research. The orange line shows the number of citations per year. The WoS data.
Information 16 00536 g001
Figure 2. The co-citation of cited references on CDI (till 2025), showing 103 references of the 33,530 cited references that meet the threshold of the minimum number of citations of a cited reference of 6.
Figure 2. The co-citation of cited references on CDI (till 2025), showing 103 references of the 33,530 cited references that meet the threshold of the minimum number of citations of a cited reference of 6.
Information 16 00536 g002
Figure 3. Journal co-citation network on CDI, including 85 main journals of the 16,279 cited sources by the documents regarding CDI that meet the threshold of the minimum number of citations of a cited source of 40.
Figure 3. Journal co-citation network on CDI, including 85 main journals of the 16,279 cited sources by the documents regarding CDI that meet the threshold of the minimum number of citations of a cited source of 40.
Information 16 00536 g003
Figure 4. (a) A co-occurrence network of author keywords of CDI-related publications The figure considers a threshold of three occurrences, which shows 73 keywords before 2023 with the most frequent co-occurrences, of 1636 keywords. (b) A co-occurrence network of author keywords of CDI-related publications (2023–2025). The figure considers a threshold of three occurrences, which shows 38 keywords with the most frequent co-occurrences, of 1103 keywords.
Figure 4. (a) A co-occurrence network of author keywords of CDI-related publications The figure considers a threshold of three occurrences, which shows 73 keywords before 2023 with the most frequent co-occurrences, of 1636 keywords. (b) A co-occurrence network of author keywords of CDI-related publications (2023–2025). The figure considers a threshold of three occurrences, which shows 38 keywords with the most frequent co-occurrences, of 1103 keywords.
Information 16 00536 g004
Figure 5. Author co-citation network on CDI, including 99 authors of the 2586 cited authors, which meet the threshold of the minimum number of citations of a cited author of 12.
Figure 5. Author co-citation network on CDI, including 99 authors of the 2586 cited authors, which meet the threshold of the minimum number of citations of a cited author of 12.
Information 16 00536 g005
Figure 6. Bibliographic coupling of authors on CDI, including 20 authors, of the 2657 authors, that meet the threshold of the minimum number of documents of an author of 3.
Figure 6. Bibliographic coupling of authors on CDI, including 20 authors, of the 2657 authors, that meet the threshold of the minimum number of documents of an author of 3.
Information 16 00536 g006
Table 1. General structure of citations in crowdsourcing and digital information.
Table 1. General structure of citations in crowdsourcing and digital information.
Number of CitationsNumber of ArticlesCumulative n. of Articles % Articles% Accumulated Articles
≥200881.181.18
≥1505130.741.91
≥10010231.473.39
≥5037605.458.84
≥25631239.2818.11
≥1015427722.6840.80
<1040267959.20100.00
Total679
Source: Own elaboration based on WoS 2025.
Table 2. The top 20 papers with the most citations in CDI.
Table 2. The top 20 papers with the most citations in CDI.
Papers with the Most Citations in QT
RJournalTCArticleAuthorsYearCY
1TCS550“The Psychology of Fake News”Pennycook, G and Rand, DG202113.50
2CAD456“Cloud-based design and manufacturing: A new paradigm in digital manufacturing and design innovation”Wu, DZ; Rosen, DW; (...); Schaefer, D201545.60
3MQ307“Computer-Aided Design”Steelman, ZR; Hammer, BI and Limayem, M201427.91
4JMIR296“Framework for Managing the COVID-19 Infodemic: Methods and Results of an Online, Crowdsourced WHO Technical Consultation”Tangcharoensathien, V; Calleja, N; (...); Briand, S202059.20
5IATN268“Incentive Mechanisms for Crowdsensing: Crowdsourcing With Smartphones”Yang, DJ; Xue, GL; (...); Tang, J201629.78
6EAPD230“Automated identification and characterization of parcels with OpenStreetMap and points of interest”Liu, XJ and Long, Y201625.56
7ITPAM213“Demographic Estimation from Face Images: Human vs. Machine Performance”Han, H; Otto, C; (...); Jain, AK201521.30
8WRR204“The NorWeST Summer Stream Temperature Model and Scenarios for the Western US: A Crowd-Sourced Database and New Geospatial Tools Foster a User Community and Predict Broad Climate Warming of Rivers and Streams”Isaak, DJ; Wenger, SJ; (...); Parkes-Payne, S201725.50
9PNAS196“Next-generation Digital Earth”Goodchild, MF; Guo, HD; (...); Woodgate, P201215.08
10ITAC191“Building Naturalistic Emotionally Balanced Speech Corpus by Retrieving Emotional Speech from Existing Podcast Recordings”Lotfian, R and Busso, C201931.83
11NTWE186“Amazon Mechanical Turk and the commodification of labour”Bergvall-Kåreborn, B and Howcroft, D201416.91
12ARPH176“Social Media- and Internet-Based Disease Surveillance for Public Health”Aiello, AE; Renson, A and Zivich, PN202035.20
13B161“Structured crowdsourcing enables convolutional segmentation of histology images”Amgad, M; Elfandy, H; (...); Cooper, LAD201926.83
14RS145“Sentinel-1 InSAR Coherence to Detect Floodwater in Urban Areas: Houston and Hurricane Harvey as A Test Case”Chini, M; Pelich, R; (...); Matgen, P201924.17
15JMIR132“Towards an Artificially Empathic Conversational Agent for Mental Health Applications: System Design and User Perceptions”Morris, RR; Kouddous, K; (...); Schueller, SM201818.86
16SD131“The COUGHVID crowdsourcing dataset, a corpus for the study of large-scale cough analysis algorithms”Orlandic, L; Teijeiro, T and Atienza, D202132.75
17RDM128“Digital transformation, for better or worse: a critical multi-level research agenda”Dabrowska, J; Almpanopoulou, A; (...); Ritala, P202242.67
18AMCC128“Noninvasive Fetal ECG: the PhysioNet/Computing in Cardiology Challenge 2013”Silva, I; Behar, J; (...); Moody, GB201310.67
19AAAG125“Citizen Science in the Age of Neogeography: Utilizing Volunteered Geographic Information for Environmental Monitoring”Connors, JP; Lei, SF and Kelly, M20129.62
20JBR124“Exploring how consumer goods companies innovate in the digital age: The role of big data analytics companies”Mariani, MM and Wamba, SF202024.80
Source: Own elaboration based on WoS 2025. R: ranking; TC: total citations; CY: citations per year. TCS: Trends in Cognitive Sciences; CAD: Computer-Aided Design; MS: MIS Quarterly; JMIR: Journal of Medical internet Research; IATN: IEEE-ACM Transactions on Networking; EAPD: Environment and Planning B-Planning & Design; ITPAM: IEEE Transactions on Pattern Analysis and Machine Intelligence; WRR: Water Resources Research; PNAS: Proceedings of the National Academy of Sciences of the United States of America; ITAC: IEEE Transactions on Affective Computing; NTWE: New Technology Work and Employment; ARPH: Annual Review of Public Health; B: Bioinformatics; RS: Remote Sensing; SD: Scientific Data; RDM: R&D Management; AMCC: 40th Annual Meeting on Computing in Cardiology; AAAG: Annals of the Association of American Geographers; JBR: Journal of Business Research.
Table 3. The top 13 journals with CDI publications.
Table 3. The top 13 journals with CDI publications.
RJournalAPCH-CTAPTCCACCPCC%APCIF≥200≥100≥50≥20
1IIJG106529112412412.400.192.8 2
2S10667,79617317317.300.013.4 12
3IA8392,96136364.500.013.4
4JMIR85870852052065.000.095.81223
5TFSC86620812011915.000.1312.9 11
6SU7590,218858512.140.013.3 2
7AJCCH6247826264.331.262.1
8IITJ6411,18326264.330.058.2
9PO66304,80620720734.500.002.9 14
10SD64559717717729.500.115.8 112
11JPHS54144318118036.200.353.5 113
12JAIST541758616112.200.282.8 1
13RS5436,95525625351.200.014.2 123
Source: Own elaboration based on WoS 2025. R: ranking; H-C: the H index in the area of crowdsourcing and digital information; APC: articles published in CDI.; TAP: total articles published (2009–2025); TCC: total citations in CDI: ACC: articles cited in CDI; PCC: average of cites by articles in CDO; %APC: percentage of articles published in CDI (APC/TAP); IF: impact factor; ≥200, ≥100, ≥50 and ≥20: articles with more of 200, 100, 50 and 20 citations; IIJG: ISPRS International Journal of Geo-Information; S:Sensors; IA: IEEE Access; JMIR: Journal of Medical Internet Research; TFSC: Technological Forecasting and Social Change; SU: Sustainability; AJCCH: ACM Journal on Computing and Cultural Heritage; IITJ: IEEE Internet of Things Journal; PO: Plos One; SD: Scientific Data; JPHS: JMIR Public Health and Surveillance; JAIST: Journal of the Association for Information Science and Technology; RS: Remote Sensing.
Table 4. The top author keyword co-occurrence of CDI-related publications.
Table 4. The top author keyword co-occurrence of CDI-related publications.
RKeyword (2009–2022)OcCoKeyword (2023–2025)OcCo
1crowdsourcing187192crowdsourcing6356
2citizen science2435crowdfunding1711
3social media2244digital platforms1110
4big data1536machine learning109
5Vgi1522blockchain920
6digital humanities1420digital twin910
7innovation913cultural heritage88
8COVID-19713task analysis720
9internet76citizen science66
10internet of things79deep learning67
11machine learning76digital health53
12open innovation711online communities56
13emergency management617social media54
14openstreetmap69artificial intelligence45
15sustainability68crowdwork43
16cloud computing58digital transformation44
17collaboration58photogrammetry43
18digital humanitarianism59privacy412
19privacy55affordances33
20smartphone58authentication37
21surveillance59built environment31
22artificial intelligence413case study35
23co-creation46citizen participation30
24crisis informatics417COVID-1931
25critical cartography46digital entrepreneurship32
26crowdsensing44digital heritage32
27data collection46entrepreneurship35
28digital health48fairness35
29digital libraries410information sharing37
30gamification44misinformation35
31metadata46Mturk33
32online communities48reinforcement learning35
33participatory sensing47reliability37
34research libraries46sensors39
36Twitter47sharing economy31
smart manufacturing35
smartphone32
surveillance36
Source: Own elaboration based on WoS 2025. R: rank; Oc: author keyword occurrences; Co: author keyword co-occurrence link. In keyword columns—in yellow color: the keywords that enter the ranking (there are not in the first period but are in the second); in green: the ones that increase in the second period; in orange: the ones that decrease in frequency; in red: the ones that disappear from the ranking in the second period. In the column of occurrences (second period), the colors indicate the various clusters.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Garrigos-Simon, F.J.; Narangajavana-Kaosiri, Y. Crowdsourcing and Digital Information: Looking for a Future Research Agenda. Information 2025, 16, 536. https://doi.org/10.3390/info16070536

AMA Style

Garrigos-Simon FJ, Narangajavana-Kaosiri Y. Crowdsourcing and Digital Information: Looking for a Future Research Agenda. Information. 2025; 16(7):536. https://doi.org/10.3390/info16070536

Chicago/Turabian Style

Garrigos-Simon, Fernando J., and Yeamduan Narangajavana-Kaosiri. 2025. "Crowdsourcing and Digital Information: Looking for a Future Research Agenda" Information 16, no. 7: 536. https://doi.org/10.3390/info16070536

APA Style

Garrigos-Simon, F. J., & Narangajavana-Kaosiri, Y. (2025). Crowdsourcing and Digital Information: Looking for a Future Research Agenda. Information, 16(7), 536. https://doi.org/10.3390/info16070536

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop