Digital Technologies and Open Data Sources in Marine Biotoxins’ Risk Analysis: The Case of Ciguatera Fish Poisoning

Abstract Currently, digital technologies influence information dissemination in all business sectors, with great emphasis put on exploitation strategies. Public administrations often use information systems and establish open data repositories, primarily supporting their operation but also serving as data providers, facilitating decision-making. As such, risk analysis in the public health sector, including food safety authorities, often relies on digital technologies and open data sources. Global food safety challenges include marine biotoxins (MBs), being contaminants whose mitigation largely depends on risk analysis. Ciguatera Fish Poisoning (CFP), in particular, is a MB-related seafood intoxication attributed to the consumption of fish species that are prone to accumulate ciguatoxins. Historically, CFP occurred endemically in tropical/subtropical areas, but has gradually emerged in temperate regions, including European waters, necessitating official policy adoption to manage the potential risks. Researchers and policy-makers highlight scientific data inadequacy, under-reporting of outbreaks and information source fragmentation as major obstacles in developing CFP mitigation strategies. Although digital technologies and open data sources provide exploitable scientific information for MB risk analysis, their utilization in counteracting CFP-related hazards has not been addressed to date. This work thus attempts to answer the question, “What is the current extent of digital technologies’ and open data sources’ utilization within risk analysis tasks in the MBs field, particularly on CFP?”, by conducting a systematic literature review of the available scientific and grey literature. Results indicate that the use of digital technologies and open data sources in CFP is not negligible. However, certain gaps are identified regarding discrepancies in terminology, source fragmentation and a redundancy and downplay of social media utilization, in turn constituting a future research agenda for this under-researched topic.


Introduction
The rapid acceleration of digital technologies, evidenced more intensely during the past decade, globally permeates every private and public organization, transforming their daily working practices, at the same time reshaping social interactions and citizens' expectations [1,2]. Digital tools, among which the Internet, social media, mobile computing, big data, data analytics, and numerous others, open up a fascinating world of innovation opportunities with a significant impact on multiple aspects of contemporary societies [1][2][3]. This overwhelming penetration of information and communication technologies (ICTs) in everyday life is altering the information sharing preconditions and can technically support more collaborative cultures of information production and dissemination, thus shifting the focus from technology itself to strategies for its exploitation [4].
ing a roster of openly accessible information sources for CFP, could facilitate the efforts to tackle the weaknesses identified.
Doubtlessly, developments in digital technologies and open data sources amplified the volume of potentially exploitable scientific information for MBs risk analysis purposes. However, bibliographical references on the advancements achieved with the participation of such means in counteracting CFP-related hazards are scattered, whereas, to date, no substantial summary or research study has cumulatively investigated their utilization. The problem addressed in the present work thus relates to examining digital technologies' and open data sources' utilization in CFP research associated with risk analysis tasks. The research question answered in this review, therefore, is "What is the current extent of digital technologies' and open data sources' utilization within risk analysis tasks in the MBs field, particularly on CFP?" The absence of targeted review articles and scarcity of structured information on the topic necessitated an in-depth literature investigation, within both peer-reviewed publications and grey literature documents to accomplish this study. For the purposes of this review, grey literature is defined according to the Prague definition, "manifold document types produced on all levels of government, academics, business and industry in print and electronic formats that are protected by intellectual property rights, of sufficient quality to be collected and preserved by libraries and institutional repositories, but not controlled by commercial publishers; i.e., where publishing is not the primary activity of the producing body" [17] (p.11), typically including "conference abstracts, presentations, proceedings; regulatory data; unpublished trial data; government publications; reports (such as white papers, working papers, internal documentation); dissertations/theses; patents; and policies & procedures" [18] (para.2).
The structure of the review is as follows: the next section provides brief background knowledge on the concepts of digital technologies, open data sources and risk analysis, viewed from a public health and food safety perspective, to assist in determining the appropriate keywords for the literature investigation. The subsequent two sections describe the research methodology employed and present the bibliographical analysis results. Finally, the findings are discussed, and relevant future research is suggested.

Digital Technologies
Digital technologies are broadly defined as "combinations of information, computing, communication, and connectivity technologies" [19]. An initial review of recent research mainly focusing on the public health and food safety contexts, but also at wider level, reveals that concepts such as 'technology', 'digital technologies', 'ICT', 'information technologies', 'digital media' and 'digital tools' are used interchangeably to refer to a broad set of digital devices and applications, such as websites, databases, blogs, online platforms, mobile/wearable devices, mobile phones, social media and the Internet [20][21][22][23][24]. Digital technologies/ICTs are also strongly intertwined with the 'digital transformation' and 'digitalization' concepts. Indeed, 'digital transformation' is defined as the use of digital technologies/ICTs to enable changes and improvements for achieving business and/or organizational goals [25] and 'digitalization' as the sociotechnical process of using digital infrastructures [1]. In this context, the terms 'digital transformation' and 'digitalization' may also constitute relevant keywords for investigating digital technologies utilization, as they are linkable to improvements and changes in work processes of organizations responsible for risk analyses. Digital technologies' proliferation enhances the quality and quantity of daily generated data, creating conditions of information abundance, able to significantly facilitate public authorities' decision-and policy-making processes [19,26].

Open Data Sources
Open data refer to "non-privacy-restricted and non-confidential data produced with public money by public and/or private organizations and made available without any usage or distribution restrictions" [6] (p. 258). Open data, frequently termed also as 'public data', can be enriched with data from other sources, resulting in the emergence of large datasets, known as 'big data' [7]. The latter present specific needs for processing, curation, linking, visualization and maintenance, as their sizes overpass common software tools' abilities, whereas value is generated by the combination of different datasets [19,26].
Public policy development frequently relies on 'open data' and 'big data' availability, being indispensable tools for public organizations. Ample open data in diverse formats are stored in repositories on national or international organizations' websites and also can be exploited by other public institutions, thus counteracting unnecessary duplication and associated costs [6]. However, food safety data and information are generally scattered across the food, health and agriculture sectors, with limited interoperability. Consequently, public authorities in charge of food safety-related risk analysis tasks ordinarily resort to multiple open access scientific resources, such as research project websites, online databases, open-access journals, dissertations or other published material, to obtain upto-date technical information. Efficient access to such sources is granted by the growth of digital technologies [27,28]. For the purposes of the present review, the 'open data sources' concept will also extend to 'big data', including those forms of 'open-source' and 'openaccess' scientific data and software freely available in the public domain [7]. Consequently, the search for appropriate information based on the keywords selected, will also encompass results of common Internet search engines, besides the literature databases [29].

Risk Analysis in Food Safety
Risk analysis is a powerful science-based tool for reaching sound, consistent solutions to food safety problems. The Codex Alimentarius Commission defines risk analysis in a food safety context as "a process consisting of three components: risk assessment, risk management, and risk communication" [30] (p. 120). More precisely, risk analysis in food safety is a systematic, disciplined decision-making approach, used to estimate human health and safety risks, to identify and implement appropriate measures for risk control, and to communicate with stakeholders about the risks and measures applied [31]. 'Risk assessment' is the science-based component of risk analysis, comprising hazard identification and characterization, exposure assessment and risk characterization. 'Risk management', on the other hand, involves weighing policy alternatives in consultation with relevant stakeholders, according to the risk assessment outcomes and other factors relevant for consumers' health protection, towards selecting appropriate prevention and control options. Lastly, 'risk communication' entails an "interactive exchange of information and opinions throughout the risk analysis process concerning risk, risk-related factors and risk perceptions, among risk assessors, risk managers, consumers, industry, the academic community and other interested parties, including the explanation of risk assessment findings and the basis of risk management decisions" [30,31].
Food safety risk analyses are carried out by national, regional and international authorities, depending on the nature and localization of the specific risk examined [31]. Scientific knowledge on the food issue identified is considered a prerequisite for successful risk analysis; therefore, aggregation of the largest possible appropriate datasets is essential [28]. Strategies to obtain data on food contaminant issues, particularly MBs, require multidisciplinary approaches combining scientific information from fields such as environmental sciences, biology, chemistry, veterinary science, public administration, epidemiology, public health and toxicology. Data collection can present significant difficulties due to frequent gaps identified in information availability; in this context, the exploitation of digital technologies and open/big data sources may catalyze these efforts [10,28,31].

Literature Research Method
The current state of digital technologies and open data utilization in the field of CFP risk analysis was envisaged by a systematic literature review conducted according to previously established principles [29,32]. Three main steps were followed: (i) selecting appropriate keywords and combinations thereof; (ii) choosing source database(s) and running the searches; and (iii) analyzing the results.
The literature review protocol employed is detailed in Table 1. The focus period was set from 2010 to date (mid 2021). The main keywords identified within the background section were divided into five groups, namely, "Digital technologies", "Open data", "Risk analysis", "Biotoxins" and "Ciguatera", according to the concepts comprising the research topic. Each of the keywords from Groups 1-3 was combined with one or more keywords from the remaining two groups to retrieve the articles of interest, utilizing the Boolean operators "AND" and "OR" (on a case basis) to produce more focused results. Searches were performed separately for each combination of keywords and applied to the journals' abstracts, title and keywords, using the Scopus abstract and citation database of peerreviewed literature. All subject areas were selected, due to the multidisciplinary character of this research. This strategy yielded only one result when the keywords of the "Digital technologies" or "Open data" groups were combined with keywords of the "Biotoxins" or "Ciguatera" groups. A much higher total number of articles was obtained, as expected, when the "Risk analysis" group keywords were looked-up in combination to those of the "Biotoxins" and "Ciguatera" groups. Searches were merged, and after removing the duplicates, 88 articles of multiple types and subject areas remained ( Figure 1). The articles' abstracts were carefully read to assess their relevance and to exclude articles containing the selected keywords in another semantic way, shortening down the list to 28 articles. After full-text examination for the presence of appropriate information, more were excluded as "out of topic", with only 11 studies remaining, a rather expectable outcome considering the narrowness of the field and the specialized nature of the research topic. An additional search in the "Pubmed" database, using the same keyword combinations, yielded five further articles. Thereafter, a thorough Google search was conducted, combining in pairs all the above keywords and some additional terms (e.g., database, smartphone, website, satellite imaging, machine learning), to obtain further material from both the scientific and grey literature, such as press releases, health and fishery authorities' websites, local media, project documents, codes of practice, etc. Finally, reference lists of all selected documents were reviewed to find other articles of interest, whereas their citations in later publications were also evaluated for inclusion in this review [32].  Table 1 for details).
Articles considered relevant contained at least one reference to data input for CFP risk analysis or its individual components (assessment, management, communication) obtained by means of digital technologies and/or open data sources, such as websites, databases, software, social media, specific pieces of digital equipment, etc. It is noted that this research only considers digital equipment utilization in terms of mass-market tools, such as computers and portable digital devices (e.g., notebooks, smartphones, tablets); the use of sophisticated analytical equipment, such as liquid chromatographs and mass spectrometers, although largely incorporating digital components (computerized appliances, support PCs and processing software), is beyond the scope of this work. Similarly, statistical analysis software packages, as well as common office-computer software for word processing, spreadsheets creation, etc., are not included in this literature review, as their use is a prerequisite in CFP data generation. In this context, the above strategy resulted in a final list of 38 articles, of which only 19 were openly accessible to the regular public. In the next step, information of relevance was abstracted from the selected documents and contents were analyzed within the identified research concepts' framework, as presented in the following section.

Result
Keywords found in the 38 articles meeting the eligibility criteria are summarized in Table 2. Notably, references connected to digital technologies were fewer than those categorized within the open data sources concept, with 16 and 33 articles, respectively, whereas 11 articles contained keywords of both groups. 'Database' was the keyword most encountered, with 28 articles, while the highest incidence keyword combination was 'website'-'database', with five articles. Further details are provided in Supplementary Materials Table S1.

Digital Technologies
Only three results [33][34][35] were finally obtained using the exact keywords indicated within the digital technologies concept, combined with those related to ciguatera and risk analysis (Table 1); on the other hand, the extended search for specific digital tools retrieved 13 studies containing the terms 'software', 'smartphone' and 'website', as semantic content relevant to the production, processing and/or communication of the data necessary for CFP risk analysis (Table 2). Interestingly, only two articles combining 'social media' and 'ciguatera' within the context of risk analysis were retrieved, despite the existence of several CFP-relevant Facebook and Twitter accounts (Supplementary Materials Table S2) and the popularity of social media [34,35]. The first one referred to social media mechanisms for food/waterborne complaints surveillance and indicated specific social media accounts serving this purpose [34]. The second one only mentioned the appearance of anecdotal reports of CFP cases on social media, such as online fishing for a, where fishers comment on their own experiences providing the opportunity for broader data collection and risk communication, but without pointing to any specific social media accounts [35].
The term 'software' in risk analysis-related CFP studies primarily concerned programs used for molecular/phylogenetic identification of ciguateric fish and CTX-producing microalgae and secondly web applications assisting record-keeping and communication regarding the presence of ciguateric fishes in trade operations [36][37][38]. Accurate identification of high-risk fish species implicated in CFP and the ability to prevent these from reaching the market, according to regional legislative requirements, are critical in CFP risk assessment, management and communication; therefore, software-based tools can facilitate risk analysis processes [39,40].
Generally, instances of 'website' in the selected articles referred to governmental and organizations' internet pages containing diverse scientific information, including CFP case reports, epidemiological and environmental data, outbreaks occurrence and advice to consumers, as well as other public health data, all being major inputs to CFP risk analysis components [11,39,[41][42][43][44][45]. Nevertheless, 'website' was also used by some authors to denote any type of online-available content, such as public databases or even open data portals (Table 3) [41,44,45]. Furthermore, although fishing bans related to geographical origin (known toxic locations), high-risk fish species and fish size restrictions constitute fundamental measures in terms of CFP risk management in endemic areas [11], often communicated to relevant stakeholders through designated websites, social media or applications belonging to public agencies, no relevant articles were retrieved in the literature (scientific or grey) referring to these specific risk communication actions.
Widely marketed digital tools, such as smartphones, have recently emerged as attractive analytical platforms, which in the future may revolutionize food safety control by enabling citizens without any expertise to perform screening tests [46]. A number of smartphone-based devices or assays have already been developed for various contaminants, including marine toxins [28,46,47] and CTXs, in particular [48]. It should be noted that, currently, smartphones cannot be used on their own to detect food contaminants, without the contribution of some auxiliary part or hand-held device, such as portable electrochemical or optical sensors [28,[47][48][49]. However, they possess independent power sources, computing power, flash-light cameras (i.e., optical systems with constant light sources), web access and wireless data communication, being powerful alternative analytical tools, able to radically change food testing. Although smartphone apps for CTXs testing are not yet commercially available, the future ability of consumers to screen fish for CFP is expected to improve food security and increase public awareness, facilitating also risk assessment and management [47,49].

Open Data Sources
Occurrence of keywords belonging to the 'open data sources' group combined to 'ciguatera' was extensively searched, but no studies were found containing 'open data', 'public data' and 'open source', whereas only one publication (a Master's thesis) included the term 'big data' [50]. On the other hand, searching specifically for 'database', after exclusion of instances related to literature/journal databases, resulted in 28 publications containing at least one reference to a data source compliant to the 'open data sources' concept of the present work [11,14,36,37,40,41,[43][44][45][51][52][53][54][55][56][57][58][59][60][61][62][63][64][65][66]. Another relevant term encountered in a semantic fitting the concept was 'dataset' [67,68], a term frequently used interchangeably to 'database' [69], while the more general term 'data' was the only one present in other works containing records of CFP incidents derived from public databases [39,70]. A cumulative presentation of the open data sources found in the selected studies is included in Table 3, along with the geographic coverage and an attempt to categorize source types in compliance with the concept description of the present work, using terms as 'open data portal', 'open documents repository', 'public/open source software', etc. This summary is provided in order to explicitly demonstrate the extent, diversity and fragmentation of the available sources, as well as the type of data available for risk analysis purposes, but also to facilitate future CFP research with regard to data retrieval. To our knowledge, all sources included in Table 3 are openly accessible to the regular public, although in some cases a user registration may be required.
The variety of open data source types found in the studied literature (Table 4) indicates that the data derived thereof are sufficiently exploited in the field of CFP research and risk analysis. Evidently though, the terms 'open data' 'public data', 'open source' and 'big data', commonly used in relevant social sciences' research, are practically unknown to authors involved in this field. On the other hand, 'database' was the most frequently used term to describe such information sources, with some articles specifically referring to databases as 'public' [33,55,56], 'web-based' [34], 'online' [59], 'internet' [60], 'electronic' [61] or 'open access' [65], whereas 'online data' was also used in one case [43].        [11] (1) All links accessed on 25 September 2021. (2) "PW" indicates absent/broken/obsolete links in referenced works retrieved/updated by the present work. Geographical coverage of the open data sources found in the selected articles ranged from worldwide to regional, with the majority of non-global coverage sources focusing their data on areas located in the American and Oceania continents, where CFP is long encountered and considered endemic. In contrast, sources targeting for instance European countries, where CFP issue has recently emerged, are scarcer.
Plurality in open data sources of a similar nature containing data on different regions is also noteworthy, indicating that efforts to collect data, especially those related to CFP surveillance, epidemiology, case reports and outbreaks incidence, are localized and fragmented, even within the same country, such as the data sources of different states within the USA. Conversely, the evident absence of instances of open data sources in certain CFP-susceptible areas of the world, such as some African and Asian countries of the West Indian Ocean, is also notable. Significant redundancies are also encountered, primarily with regard to climate data, and sea surface temperatures in particular, where at least five different sources are available at a worldwide level. Similarly, at least four different open sources exist for fish or algal species taxonomy and identification. As such, policy-makers and researchers undertaking international risk analysis tasks are commonly obliged to resort to multiple information sources and spend considerable time to obtain the required amount of data. On the other hand, discrepancies may also occur between data from different sources, the resolution of which may create an additional burden in order to obtain acceptable data quality for risk analysis purposes.

Research Question Revisited
This review addressed the research question, "What is the current extent of digital technologies' and open data sources' utilization within risk analysis tasks in the MBs field, particularly on CFP?" Although the commonly expected terminology was almost absent in the relevant bibliography, modifying the search keywords revealed the existence of several CFP risk analysis-related publications, 38 in total, where the data input originated from the use of diverse digital tools and sources. As such, it appears that the current utilization of digital technologies and open data sources in the investigated field is generally not negligible, which reasonably answers the research question.

Further Remarks on the Findings
The aforementioned findings demonstrate that exploitation of digital technologies and open data sources in CFP risk analysis and policy-making studies is not negligible, with their utilization being more widespread in scientific works targeting CFP-endemic areas [40,42,44,45,[50][51][52]60,61,64,[66][67][68]70]. Nevertheless, the pronounced shortage of published works on CFP referring to common social sciences terminology, such as 'digital technologies', 'digital transformation', 'open data' and 'big data', in conjunction with the use of the general terms, e.g., 'website', 'database' and 'dataset', is indicative of an unfamiliarity with these terms regarding the scientific community creating/uploading information and datasets of interest on the Internet, as well as researchers utilizing the data obtained by these sources. Lack of uniformity between the social and natural sciences' terminology is not a new issue; in fact, it forms part of a long-observed general gap between social and natural sciences, thus highlighting the necessity to adopt more transdisciplinary and collaborative approaches across research fields belonging to environmental/marine sciences, toxicology, public health and social sciences [62,71,72].
The fragmented dispersion of data related to CFP surveillance, epidemiology and outbreaks occurrence encountered in the open data sources identified in this literature review, has also been suggested in previous studies. In fact, under-reporting or inconsistent and fragmented recording of CFP cases has been attributed to the absence of formal epidemiological and surveillance methods and a lack of clinical protocols and experience, whereas the need to establish an international register for CFP intoxication cases and also consolidate monitoring of HAB events at a global level is largely emphasized [11,43,53,55,62,73].
Surprisingly only two instances of 'social media' related to CFP [34,35] were found within both peer-reviewed and grey literature publications, suggesting that these digital tools could be underexploited in CFP risk analysis. In fact, food safety agencies already use social media, such as Facebook and Twitter, for risk communication with the general public on food safety issues [26,74], and CFP is no exception. Several CFP-relevant accounts already exist in social media (Table S2), and CFP risk communications, such as notifications of fishing bans or advice to fishers on species and areas at risk, are not uncommon, especially in CFP-endemic regions. On the other hand, online reporting of CFP cases in social media accounts [34], as well as exchange of CFP-related experiences through posts on fishing forums, are also frequent. Evidently, this does not seem to be adequately reflected in the literature, indicating that the impact of social media in the CFP field may constitute a scientific knowledge gap, requiring further research to elucidate their dynamics as dataproviding sources and communication tools in CFP policy-making.

Limitations, Conclusions and Future Research
To our knowledge, to date, no previous works have summarized the utilization of both digital technologies and open data sources in tasks relevant to risk analysis, regarding either MBs or specifically CFP. As such, this review constitutes an initial attempt towards documenting the utilization extent of these tools in CFP risk analysis, according to the currently available literature, but certainly cannot be considered an exhaustive summary of their contribution or an assessment of their effectiveness in this HAB management field. We anticipate this first theoretical approach to trigger further investigation, entailing empirical data, in order to provide concrete evidence on the extent of the interactions between developments in the digital world and their practical applications in the diverse natural sciences fields, including MBs and CFP in particular. In this context, a structured research strategy is required to thoroughly evaluate the impact level of such ICT tools in a qualitative and quantitative way. To achieve this objective, the following approaches are suggested: (1) Interviewing relevant stakeholders, such as experts, public administrators and researchers, involved in the field of CFP risk management, in order to assess (a) their degree of familiarization with the terminology related to digital technologies and open data sources; and (b) their understanding, own use and perception of specific digital technologies and open data sources. This assessment can be accomplished by means of structured questionnaires, containing both multiple-choice/close-ended (with a rating scale) and open-ended questions, as well as free statements, subsequently followed by content and statistical analysis of the responses obtained. Participation could also be expanded using online forms and/or email-invited questionnaires to more effectively target expert audiences.
(2) Introducing qualitative and quantitative criteria to create a framework for evaluating the impact of the given digital technologies and open data sources and subsequent application of this model to analyze the answers obtained within the context of the available literature. Unequivocally, capitalization of technological progress is the way forward to scientific progress in the modern world. On this basis, accessibility to and exploitation of digital tools and open/big data are synergistically expected to derive innovative applications and services, aiming to facilitate risk analysis and policy-making procedures in the field of food safety, similarly to the progress envisaged in the fisheries sector by the implementation of emerging data technologies, such as blockchain, data mining and artificial intelligence [75]. In the framework of the gaps identified within the present study, research towards consolidation of the currently fragmentary open data sources, such as epidemiological and HAB presence databases, at a worldwide level, can support more robust practices towards mitigation of the CFP problem. On the other hand, embracing the social media potential to strengthen data collection and enhance risk communication channels in the CFP sector is also considered crucial, and definitely requires further scientific research in order to both capture the benefits and tackle the challenges involved. Finally, and most importantly, transdisciplinary collaboration is essential to bridge the evident chasm between humanities and natural sciences, with establishing mutually accepted terminology and definitions for concepts of common interest as a starting point.

Supplementary Materials:
The following are available online at https://www.mdpi.com/article/ 10.3390/toxins13100692/s1, Table S1: Main keywords present in the selected articles related to the present review concepts, Table S2: Indicative social media accounts potentially relevant to CFP risk analysis.
Funding: This research received no external funding.