Next Article in Journal
Identification of Down-Regulated Proteome in Saccharomyces cerevisiae with the Deletion of Yeast Cathepsin D in Response to Nitrogen Stress
Next Article in Special Issue
Oxidative Stress Response of Aspergillus oryzae Induced by Hydrogen Peroxide and Menadione Sodium Bisulfite
Previous Article in Journal
Safe Cultivation of Medicago sativa in Metal-Polluted Soils from Semi-Arid Regions Assisted by Heat- and Metallo-Resistant PGPR
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Trends in Diatom Research Since 1991 Based on Topic Modeling

1
Institute for Ecological Research and Pollution Control of Plateau Lakes, School of Ecology and Environmental Science, Yunnan University, Kunming 650500, China
2
Yunnan Key Laboratory of International Rivers and Transboundary Eco-Security, Institute of International Rivers and Eco-Security Yunnan University, Kunming 650500, China
3
Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
*
Author to whom correspondence should be addressed.
Microorganisms 2019, 7(8), 213; https://doi.org/10.3390/microorganisms7080213
Submission received: 24 June 2019 / Revised: 15 July 2019 / Accepted: 22 July 2019 / Published: 24 July 2019
(This article belongs to the Special Issue Recent Advances in Applied Microbiology)

Abstract

:
Diatoms are fundamental carbon sources in a wide range of aquatic food webs and have the potential for wide application in addressing environmental change. Understanding the evolution of topics in diatom research will provide a clear and needed guide to strengthen research on diatoms. However, such an overview remains unavailable. In this study, we used Latent Dirichlet Allocation (LDA), a generative model, to identify topics and determine their trends (i.e., cold and hot topics) by analyzing the abstracts of 19,000 publications from the Web of Science that were related to diatoms during 1991–2018. A total of 116 topics were identified from a Bayesian model selection. The hot topics (diversity, environmental indicator, climate change, land use, and water quality) that were identified by LDA indicated that diatoms are increasingly used as indicators to assess water quality and identify modern climate change impacts due to intensive anthropogenic activities. In terms of cold topics (growth rate, culture growth, cell life history, copepod feeding, grazing by microzooplankton, zooplankton predation, and primary productivity) and hot topics (spatial-temporal distribution, morphology, molecular identification, gene expression, and review), we determined that basic studies on diatoms have decreased and that studies tend to be more comprehensive. This study notes that future directions in diatom research will be closely associated with the application of diatoms in environmental management and climate change to cope with environmental challenges, and more comprehensive issues related to diatoms should be considered.

1. Introduction

Diatoms (Class Bacillariophyceae) are a group of unicellular, photosynthetic, and eukaryotic algae with silica shells found in almost all types of habitats in freshwater and marine ecosystems [1,2]. Till now, totally more than 100,000 diatom species from over 1250 genera with diverse morphological forms in the world [3]. The large-scale ecological success of diatoms suggests that they have refined their cellular processes for efficient utilization of nutrients and sunlight to a greater extent than most other unicellular algae [2]. Although the individual size of diatoms is generally small (<200 μm), together, they play a vital role in fixing carbon dioxide and synthesizing carbohydrates, contributing to more than 20% of global primary production [4,5]. Most primary and secondary consumers in aquatic ecosystems depend on diatoms to some extent and even exclusively [6,7]. There are also many fish, shrimp and shellfish species of great fishery values that mainly feed on diatoms [8,9,10]. Therefore, diatoms are the main carbon source fueling a wide range of aquatic food webs and supporting a large proportion of global fisheries [11,12].
The global population explosion and accompanying human disturbances since the industrial revolution have resulted in large environmental changes and problems (e.g., climate change and eutrophication) [13]. Diatoms have the potential for wide application in related environmental research and management. First, diatoms can serve as indicators of environmental changes due to their high sensitivity and preservability. For example, diatoms respond to environmental changes (e.g., climate and salinity) by altering individual morphology/size and their assemblages [14,15,16,17,18]. Second, diatoms are a powerful surrogate for reconstructing paleoenvironmental history and even population dynamics of specific aquatic animals [19,20]. The silica shells of diatoms are easily preserved and provide a useful tool for fossil research [21,22,23]. Third, diatoms can also be used to assess environmental health, monitor environmental change [24,25], and then guide environmental management and restoration.
Promoting research on a specific area depends on a clear and objective overview of its development history and current status. However, this kind of overview of diatom research remains unavailable. A research overview can be achieved through a bibliometric analysis [26,27,28], in combination with sophisticated machine learning (ML) technologies (Latent Dirichlet Allocation, LDA) on existing literature [29]. LDA is a flexible generative probabilistic unsupervised topic model used to analyze the changes in topic importance over time for text documents [29]. LDA methods have been applied widely in various fields, including the fields of biology and environmental science, such as biodiversity [30], climate change [31], and animal communities [32]. This study aims to provide an objective overview of diatom research based on publication data from the Web of Science (WoS) during 1991–2018. We use ML technologies to identify research topics and track topic changes over time. This study has two objectives: (1) to summarize the topics in diatom research and (2) to identify the past trends and future directions of diatom research by analyzing the trends in diatom research topics. Hopefully, it will provide insights that will contribute to the future development of diatom research and thus global sustainability.

2. Materials and Methods

2.1. Data Collection

The data in the present work were sourced from the online database of the Science Citation Index Expanded (SCIE), Web of Science, which is the most comprehensive and frequently used data source in bibliometrics [33]. The exact query for diatom was TS = ((“diatom” OR “diatoms” OR “Bacillariophyta”) NOT (“diatomic” OR “diatomite” OR “diatomaceous earth”)), and the timespan was from 1991 to 2018. The publications in the sub-categories associated with biology, ecology and environmental science, including “agriculture, multidisciplinary”, “biology”, “biodiversity conservation”, “ecology”, “engineering, environmental”, “environmental sciences”, “limnology”, “marine and freshwater biology”, “microbiology”, “oceanography”, “paleontology”, “toxicology”, and “water resources” were selected. Raw data containing titles, keywords, abstracts, year of publication, journal of publication, authors’ names, authors’ affiliations, number of times cited, and cited reference counts were exported. All documents were used to analyze the scientific production per year, distribution in source, and publication distribution by countries. Impact factors (IF) were taken from the Journal Citation Reports (JCR) published in 2018. We used the abstract data from 1991 to 2018 to build the topic models.

2.2. Document-Terms Matrix Construction

The input data format for building topic models is a document-terms matrix (DTM), which consists of rows corresponding to the documents (papers) and columns that correspond to the terms (words found in each paper’s abstract). To generate the document-term matrix, raw textual data were pre-processed [34]. Punctuation, numbers, whitespace and stop words in the text and corpus were removed. Words were stemmed, in that they were reduced to their underlying root form, and terms that occurred less than five times were removed. We converted the abstract corpus into a document-term matrix using the function DocumentTermMatrix in the ‘tm’ package [35].

2.3. Topic Modeling Analysis

Latent Dirichlet Allocation (LDA) is a probabilistic latent variable model of documents that exploit the correlations among the words and latent semantic themes [36]. In LDA, each document in the text corpus is modelled as a set of draws from a mixture distribution over a set of hidden topics, where topics are assumed to be uncorrelated and each topic is characterized by a distribution over words. LDA has been widely used to explore, discover and illustrate key research topics in various academic fields, as well as the rise or fall in topic popularity [29].
LDA provides a probabilistic model for the latent topic layer [37] (see Figure 1 for the graphical model representation of LDA). For each document d, a multinomial distribution θ over topics is sampled from a Dirichlet distribution with parameter α. For each word wd,i, a topic zd,i is chosen from a topic-specific multinomial distribution φz,d,i sampled from a Dirichlet distribution with parameter β. The probability of generating a word w from a document d is:
P ( w | d ,   θ ,   ϕ ) = z T P ( w | z ,   ϕ z ) P ( z | d ,   θ d )
Therefore, the likelihood of document collection D is defined as:
P ( Z , W | Θ ,   Φ ) = d D z T θ d , z n d , z × z T v V ϕ z , v n z , v
where nd,z is the number of times that topic z has been associated with document d, and nz,v is the number of times that word w has been generated by topic z.
T represents the number of topics. This study used a statistical method for determining the value of T, following the suggestions of Newton and Raftery [38] and Griffiths and Steyvers [29]. The suggestion for determining T, referring to a harmonic mean, allows the discovery of the optimal number of topics, as well as a measure of the goodness-of-fit in the modelling. By using the calculation of the harmonic mean, which is one of the maximal log-likelihood methods, the optimum number of topics (T) can be ascertained.

2.4. Identification of Hot and Cold Topics

The mean θ (the per-document probabilities for topics) by year can reflect the rise and fall in the amount of scientific interest in topics. Here, we present a basic analysis based on a post hoc examination of the estimates of θ produced by the model. To identify topics that were consistently rising or falling in popularity from 1991 to 2018, we conducted a linear trend analysis on θ of each topic by year. According to the slope values (i.e., positive and negative) and their significance levels (i.e., p = 0.05, p = 0.01, p = 0.001, and p = 0.0001), topics were identified as “hot” topics and “cold” topics.

3. Results

3.1. Characteristic of Publication Outputs

A total of 19,483 documents with “diatom/s” or “Bacillariophyta” terms in the title, abstract or keywords were found among the 13 subject categories from 1991 to 2018 (Supplementary E). Article was the most frequent document type, accounting for 88.7% of the total production (Figure 2). Both of the total documents and articles demonstrated a linear increasing tendency in the past three decades, with an annual increase of 29 documents/articles. There were 19,000 documents with abstracts from 1991 to 2018, and we used these documents to identify topics and determine their trends.
The top 20 most active journals on diatom research are listed in Table 1, as well as their total number of publications (TP), total number of publications on diatom (TPD), the ratio of number of publications on diatom with the total number of publications (TPD/TP), total citation counts of publications on diatom (TC), average citation of each publication on diatom from 1991 to 2018 (TC/TPD) and impact factors (IF). The number of articles related to diatom published on a journal is associated with the total number of publications in this journal, therefore, we normalized the data in Table 1. Diatom research reasonably had the highest ratio of TPD/TP (80.2%), as it only accepts papers related to diatom, followed by Journal of Paleolimnology (34.4%), Journal of Phycology (17.1%), Cryptogamie Algologie (16.4%), and European Journal of Phycology (15.0%). Limnology and Oceanography had the highest average citation per article on diatom (62.7), which had a relatively high IF (4.325) among these 20 journals. The following journals with relatively high average citation were Marine Chemistry (45.3, IF 2.713), Deep-Sea Research Part II-Topical Studies in Oceanography (44.8, IF 2.430), Journal of Phycology (40.3, IF 2.831), and Protist (35.2, IF 3.000). The journals with an IF below 2.000 had low average citation according to Table 1.
A total of 109 countries/regions have published documents focusing on diatoms in these sub-categories during the period analyzed (Figure 3). The United States (USA) published the most documents related to diatom (3879), followed by Canada (1230), the United Kingdom (1223), France (1149), China (1146), and Germany (1131). The top twelve countries with the most documents published account for 70.73% of the total number of publications. Europe and North America accounted for 45.43% and 28.42% of the total publications, respectively, followed by Asia (15.32%), Oceania (4.86%) and South America (4.24%). Africa accounted for only 1.75% of the total publications.
The publication trends of the top 12 countries from 1991 to 2018 were presented in Figure 4. United States, France, China, Germany, Italy, and Russia have an uptrend in publishing documents on diatom since 1991, among which, China has the most obvious increased trend. The number of publications per year in Canada and United Kingdom have not changed much since 1991. Diatom-related publications in Japan, Spain, Australia, and Netherlands had increased from 1991 to the early 21th century, but has decreased in the last decade.

3.2. Research Topics and Major Themes in Diatom Research

A total of 116 topics were extracted using the LDA model (Supplementary A Figure S1). The 15 most likely terms and the potential themes of the 116 topics are listed in Supplementary A Table S1. We found 18 uninformative topics according to the extracted terms, and we grouped the remaining 98 topics into four broader themes: Biology and Life History, Biodiversity and Distribution, Environmental Changes and Algal Blooms, and Applications and Others; the four broader themes included 17, 32, 35, and 14 topics and 3788, 4193, 7477, and 2791 documents, respectively (Supplementary A Table S2). The broader theme of Biodiversity and Distribution is related to the community structure and the spatial or/and temporal distribution of diatoms, including community composition, biodiversity, habitats and temporal variations. Environmental Changes and Algal Blooms refers to the environmental pressure (e.g., eutrophication, nutrient limitation, heavy metal, climate change, hydrodynamics, and parasites) on diatoms and the harmful algal blooms caused by environmental changes. Biology and Life History is mainly related to the biology and physiology of diatoms, including growth, morphology, photosynthesis, and metabolic products. The broader theme of Applications and Others includes the use of diatoms in feeding zooplankton and the larvae of other animals. The other themes mainly involve the study methods, such as remote sensing, stable isotope, microscopic scanning and constructing models.

3.3. Hot and Cold Topics

A total of 44 topics were identified as cold topics at the significance level of p = 0.05 (Table 2). Among these topics, 17 topics showed a significant decreasing linear trend at the p = 0.0001 level (Table 2). Thirty-one of the topics were identified as hot topics at the significance level of p = 0.05 level (Table 2), and 17 of the topics showed a significant increasing linear trend at the p = 0.0001 level (Table 2).
The slope trend of each topic in the four broader themes is shown in Figure 5. The positive slopes (hot topics) represent the increasing trend in topic popularity, and the negative slopes (cold topics) represent the decreasing trend in topic popularity from 1991 to 2018. In the Biology and Life History theme, the top three hot topics are morphology, molecular identification, and gene expression (p = 0.0001); the top cold topics include growth rate, culture growth, inorganic carbon assimilation, chlorophyll and pigment, size, cell life of history, diel cycle, and diatom motility (p = 0.0001). In the Biodiversity and Distribution theme, the hot topics related to spatial-temporal distribution, diversities, river, and biofilm are the most prevalent (p = 0.0001); the cold topics of primary productivity (p = 0.0001), intertidal microphytobenthos, and food web (p = 0.001) decreased most in popularity over time. In the Environmental Changes and Algal Bloom theme, the hot topics of environmental indicator, climate change, land use, water quality, and toxicity test are the most prevalent (p = 0.0001); cold topics of surface-bottom mixing, acid deposition reconstruction, spring bloom, and cyanobacteria declined extremely significantly over time (p = 0.0001). In the Application and Others theme, the hot topics of review (p = 0.0001) and aquaculture cultivation (p = 0.001) are the most prevalent; the cold topics of copepod feeding, grazing by microzooplankton, and zooplankton predation (p = 0.0001) became unpopular from 1991 to 2018.

4. Discussion

This study objectively summarized research topics and identified the trends in diatom research by using bibliometric analysis and machine learning models for the first time. We found that the documents related to diatoms were largely published in Europe and North America. The LDA model indicated that the popular topics of diversity, environmental indicator, climate change, land use, and water quality signify that diatoms are increasingly used as indicators to assess water quality and identify modern climate change impacts on aquatic ecosystems. From most cold topics (basic topics, such as growth rate, culture growth, cell life history, copepod feeding, grazing by microzooplankton, zooplankton predation, and primary productivity) associated with hot topics (spatial-temporal distribution, morphology, molecular identification, gene expression, and review), we determined that basic studies on diatoms decreased, and studies tended to be more comprehensive. Some other cold topics (surface–bottom mixing and spring bloom) have been well studied in recent decades, and currently, few studies focus on these topics.
Our results reveal that research using diatoms to indicate or assess environmental change has become more popular. With intensive human disturbances, environmental changes such as the degradation of water quality and climate change have become greater concerns of ecologists and environmentalists. Diatoms are the most useful tool for monitoring running waters. They are the key component of river periphytic algae and react over a brief period to changes in water quality. The use of diatoms in bio-assessments was initiated starting in the early 20th century [25], warranted by decreased water quality in streams and rivers induced by anthropogenic activities such as the expansion of developed areas and agricultural areas. Climate change, especially global warming, is another issue of great concern caused by human activities that has a great impact on aquatic ecosystems. Climate warming changed the timing of spring phytoplankton blooms [41] and enhanced the intensity of harmful algae blooms [42], which have led to a significant increase in research on the topic of climate change over the past two decades. We also noted that the studies on paleoclimate (topics Holocene climate, Paleolimnology, acid deposition reconstruction) related to diatoms presented a decreasing trend popularity over these decades (the data are not shown). In paleoclimatic studies, researchers used diatoms to reconstruct ancient climates and learn about the changing climate [43]. However, after intensive studies, researchers determined that climate change is largely associated with modern limnology, and it is a global issue that deserves more attention. Thus, diatoms afford an elevated degree of precision in assessing the trophic status of a water body and reflecting climate change in modern aquatic ecosystems.
Along with the development of science and technology, studies on diatoms are becoming increasingly comprehensive, with decreasing individual basic studies. First, research on the growth and culture of diatoms, as well as feeding on zooplankton, were popular in the “early” days, with broader studies of diatoms in the “future”, and these fields were considered as the basis for further research on diatoms (e.g., species competition studies with batch cultures, aquaculture operations involving seminatural food chains, toxicological tests, and biofuel production). Second, the physiological, morphological and behavioral traits of diatoms (chlorophyll and pigment, inorganic carbon assimilation, diel cycle, diatom motility, and size) have been studied declining trends, and these traits have also been used as the basis for research on hot topic (functional diversity, which involves various traits of diatoms). Third, studies of diatom community structure have expanded at a larger spatio-temporal scale [24,44,45], with more ecologists recognizing that diatoms are more likely to be spatially and temporally structured [46]. Fourth, researchers have simultaneously focused on the multiple facets of biodiversity, such as taxonomic diversity, genetic diversity and functional diversity [47,48], as they have realized that globally decreasing biodiversity resulting from anthropogenic disturbances are predicted to have severe effects on aquatic ecosystem functions [49]. Fifth, with the advent of electronic scanning microscopy and the development of molecular technology, the substructure [50] of diatoms and the sequencing of the 18S rRNA gene for diatoms [51] have been widely used to classify diatoms as opposed to only diatom morphology under light microscopy. Finally, as an increasing number of studies on diatoms have been published, researchers have summarized, analyzed and refined data, materials, and main points in a large number of original research papers and have combined some of their own points of view to write a review related to diatoms. Thus, this the topic of review also presented an increasing trend in popularity over time.
Some topics related to diatoms had been well studied by previous researchers, so they were less studied by later researchers. For example, the relationship between the cold topic of surface–bottom mixing and diatoms has already been acquainted by most ecologists and biologists. Thermal stratification and vertical mixing (hydrodynamics) cause changes in underwater light conditions and nutrient concentrations and further affect the dynamics of diatoms [52]. This process would lead to the seasonal dynamics of the algae community. The mechanisms of seasonal changes in algae in both fresh and marine waters have been well demonstrated in the Plankton Ecology Group (PEG) model by Sommer et al. [53]. The spring bloom is the first step in the PEG model of phytoplankton succession [53], and in most cases, diatoms occupy the spring blooms. In spring, the water column is well mixed, and the high availability of nutrients enables diatom communities to develop biomass peaks. In addition, the temperature in spring was found to be suitable for the proliferation of many diatom species [54,55,56].
The analysis of country/regional publications indicates that Europe and North America have contributed much more in terms of diatom research than have Oceania, South America, and Africa. One of the possible reasons for this result is official languages. Non-English-speaking countries are hindered in writing and publishing articles in English. Another possible reason is related to development levels of these regions. In undeveloped countries, there are few researchers, and scientific research is inadequately supported by governments, especially in most countries of Africa. We also noticed that the number of publications in China has increased notably since 2009. The same phenomenon also occurred in the studies related to “eutrophication” [57], “nitrogen” [58], and “climate change” [26] using bibliometric analysis. In 2007, the pollution of drinking water caused by cyanobacteria bloom in Lake Taihu led to serious social and economic impacts in China, and the water environment health and protection has received much more attentions. Diatom has increasingly been used as environmental indicator to assess the environmental health, and the number of publications related to diatom therewith increased significantly.
This study also has some potential limitations. For example, this study only provided the first affiliation country’s frequency distribution, but not the exact sites where the studies conducted. In most cases, the first affiliation address is consistent with the study location. However, there are also some inconsistences between them. Therefore, conclusions drawn from using the affiliation country as the surrogate of study area may not accurate enough. Further synthesize studies, particularly those expected to acquire general rules from case studies conducted in gradient geolocations, should extract the exact location of study sites.

5. Conclusions

Our study identified 116 topics from the abstracts of 19,000 publications in Web of Science associated with “diatom” using latent Dirichlet allocation (LDA) from 1991 to 2018. We found that 18 of the 116 topics were uninformative and grouped the remaining 98 topics into four broader areas or themes: Biology and Life History, Biodiversity and Distribution, Environmental Changes and Algal Blooms, and Applications and Others. In addition, we have explored the topics that consistently rose (hot topics) and fell (cold topics) in popularity from 1991 to 2018. The degradation of water quality and climate change caused by intensive anthropogenic activities has resulted in the current increase in use of diatoms to indicate water quality and reflect modern climate change. Moreover, with the development of science and technology, basic research on diatoms has decreased, and more studies tend to be comprehensive. The bibliometrics analysis of diatom-related publications indicated that there are more publications produced in developed regions compared with undeveloped regions. This study highlighted that future research directions on diatoms will develop as environmental or climate changes occur and science and technology develop. Diatoms will be widely applied in managing aquatic environments and reflecting climate change to cope with environmental challenges.

Supplementary Materials

The following are available online at https://www.mdpi.com/2076-2607/7/8/213/s1, Supplementary A provided the model selection results, showing the Harmonic mean of the data for different settings of the number of topics (Figure S1); The 15 most probable terms and the potential theme of the 116 topics (Table S1); The topics of the four broader themes and the number of documents in each topic (Table S2). Supplementary B provided the R code of all analysis in this study (RStudio, version 1.2.1226). Supplementary C provided the publications title that are arranged in descending order according to the contribution rate to each topic. Supplementary D and Supplementary E provided downloaded data and filtered data from Web of Science respectively (The data was converted to a format that can be recognized by bibliometric analysis in RStudio using the function of “covert2df”).

Author Contributions

Conceptualization, Y.Z., J.T., and L.D.; Software, J.W. and L.D.; Writing—Original Draft Preparation, Y.Z.; Writing—Review and Editing, J.T., J.W., and C.D.; Supervision, H.Z., Y.L., D.L., and Q.Z.; Funding Acquisition, H.Z. and Q.Z.

Funding

This work was funded by the Yunnan Provincial Government Leading Scientist Program (No. 2015HA024) and the National Natural Science Foundation of China (No. 41601208).

Acknowledgments

We sincerely thank the reviewers for constructive comments on an earlier version of this manuscript. We also thank Xinyu Cheng, Meiling Chen and Wan Su for helpful suggestions and comments on an early draft of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Armbrust, E.V. The life of diatoms in the world’s oceans. Nature 2009, 459, 185–192. [Google Scholar] [CrossRef] [PubMed]
  2. Seckbach, J.; Kociolek, J.P. The Diatom World; Springer: New York, NY, USA, 2011; pp. 1–35. [Google Scholar]
  3. Kooistra, W.H.C.F.; Gersonde, R.; Medlin, L.K.; Mann, D.G. The origin and evolution of the diatoms. In Evolution of Primary Producers in the Sea; Falkowski, P.G., Knoll, A.H., Eds.; Elsevier: Boston, MA, USA, 2007; pp. 207–249. [Google Scholar]
  4. Falkowski, P.G.; Barber, R.T.; Smetacek, V. Biogeochemical controls and feedbacks on ocean primary production. Science 1998, 281, 200–207. [Google Scholar] [CrossRef] [PubMed]
  5. Saade, A.; Bowler, C. Molecular tools for discovering the secrets of diatoms. Bioscience 2009, 59, 757–765. [Google Scholar] [CrossRef]
  6. Krause, J.W.; Brzezinski, M.A.; Landry, M.R.; Baines, S.B.; Nelson, D.M.; Selph, K.E.; Taylor, A.G. The effects of biogenic silica detritus, zooplankton grazing, and diatom size structure on silicon cycling in the euphotic zone of the eastern equatorial pacific. Limnol. Oceanogr. 2010, 55, 2608–2622. [Google Scholar] [CrossRef]
  7. Lundholm, N.; Krock, B.; John, U.; Skov, J.; Cheng, J.; Pančić, M.; Wohlrab, S.; Rigby, K.; Nielsen, T.G.; Selander, E.; et al. Induction of domoic acid production in diatoms—Types of grazers and diatoms are important. Harmful Algae 2018, 79, 64–73. [Google Scholar] [CrossRef] [PubMed]
  8. Daume, S.; Long, B.M.; Crouch, P. Changes in amino acid content of an algal feed species (navicular sp.) and their effect on growth and survival of juvenile abalone (haliotis rubra). J. Appl. Phycol. 2003, 15, 201–207. [Google Scholar] [CrossRef]
  9. Verónica, V.E.; de Souza, D.M.; Rodríguez, E.M.; Jr Wasielesky, W.; Abreu, P.C.; Ballester, E.L.C. Biofilm feeding by postlarvae of the pink shrimp farfantepenaeus brasiliensis (Decapoda, Penaidae). Aquac. Res. 2013, 44, 783–794. [Google Scholar]
  10. Kumaran, J.; Jose, B.; Joseph, V.; Singh, I.S.B. Optimization of growth requirements of marine diatom chaetoceros muelleri using response surface methodology. Aquac. Res. 2017, 48, 1513–1524. [Google Scholar] [CrossRef]
  11. Jackson, T.; Bouman, H.A.; Sathyendranath, S.; Devred, E. Regional-scale changes in diatom distribution in the Humboldt upwelling system as revealed by remote sensing: implications for fisheries. ICES J. Mar. Sci. 2010, 68, 729–736. [Google Scholar] [CrossRef] [Green Version]
  12. Guo, F.; Kainz, M.J.; Sheldon, F.; Bunn, S.E. The importance of high-quality algal food sources in stream food webs–current status and future perspectives. Freshw. Biol. 2016, 61, 815–831. [Google Scholar] [CrossRef]
  13. Bunn, S.E. Grand challenge for the future of freshwater ecosystems. Front. Environ. Sci. 2016, 4, 21. [Google Scholar] [CrossRef]
  14. Hickman, M.; Reasoner, M.A. Diatom responses to late quaternary vegetation and climate change, and to deposition of two tephras in an alpine and a sub-alpine lake in Yoho national park, British Columbia. J. Paleolimnol. 1994, 11, 173–188. [Google Scholar] [CrossRef]
  15. Curtis, C.J.; Juggins, S.; Clarke, G.; Battarbee, R.W.; Kernan, M.; Catalan, J.; Thompson, R.; Posch, M. Regional influence of acid deposition and climate change in European mountain lakes assessed using diatom transfer functions. Freshw. Biol. 2010, 54, 2555–2572. [Google Scholar] [CrossRef]
  16. Hinder, S.L.; Hays, G.C.; Edwards, M.; Roberts, E.C.; Walne, A.W.; Gravenor, M.B. Changes in marine dinoflagellate and diatom abundance under climate change. Nat. Clim. Chang. 2012, 2, 271–275. [Google Scholar] [CrossRef]
  17. Frenken, T.; Velthuis, M.; Ln, D.S.D.; Stephan, S.; Aben, R.; Kosten, S.; Van Donk, E.; Van de Waal, D. Warming accelerates termination of a phytoplankton spring bloom by fungal parasites. Glob. Chang. Biol. 2016, 22, 299–309. [Google Scholar] [CrossRef] [PubMed]
  18. Zhang, Y.; Peng, C.R.; Wang, Z.C.; Zhang, J.L.; Li, L.J.; Huang, S.; Li, D.H. The species-specific responses of freshwater diatoms to elevated temperatures are affected by interspecific interactions. Microorganisms 2018, 6, 82. [Google Scholar] [CrossRef] [PubMed]
  19. Finney, B.P.; Gregory-Eaves, I.; Sweetman, J.; Douglas, M.S.V.; Smol, J.P. Impacts of climatic change and fishing on pacific salmon abundance over the past 300 years. Science 2000, 290, 795–799. [Google Scholar] [CrossRef]
  20. Li, Y.L.; Chen, X.; Xiao, X.Y.; Zhang, H.C.; Xue, B.; Shen, J. Diatom-based inference of Asian monsoon precipitation from a volcanic lake in southwest china for the last 18.5 ka. Quat. Sci. Rev. 2018, 182, 109–120. [Google Scholar] [CrossRef]
  21. Tapia, P.M.; Fritz, S.C.; Baker, P.A.; Seltzer, G.O.; Dunbar, R.B.; Seltzer, G.O.; Dunbar, R.B. A late quaternary diatom record of tropical climatic history from lake Titicaca (Peru and Bolivia). Palaeogeogr. Palaeocl. 2003, 194, 139–164. [Google Scholar] [CrossRef]
  22. Nguetsop, V.F.; Servant-Vildary, S.; Servant, M. Late Holocene climatic changes in west Africa, a high resolution diatom record from equatorial Cameroon. Quat. Sci. Rev. 2004, 23, 591–609. [Google Scholar] [CrossRef]
  23. Qian, W.; Yang, X.; Anderson, N.J.; Zhang, E.; Li, Y. Diatom response to climate forcing of a deep, alpine lake (Lugu Hu, Yunnan, SW. China) during the last glacial maximum and its implications for understanding regional monsoon variability. Quat. Sci. Rev. 2014, 86, 1–12. [Google Scholar]
  24. Rantala, M.V.; Luoto, T.P.; Weckström, J.; Rautio, M.; Nevalainen, L. Climate drivers of diatom distribution in shallow subarctic lakes. Freshw. Biol. 2017, 62, 1971–1985. [Google Scholar] [CrossRef]
  25. Wu, N.C.; Dong, X.H.; Liu, Y.; Wang, C.; Baattrup-Pedersen, A.; Riis, T. Using river microalgae as indicators for freshwater biomonitoring: Review of published research and future directions. Ecol. Indic. 2017, 81, 124–131. [Google Scholar] [CrossRef]
  26. Li, J.; Wang, M.; Ho, Y. Trends in research on global climate change: A Science Citation Index expanded-based analysis. Glob. Planet. Chang. 2011, 77, 13–20. [Google Scholar] [CrossRef]
  27. Tao, J.; Che, R.X.; He, D.K.; Yan, Y.Z.; Sui, X.Y.; Chen, Y.F. Trends and potential cautions in food web research from a bibliometric analysis. Scientometrics 2015, 105, 435–447. [Google Scholar] [CrossRef]
  28. Hulme, M.; Obermeister, N.; Randalls, S.; Borie, M. Framing the challenge of climate change in Nature and Science editorials. Nat. Clim. Chang. 2018, 8, 515–521. [Google Scholar] [CrossRef]
  29. Griffiths, T.L.; Steyvers, M. Finding scientific topics. Proc. Natl. Acad. Sci. USA 2004, 101, 5228–5235. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Boussalis, C.; Coan, T.G. Text-mining the signals of climate change doubt. Glob. Environ. Chang. 2016, 36, 89–100. [Google Scholar] [CrossRef] [Green Version]
  31. Valle, D.; Baiser, B.; Woodall, C.W.; Chazdon, R. Decomposing biodiversity data using the latent dirichlet allocation model, a probabilistic multivariate statistical method. Ecol. Lett. 2015, 17, 1591–1601. [Google Scholar] [CrossRef]
  32. Christensen, E.M.; Harris, D.J.; Ernest, S. Long-term community change through multiple rapid transitions in a desert rodent community. Ecology 2018, 99, 1523–1529. [Google Scholar] [CrossRef]
  33. Ho, Y.S.; Kahn, M. A bibliometric study of highly cited reviews in the Science Citation Index Expanded™. J. Assoc. Inf. Sci. Tech. 2014, 65, 372–385. [Google Scholar] [CrossRef]
  34. Welbers, K.; Van Atteveldt, W.; Benoit, K. Text Analysis in R. Commun. Methods Meas. 2017, 11, 245–265. [Google Scholar] [CrossRef]
  35. R Development Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing. Available online: http://www.r-project.org/ (accessed on 23 July 2019).
  36. Blei, D.M.; Lafferty, J.D. A correlated topic model of science. Ann. Appl. Stat. 2007, 1, 17–35. [Google Scholar] [CrossRef]
  37. Blei, D.M.; Ng, A.Y.; Jordan, M.I.; Lafferty, J. Latent Dirichlet Allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  38. Newton, M.A.; Raftery, A.E. Approximate Bayesian Inference with the Weighted Likelihood Bootstrap. J. R. Stat. Soc. Ser. B 1994, 56, 3–48. [Google Scholar] [CrossRef]
  39. Ponweiser, M. Latent Dirichlet Allocation in R. Master’s Thesis, Vienna University of Economics and Business, Vienna, Austria, 2012. [Google Scholar]
  40. Hu, B.B.; Dong, X.L.; Zhang, C.W.; Bowman, T.D.; Ding, Y.; Milojević, S.; Ni, C.Q.; Larivière, V.A. Lead-Lag Analysis of the Topic Evolution Patterns for Preprints and Publications. J. Assoc. Inf. Sci. Tech. 2015, 66, 2643–2656. [Google Scholar] [CrossRef]
  41. Peeters, F.; Straile, D.; Lorke, A.; Livingstone, D.M. Earlier onset of the spring phytoplankton bloom in lakes of the temperate zone in a warmer climate. Glob. Chang. Biol. 2007, 13, 1898–1909. [Google Scholar] [CrossRef] [Green Version]
  42. Kosten, S.; Huszar, V.L.; Bécares, E.; Costa, L.S.; Van Donk, E.; Hansson, L.A.; Jeppesen, E.; Kruk, C.; Lacerot, G.; Mazzeo, N.; et al. Warmer climates boost cyanobacterial dominance in shallow lakes. Glob. Chang. Biol. 2012, 18, 118–126. [Google Scholar] [CrossRef]
  43. Adams, K.E.; Taranu, Z.E.; Zurawell, R.; Cumming, B.F.; Gregory-Eaves, I. Insights for lake management gained when paleolimnological and water column monitoring studies are combined: A case study from Baptiste Lake. Lake Reserv. Manag. 2014, 30, 11–22. [Google Scholar] [CrossRef]
  44. Dong, X.; Bennion, H.; Maberly, S.C.; Sayer, C.D.; Simpson, G.L.; Battarbee, R.W. Nutrients exert a stronger control than climate on recent diatom communities in Esthwaite Water: Evidence from monitoring and palaeolimnological records. Freshw. Biol. 2012, 57, 2044–2056. [Google Scholar] [CrossRef]
  45. Kissling, W.D.; Ahumada, J.A.; Bowser, A.; Fernández, M.; Fernández, N.; García, E.A.; Guralnick, R.P.; Isaac, N.J.B.; Kelling, S.; Los, W.; et al. Building essential biodiversity variables (EBVs) of species distribution and abundance at a global scale. Biol. Rev. 2018, 93, 1–26. [Google Scholar] [CrossRef] [PubMed]
  46. Potapova, M.G.; Charles, D.F. Benthic diatoms in USA rivers: Distributions along spatial and environmental gradients. J. Biogeogr. 2002, 29, 167–187. [Google Scholar] [CrossRef]
  47. Morin, S.; Roubeix, V.; Batisson, I.; Winterton, P.; Pesce, S. Characterization of freshwater diatom communities: Comparing taxonomic and genetic-fingerprinting approaches. J. Phycol. 2012, 48, 1458–1464. [Google Scholar] [CrossRef] [PubMed]
  48. Abonyi, A.; Horváth, Z.; Ptacnik, R. Functional richness outperforms taxonomic richness in predicting ecosystem functioning in natural phytoplankton communities. Freshw. Biol. 2018, 63, 178–186. [Google Scholar] [CrossRef]
  49. Sjöqvist, C.O.; Kremp, A. Genetic diversity affects ecological performance and stress response of marine diatom populations. ISME J. 2016, 10, 2755–2766. [Google Scholar] [CrossRef] [PubMed]
  50. Gaul, U.; Geissler, U.; Henderson, M.; Mahoney, R.; Reimer, C.W. Bibliography on the fine-structure of diatom frustules (Bacillariophyceae). Proc. Acad. Nat. Sci. Phila. 1993, 144, 69–238. [Google Scholar]
  51. Medlin, L.K.; Kaczmarska, I. Evolution of the diatoms: V. Morphological and cytological support for the major clades and a taxonomic revision. Phycologia 2004, 43, 245–270. [Google Scholar] [CrossRef] [Green Version]
  52. Hubble, D.S.; Harper, D.M. Phytoplankton community structure and succession in the water column of Lake Naivasha, Kenya: A shallow tropical lake. Hydrobiologia 2002, 488, 89–98. [Google Scholar] [CrossRef]
  53. Sommer, U.; Gliwicz, Z.M.; Lampert, W.; Duncan, A. The PEG–model of seasonal succession of planktonic events in fresh-waters. Arch. Hydrobiol. 1986, 106, 433–471. [Google Scholar]
  54. Sommer, U.; Lewandowska, A. Climate change and the phytoplankton spring bloom: Warming and overwintering zooplankton have similar effects on phytoplankton. Glob. Chang. Biol. 2011, 17, 154–162. [Google Scholar] [CrossRef]
  55. Yang, Q.; Xie, P.; Shen, H.; Xu, J.; Wang, P.L.; Zhang, B. A novel flushing strategy for diatom bloom prevention in the lower-middle Hanjiang River. Water Res. 2012, 46, 2525–2534. [Google Scholar] [CrossRef] [PubMed]
  56. Striebel, M.; Schabhüttl, S.; Hodapp, D.; Hingsamer, P.; Hillebrand, H. Phytoplankton responses to temperature increases are constrained by abiotic conditions and community composition. Oecologia 2016, 182, 815–827. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Li, X.; Nan, R.Q. A bibliometric analysis of eutrophication literatures: An expanding and shifting focus. Environ. Sci. Pollut. R. 2017, 24, 17103–17115. [Google Scholar] [CrossRef] [PubMed]
  58. Yao, X.L.; Zhang, Y.L.; Zhang, L.; Zhou, Y.Q. A bibliometric review of nitrogen research in eutrophic lakes and reservoirs. J. Environ. Sci.-China 2017, 66, 274–285. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Latent Dirichlet Allocation (LDA) and the meaning of notations (adapted from Ponweiser [39], and Hu et al. [40]).
Figure 1. Latent Dirichlet Allocation (LDA) and the meaning of notations (adapted from Ponweiser [39], and Hu et al. [40]).
Microorganisms 07 00213 g001
Figure 2. Journal publications with diatom in title, abstract and keywords during 1991–2018 in Web of Science.
Figure 2. Journal publications with diatom in title, abstract and keywords during 1991–2018 in Web of Science.
Microorganisms 07 00213 g002
Figure 3. Distribution map of worldwide research on diatom from 1991 to 2018 based on number of publications.
Figure 3. Distribution map of worldwide research on diatom from 1991 to 2018 based on number of publications.
Microorganisms 07 00213 g003
Figure 4. The top 12 countries with most publications related to diatom in title, abstract and keywords during 1991-2018 on Web of Science.
Figure 4. The top 12 countries with most publications related to diatom in title, abstract and keywords during 1991-2018 on Web of Science.
Microorganisms 07 00213 g004
Figure 5. The significant cold and hot topics sorted by the slope of the linear models for the four broader themes. (* represents p = 0.05, ** represents p = 0.01, *** represents p = 0.001, and **** represents p = 0.0001).
Figure 5. The significant cold and hot topics sorted by the slope of the linear models for the four broader themes. (* represents p = 0.05, ** represents p = 0.01, *** represents p = 0.001, and **** represents p = 0.0001).
Microorganisms 07 00213 g005
Table 1. The top 20 most active journals on diatom research from 1991 to 2018.
Table 1. The top 20 most active journals on diatom research from 1991 to 2018.
Journal Name (Category Name)TPD/TP (%)TPDTPTCTC/TPDIF
DIATOM RESEARCH (Marine and Freshwater Biology)80.2%48360236857.61.169
JOURNAL OF PALEOLIMNOLOGY (Environmental Sciences; Limnology)34.4%601174715,38025.62.009
JOURNAL OF PHYCOLOGY (Marine and Freshwater Biology)17.1%774452431,18140.32.831
CRYPTOGAMIE ALGOLOGIE (Marine and Freshwater Biology)16.4%1167067566.51.361
EUROPEAN JOURNAL OF PHYCOLOGY (Marine and Freshwater Biology)15.0%2681782517219.32.526
JOURNAL OF PLANKTON RESEARCH (Marine and Freshwater Biology; Oceanography)14.9%506340415,69431.02.209
PHYCOLOGICAL RESEARCH (Marine and Freshwater Biology)14.3%765316458.51.342
AQUATIC MICROBIAL ECOLOGY (Ecology; Marine and Freshwater Biology; Microbiology)14.2%2681889850531.72.788
HARMFUL ALGAE (Marine and Freshwater Biology)13.6%2031497491824.25.012
LIMNOLOGY AND OCEANOGRAPHY (Limnology; Oceanography)12.9%729563545,74162.74.325
OCEANOLOGICAL AND HYDROBIOLOGICAL STUDIES (Marine and Freshwater Biology; Oceanography)12.2%695652764.00.674
DEEP-SEA RESEARCH PART II-TOPICAL STUDIES IN OCEANOGRAPHY (Oceanography)11.7%524446023,45644.82.430
MARINE MICROPALEONTOLOGY (Paleontology)11.4%1501321392426.22.663
FUNDAMENTAL AND APPLIED LIMNOLOGY (Limnology; Marine and Freshwater Biology)11.4%796966398.10.980
PROTIST (Microbiology)10.0%91910320235.23.000
PHYCOLOGIA (Marine and Freshwater Biology)9.5%3543709693619.61.976
AQUATIC ECOLOGY (Ecology; Limnology; Marine and Freshwater Biology)9.5%76799112114.82.505
VIE ET MILIEU-LIFE AND ENVIRONMENT (Ecology; Marine and Freshwater Biology)9.2%657103876.00.362
BIOFOULING (Marine and Freshwater Biology)8.9%1311475436833.32.847
MARINE CHEMISTRY (Oceanography)8.3%229277010,38045.32.713
TPD total number of publications on diatom research, TP total number of publications in this journal, TC total citation counts of the publications on diatom research, IF impact factor from the Journal Citation Report (JCR) published in 2018.
Table 2. Topics with significant negative and positive trends at four significant levels.
Table 2. Topics with significant negative and positive trends at four significant levels.
Topic Trendp = 0.05p = 0.01p = 0.001p = 0.0001
Negative trend 44342717
Positive trend 31272117
Total75614834

Share and Cite

MDPI and ACS Style

Zhang, Y.; Tao, J.; Wang, J.; Ding, L.; Ding, C.; Li, Y.; Zhou, Q.; Li, D.; Zhang, H. Trends in Diatom Research Since 1991 Based on Topic Modeling. Microorganisms 2019, 7, 213. https://doi.org/10.3390/microorganisms7080213

AMA Style

Zhang Y, Tao J, Wang J, Ding L, Ding C, Li Y, Zhou Q, Li D, Zhang H. Trends in Diatom Research Since 1991 Based on Topic Modeling. Microorganisms. 2019; 7(8):213. https://doi.org/10.3390/microorganisms7080213

Chicago/Turabian Style

Zhang, Yun, Juan Tao, Jun Wang, Liuyong Ding, Chengzhi Ding, Yanling Li, Qichao Zhou, Dunhai Li, and Hucai Zhang. 2019. "Trends in Diatom Research Since 1991 Based on Topic Modeling" Microorganisms 7, no. 8: 213. https://doi.org/10.3390/microorganisms7080213

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop