Uncovering Patterns and Trends in Big Data-Driven Research Through Text Mining of NSF Award Synopses
Abstract
1. Introduction
2. Background
2.1. Big Data
2.2. Big Data’s Impact
2.3. Big Data-Driven Research Across Disciplines
2.4. NSF and Big Data
2.5. Research Using NSF Data
2.6. Significance and Contribution
3. Data and Methods
3.1. Research Design and Rationale
3.2. Data
3.3. Analysis Procedures
- Each topic is a cluster of words that shares some semantic domain.
- Each document is a mixture of the topics, i.e., the document contains words from various topics in different proportions. For this study, documents are the synopses of NSF awards.
- For each document $d$, draw a topic proportion vector $\theta_d$ from a Dirichlet distribution parameterized by $\alpha$.
- For each topic $k$, draw a topic distribution $\beta_k$, representing the distribution over vocabulary terms, from a Dirichlet distribution with parameter $\eta$.
- For each word $n$ in document $d$ (where $n = 1, \dots, N_d$),
  - i. Select a topic $z_{d,n}$ for the word from a multinomial distribution governed by $\theta_d$.
  - ii. Draw the word $w_{d,n}$ itself from a multinomial distribution determined by the topic distribution $\beta_{z_{d,n}}$.
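The generative steps listed above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the authors' implementation: the function name `sample_lda_corpus`, the array names `theta` (per-document topic proportions) and `beta` (per-topic word distributions), the symmetric Dirichlet parameters `alpha` and `eta`, and all sizes are hypothetical choices made here. It only samples synthetic word ids; fitting a topic model to real award synopses runs this process in reverse, inferring the distributions from observed words.

```python
import numpy as np


def sample_lda_corpus(n_docs=5, n_topics=3, vocab_size=20, doc_length=50,
                      alpha=0.5, eta=0.5, seed=0):
    """Sample a toy corpus from the LDA generative process (illustrative only)."""
    rng = np.random.default_rng(seed)

    # One word distribution per topic: each row of beta is drawn from a
    # symmetric Dirichlet over the vocabulary.
    beta = rng.dirichlet(np.full(vocab_size, eta), size=n_topics)

    # One topic-proportion vector per document: each row of theta is drawn
    # from a symmetric Dirichlet over the topics.
    theta = rng.dirichlet(np.full(n_topics, alpha), size=n_docs)

    docs = []
    for d in range(n_docs):
        # Step i: pick a topic index for every word position in document d.
        z = rng.choice(n_topics, size=doc_length, p=theta[d])
        # Step ii: draw each word id from the word distribution of its topic.
        words = [int(rng.choice(vocab_size, p=beta[k])) for k in z]
        docs.append(words)
    return theta, beta, docs
```

Each row of `theta` and `beta` sums to one, which reflects the two modeling assumptions stated earlier: a document is a mixture of topics, and a topic is a distribution over words.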
4. Results
4.1. Trends and Patterns in Funded Big Data-Driven Research (RQ1)
4.1.1. General Trends in Funded Big Data-Driven Research
4.1.2. Big Data Keyword Trends over Time
4.1.3. Big Data Themes over Time
4.2. Big Data-Driven Research Trends by Institutional Characteristics (RQ2)
4.2.1. Research Classification
4.2.2. Population Served
4.3. Big Data-Driven Research Within Subdomains of NSF-Defined Research Areas (RQ3)
- SBE: Emergency management, cognitive neuroscience, industrial infrastructure, and climate change.
- EDU: Diversity/inclusion, cybersecurity, virtual learning, community education, and student success.
- CISE: Machine learning, cybersecurity, hydroinformatics, and online community networks.
- MPS: Computational simulations, astrophysics, mathematical modeling, and quantum mechanics.
- ENG: Sustainable infrastructure, robotics design, geotechnical engineering, and health technologies.
- BIO: Bioeducation, climate change, specimen digitization, and epidemiology.
- GEO: Space physics, hurricanes, deep-sea volcanoes, STEM education, and earthquake dynamics.
5. Discussion
6. Limitations
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Topic (MPS) | Top Terms | Weight |
---|---|---|
Astrophysics | Galaxies, stars, observations, formations, gas, students, data, team | 8.23% |
Chemical Reactions | Chemistry, reactions, metal, students, catalysts, synthesis | 7.42% |
Computational Simulation | Methods, computational, algorithm, learning, develop, model, application, optimization, efficient | 5.34% |
Differential Equations | Equations, nonlinear, numerical, differential, mathematical, solutions, fluid, analysis, methods, models | 7.25% |
Geometric Spaces | Theory, geometry, study, geometric, spaces, pi, mathematics, dimensional, manifolds, topology | 12.5% |
Gravitational Waves | Gravity, wave, black, neutron, physics, holes, relativity | 2.08% |
Materials Science | Materials, properties, technical, electronic, device, solid, temperature | 7.08% |
Mathematical Models | Network, mathematical, model, time, stochastic, system, spectrum, control, develop | 3.02% |
Metal Catalysts | Students, science, material, engineering, development, award, support, engineering, program, education, development, impacts | 2.85% |
Molecular Biology | Cell, biology, material, DNA, student, systems, design | 3.10% |
Molecular Dynamics | Molecule, dynamics, chemical, computational, model, chemistry, experimental, species, methods | 2.64% |
MPS Education | Conference, workshop, theory, student, support, summer, REU, physics | 9.71% |
Nuclear Physics | Quantum, physics, plasma, matter, nuclear, measurements, electron, laser, experiments, energy | 4.16% |
Particle Stability | Particles, surface, water, environmental, liquid, coatings, model, ice | 2.25% |
Topology | Light, optical, energy, students, nanoparticles, chemical, surface, spectroscopy, properties | 4.55% |
Statistical Theory | Data, statistical, methods, models, analysis, dimensional, inference, develop | 4.35% |
Quantum Mechanics | Systems, theory, quantum, physics, random, study, model, dynamical, particle | 3.84% |
Protein Structure | Chemistry, protein, molecule, structure, students, program | 5.02% |
Polymer Science | Materials, polymer, properties, organic, molecular, application, chemistry, structure, student | 4.61% |
Topic (ENG) | Top Terms | Weight |
---|---|---|
Information Technology | Systems, networks, data, wireless, communication, power, distributed, design | 5.02% |
Structural Design | Building, structures, data, earthquake, soil, engineering, seismic, design | 4.51% |
Particle Dynamics | Flow, fluid, particle, dynamics, heat, transport, experiments | 5.89% |
Biosensors | Sensing, detection, optical, device, resolution, micro | 5.42% |
Thermodynamics | Chemical, gas, reaction, energy, carbon, fuel, production, catalyst, biomass | 4.85% |
Infrastructure | Data, infrastructure, social, risk, urban, public, impacts, decision, communities, disaster | 4.57% |
Robotics | Control, human, robots, motion, system, soft, design | 3.04% |
Supply Chain | Supply, production, chain, care, service, models, develop, cost, food | 2.68% |
Nanoparticles | Nanoparticles, molecule, protein, surface, interactions, DNA, properties, assembly | 5.13% |
Engineering Education | Workshop, conference, students, engineering, international, support, meeting, science, education, learning, professional, development, REU, STEM | 15.30% |
Industry Technology | University, center, students, industry, engineering, technology, proposed, program | 9.71% |
Optimization | System, model, control, methods, optimization, computational, time | 7.04% |
Manufacturing | Design, manufacturing, support, evaluation, process | 2.25% |
Materials Science | Materials, properties, mechanical, polymer, process, applications, fundamental, manufacturing | 9.77% |
Tissue Mechanics | Cell, tissue, mechanical, cancer, disease, stem, development | 6.49% |
Quantum Devices | Quantum, devices, optical, materials, light, semiconductor, photonic, applications | 6.72% |
Topic (CISE) | Top Terms | Weight |
---|---|---|
CISE Education | Students, science, learning, computer, school, education, computing, teachers, support, undergraduate | 6.02% |
Computational Molecular Biophysics | Computational, simulation, materials, molecular, software, methods, physics, biological, science | 3.00% |
Cloud Computing | Compute, data, performance, systems, memory, applications, cloud, hardware, parallel | 7.84% |
Wireless Networks | Network, wireless, communication, spectrum, internet, traffic, data, design | 7.35% |
Online Community Networks | Social, design, support, people, information, public, community, understanding, technology, online | 7.07% |
Computational Theory | Algorithms, theory, computation, optimization, applications, efficient, science, computer, method | 6.46% |
Human Robot Interaction | Human, control, robot, system, physical, autonomous, real, time | 6.11% |
Information Technology | Data, network, science, university, infrastructure, community, support, campus, resources, computing | 5.31% |
Software Analysis | Software, systems, code, techniques, programming, tools, analysis, verification, program | 5.37% |
User Experience | Doctoral, consortium, information, students, community, participants, feedback, science, gravitational, human | 1% |
Data Privacy | Privacy, users, mobile, data, device, web, information, access, techniques | 3.54% |
Cybersecurity | Security, attacks, quantum, systems, cyber, information, cybersecurity, techniques | 4.19% |
Health IT | Human, system, patient, medical, data, clinical, time | 1.9% |
Hydroinformatics | Data, science, scientific, community, cyberinfrastructure, water, software, tools, support | 5% |
Software Engineering | Software, community, tools, engineering, design, infrastructure, support, development, source, evaluation | 2% |
Virtual Reality | Virtual, visual, speech, computer, video, vision, 3d, reality, systems, recognition | 2% |
Machine Learning | Learning, machine, models, data, algorithms, deep, ai, model, methods, evaluation | 5% |
Knowledge Graphs | Data, analysis, information, techniques, methods, mining, knowledge, graph, algorithms, search | 5% |
Topic (BIO) | Top Terms | Weight |
---|---|---|
Bio Education | Scientists, workshop, meeting, conference, biology, support, career, students, students, training, REU | 11.8% |
Biogeochemical Cycles | Carbon, soil, water, ecosystem, nitrogen, forest, streams, organic, climate | 6.42% |
Climate Change | Species, change, climate, environmental, tree, responses, effects, drought, ecological, temperature | 7.34% |
Computational Biology | Data, methods, tools, develop, models, analysis, computational, community, software | 4.7% |
Epidemiology | Disease, host, virus, immune, infection, pathogen, transmission, COVID, diseases | 4.22% |
Ethology | Behavioral, social, species, animals, study, understanding, reproductive, individuals | 5.23% |
Evolution Genetics | Species, evolution, genetic, diversity, traits, genomic, study, populations | 8.92% |
Marine Biology | Students, marine, university, instrument, system, undergraduate, science, training, support | 5.4% |
Metabolic Engineering | Synthetic, systems, engineering, biology, metabolic, chemical, design, develop, molecular | 4.4% |
Microbial Interactions | Plant, microbial, species, diversity, communities, fungi, interactions, host | 5.9% |
Molecular Biology | Gene, expression, RNA, genome, DNA, cell, function | 8.57% |
Neuroscience | Neurons, system, students, sensory, activity, mechanisms, memory, animal, behavior | 3.69% |
Physiological Responses | Plant, stress, crop, growth, response, signaling, students, molecular | 4.05% |
Pollination Mechanisms | Plant, flowering, data, pollinators, species, time | 1.29% |
Protein Structure | Cell, proteins, cellular, molecular, students, understanding, signaling, iron | 5.6% |
Specimen Digitization | Collections, specimens, biodiversity, species, collection, museum, digitization | 5.15% |
Trophic Interactions | Food, species, prey, interactions, models, experiments, communities, students, ecosystems, understanding | 3.12% |
Urban Ecology | Ecological, urban, species, human, natural, coastal, land, data, ecosystem, change | 4.24% |
Topic (SBE) | Top Terms | Weight |
---|---|---|
Social Network Analysis | Data, methods, analysis, social, information, develop, tools, statistical, science, network | 17.6% |
SBE Education | Science, students, stem, program, workshop, education, training, university | 8.6% |
Archaeology | Social, archaeological, political, study, local, communities, data, ancient, society | 7.12% |
Neurolinguistics | Speech, linguistics, English, words, children, understanding, learning, processing | 6.1% |
Behavioral Economics | Behavior, decision, people, models, social, theory, information, individuals | 5.9% |
Cognitive Neuroscience | Cognitive, learning, neural, human, memory, visual, understanding, information, activity, development | 5.8% |
Emergency Management | Social, public, covid, risk, political, pandemic, data, survey, support, information | 5.8% |
Primate Genomics | Human, genetic, data, primate, species, biological, study, understanding | 5.6% |
Urban Political Economy | Urban, political, social, local, cities, public, economic, development | 5.5% |
Economic Policy | Economic, policy, financial, data, effects, income, market, labor | 4.7% |
Environmental Sustainability | Environmental, food, land, energy, social, systems, change, climate, communities, development | 4.4% |
Climate Change | Human, environmental, archaeology, change, sites, times, climate, past, data | 4.3% |
Information Technology | Firms, innovation, market, information, technology, trade, policy, industry, data | 3.8% |
Indigenous Communities | Indigenous, native, linguistic, American, conference, community, documentation, knowledge | 3.5% |
Criminal Justice | Legal, law, justice, criminal, court, police, enforcement | 3.2% |
International Security | Conflict, violence, international, countries, care, security, political, military, medical, war | 3.1% |
Wildfire Emissions | Spatial, forest, doctoral, climate, human, fire, land, dissertation | 2.6% |
Industrial Infrastructure | Infrastructure, water, workers, technology, systems, support, impacts, human, public, social | 2.4% |
Topic (GEO) | Top Terms | Weight |
---|---|---|
GEO Education | Students, science, workshop, program, geoscience, community, scientists, university, scientific, education | 7.8% |
Ocean Temperature | Ocean, deep, data, seafloor, hydrothermal, ridge, sea, samples, cruise, program | 7.6% |
Coral Reef Ecosystems | Marine, species, coral, ecosystem, communities, reef, understanding, environmental | 6.2% |
Earthquake Dynamics | Earthquake, seismic, slip, zone, data, deformation, subduction, plate, understanding | 6.1% |
Ground/Surface Water | Water, sediment, river, coastal, erosion, transport, groundwater, flow, rivers, processes | 6.0% |
Ocean Carbon Cycle | Iron, ocean, FE, water, trace, isotope, chemical, oxygen, elements, isotopes | 5.9% |
Ocean Models | Climate, ocean, variability, model, pacific, circulation, Atlantic, north, tropical | 5.8% |
Polar Climate | Arctic, communities, change, social, Alaska, human, climate, environmental, local, community | 5.7% |
Atmospheric Aerosols | Atmospheric, aerosol, organic, cloud, chemistry, air, compounds | 5.1% |
Arctic Climate Change | Ice, sea, arctic, sheet, level, climate, ocean, Greenland, change, model | 4.9% |
Geologic Times | Climate, records, past, change, time, cores, data, proxy, lake | 4.8% |
Hurricanes | Climate, soil, precipitation, arctic, fire, weather, permafrost, water, vegetation, land | 4.5% |
Space Physics | Support, solar, space, instrumentation, university, instrument, funded, system, acquisition | 4.3% |
Mantle Composition | Mantle, crust, earth, subduction, rocks, plate, seismic, tectonic, deformation, processes | 4.2% |
Deep-Sea Volcanoes | Volcanic, eruption, processes, rocks, volcanoes, volcano, study, geochemical, understanding | 3.8% |
Ocean Productivity | Carbon, ocean, co2, organic, nitrogen, production, microbial, biogeochemical, cycle, water | 3.6% |
Marine Microbials | Antarctic, polar, field, antarctica, study, southern, public, region, students, time | 2.9% |
Measurement System Analysis | Data, system, time, community, development, based, develop, analysis, tools, methods | 2.7% |
Oceanographic Facilities | Carbon, ocean, co2, organic, nitrogen, production, microbial, biogeochemical, cycle, water | 2.5% |
Topic (EDU) | Top Terms | Weight |
---|---|---|
Cybersecurity | Cybersecurity, security, students, cyber, education, modules, learning, hands, systems, privacy | 6.24% |
Virtual Learning | Learning, virtual, data, spatial, manufacturing, human, understanding, support, 3d, develop | 2.91% |
Workforce Development | Training, students, program, education, graduate, science, industry, skills, development, workforce | 4.78% |
STEM Education | Learning, stem, children, science, study, studies, cognitive, development, knowledge, program | 3.51% |
STEM Education | Stem, students, graduate, program, school, underrepresented, programs, careers, education, university | 4.46% |
Student Success | Students, stem, student, low, retention, income, science, support, success, academic | 3.86% |
Teacher Education | Teachers, teacher, stem, school, teaching, science, mathematics, university, program, Noyce | 4.46% |
MPS Reasoning | Students, physics, mathematics, reasoning, development, learning, instructional, student, science, instruction | 5.81% |
Change Theory | Education, stem, change, network, teaching, undergraduate, national, study, nsf, support | 5.28% |
Geoscience Education | Learning, students, workshops, student, geoscience, development, teaching, materials, based, professional | 4.54% |
Student Outcomes | Students, stem, data, study, student, learning, outcomes, college, career, examine | 5.42% |
Biology Education | Students, undergraduate, biology, student, based, science, learning, institutions, stem, community | 4.18% |
HBCU Support | Undergraduate, award, students, support, institution, black, university, provide, historically, experiences | 5.67% |
Engineering Education | Students, student, learning, stem, engineering, courses, education, chemistry, undergraduate, skills | 7.3% |
Online Learning | Students, development, student, data, learning, design, materials, education, content, online | 4% |
Artificial Intelligence | Students, learning, data, ai, student, computer, online, system, support, programming | 4.96% |
Informal Learning | Learning, science, workshop, computer, computing, computational, design, education, community, informal | 6.24% |
Gender Equity | Stem, women, support, education, program, equity, participation, experiences, engineering, impacts | 6.70% |
Design-Based | Engineering, students, learning, design, based, technology, student, knowledge, courses, mathematics | 4.22% |
Community Education | Stem, community, program, students, education, college, institutions, support, colleges, university | 5.46% |
Directorate | n | Percent |
---|---|---|
ENG | 18,349 | 20.72 |
MPS | 18,140 | 20.49 |
CISE | 17,603 | 19.88 |
GEO | 12,201 | 13.78 |
SBE | 9,466 | 10.69 |
BIO | 8,480 | 9.58 |
EDU | 4,309 | 4.87 |
Total | 88,548 | 100 |
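The Percent column in the directorate table above can be reproduced from the n column. The following is a minimal sketch; the names `counts`, `total`, and `shares` are chosen here for illustration and do not come from the article.

```python
# Counts per NSF directorate, copied from the n column of the table.
counts = {
    "ENG": 18349,
    "MPS": 18140,
    "CISE": 17603,
    "GEO": 12201,
    "SBE": 9466,
    "BIO": 8480,
    "EDU": 4309,
}

# Grand total across all directorates.
total = sum(counts.values())

# Share of each directorate as a percentage, rounded to two decimals.
shares = {name: round(100 * n / total, 2) for name, n in counts.items()}

for name, share in shares.items():
    print(f"{name}: {share:.2f}")
```

Because each share is rounded separately, the rounded values need not add up to exactly 100.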
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
King, A.; Mostafa, S.A. Uncovering Patterns and Trends in Big Data-Driven Research Through Text Mining of NSF Award Synopses. Analytics 2025, 4, 1. https://doi.org/10.3390/analytics4010001