Next Article in Journal
Assessment of Selected Characteristics of Enrichment Products for Regular and Irregular Aggregates Beneficiation in Pulsating Jig
Next Article in Special Issue
Coupling NCA Dimensionality Reduction with Machine Learning in Multispectral Rock Classification Problems
Previous Article in Journal
Contamination of the Soil–Groundwater–Crop System: Environmental Risk and Opportunities
Previous Article in Special Issue
Partial Least Squares Regression of Oil Sands Processing Variables within Discrete Event Simulation Digital Twin
Article

Effectiveness of Natural Language Processing Based Machine Learning in Analyzing Incident Narratives at a Mine

1
Department of Mining Engineering, University of Utah, Salt Lake City, UT 84112, USA
2
Teck Red Dog Operations, Anchorage, AK 99503, USA
*
Author to whom correspondence should be addressed.
Academic Editor: Yosoon Choi
Minerals 2021, 11(7), 776; https://doi.org/10.3390/min11070776
Received: 22 May 2021 / Revised: 13 July 2021 / Accepted: 14 July 2021 / Published: 17 July 2021
To achieve the goal of preventing serious injuries and fatalities, it is important for a mine site to analyze site specific mine safety data. The advances in natural language processing (NLP) create an opportunity to develop machine learning (ML) tools to automate analysis of mine health and safety management systems (HSMS) data without requiring experts at every mine site. As a demonstration, nine random forest (RF) models were developed to classify narratives from the Mine Safety and Health Administration (MSHA) database into nine accident types. MSHA accident categories are quite descriptive and are, thus, a proxy for high level understanding of the incidents. A single model developed to classify narratives into a single category was more effective than a single model that classified narratives into different categories. The developed models were then applied to narratives taken from a mine HSMS (non-MSHA), to classify them into MSHA accident categories. About two thirds of the non-MSHA narratives were automatically classified by the RF models. The automatically classified narratives were then evaluated manually. The evaluation showed an accuracy of 96% for automated classifications. The near perfect classification of non-MSHA narratives by MSHA based machine learning models demonstrates that NLP can be a powerful tool to analyze HSMS data. View Full-Text
Keywords: mine safety and health; accidents; narratives; machine learning; natural language processing; random forest classification mine safety and health; accidents; narratives; machine learning; natural language processing; random forest classification
MDPI and ACS Style

Ganguli, R.; Miller, P.; Pothina, R. Effectiveness of Natural Language Processing Based Machine Learning in Analyzing Incident Narratives at a Mine. Minerals 2021, 11, 776. https://doi.org/10.3390/min11070776

AMA Style

Ganguli R, Miller P, Pothina R. Effectiveness of Natural Language Processing Based Machine Learning in Analyzing Incident Narratives at a Mine. Minerals. 2021; 11(7):776. https://doi.org/10.3390/min11070776

Chicago/Turabian Style

Ganguli, Rajive, Preston Miller, and Rambabu Pothina. 2021. "Effectiveness of Natural Language Processing Based Machine Learning in Analyzing Incident Narratives at a Mine" Minerals 11, no. 7: 776. https://doi.org/10.3390/min11070776

Find Other Styles
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Access Map by Country/Region

1
Back to TopTop