GPT Applications for Construction Safety: A Use Case Analysis
Abstract
1. Introduction and Literature Review
Point of Departure
2. Research Method
2.1. Step 1: Safety Use Case Identification
2.1.1. Use Case #1
2.1.2. Use Case #2
2.1.3. Use Case #3
2.1.4. Use Case #4
2.2. Step 2: Safety Use Case Integration in GPT
2.3. Step 3: Safety Use Case Analysis
2.3.1. Interactivity
- GPT Assessment Procedure: to evaluate the interactive ability of GPT qualitatively, two researchers observed and assessed the responses of GPT based on following criteria.
- ○
- Comprehensive Language Processing: GPT’s ability to understand and interpret the nuances and variations in human language was evaluated, particularly its handling of complex queries and generation of coherent follow-up responses.
- ○
- Contextual Understanding: GPT’s capacity to maintain conversational continuity by remembering and integrating previous information and context into its subsequent responses was assessed.
- ○
- Human-like Responses: the degree to which GPT’s responses replicate human conversational patterns was examined, focusing on its ability to demonstrate empathy, personalization, and a natural flow in interactions.
- Assessed Use Cases: This assessment was conducted for use cases #1 (Personalized Safety Training and Education) and #4 (Real-time Safety Management Assistance), where user interaction is critical. Use case #2 (Automated Post-Accident Analysis) and #3 (Automated Generation of Safety Guidelines and Checklists) were excluded as no conversation takes place between the user and GPT in those use cases.
2.3.2. Evidence-Based Reasoning
- ○
- GPT Assessment Procedure: Evidence-based reasoning was assessed by comparing the generated information to (1) established regulations and standards, (2) relevant safety documents, and (3) a similar analysis conducted by a human expert, whenever applicable. The following criteria were used for assessment:
- ▪
- Using Factual Information: GPT’s ability to generate answers based on reliable, factual information was evaluated, with a focus on minimizing hallucination.
- ▪
- Detecting Incorrect Inputs: GPT’s capability to identify and flag user mistakes or incorrect input information was assessed.
- ▪
- Extracting Correct Information from User’s Given Documents: GPT’s effectiveness in extracting accurate and relevant information from documents and instructions provided by the user was examined.
- ○
- Assessed Use Cases: this assessment was conducted for all use cases as evidence-based reasoning is equally important for all application areas.
2.3.3. Critical Analysis and Extrapolation
- GPT Assessment Procedure: GPT’s performance was qualitatively evaluated by two researchers and also benchmarked against the analysis conducted by human experts (from NIOSH and OSHA) for the same task, providing a comparative perspective on their ability to critically understand and apply safety-related information. Existing safety regulations and documents were also used for comparison. The following criteria were used for assessment:
- ○
- Depth of Analysis: GPT’s ability to thoroughly analyze textual data, identifying key information and relevant details that are crucial for understanding safety contexts, was assessed. This analysis was compared to the depth achieved by human experts from NIOSH and OSHA wherever applicable.
- ○
- Connection and Contextualization: GPT’s ability to link disparate pieces of information to form a cohesive understanding of safety scenarios was evaluated. The assessment includes determining whether GPT can contextualize safety information in a manner comparable to insights from existing safety regulations and documents.
- ○
- Inferencing and Extrapolation: GPT’s ability to infer and extrapolate from analyzed data to predict potential safety risks or recommend preventive measures was examined.
- Assessed Use Cases: This assessment was conducted for use cases #2 (Automated Post-Accident Analysis), #3 (Automated Generation of Safety Guidelines and Checklists), and #4 (Real-time Safety Management Assistance) where critical analysis and extrapolation from given information is important for GPT to deliver appropriate results.
2.3.4. Knowledge and Comprehension
- GPT Assessment Procedure: GPT’s performance was qualitatively evaluated by comparing the generated information to (1) regulations and standards, (2) relevant safety documents, and (3) human responses whenever applicable. The following criteria were used for this assessment.
- ○
- Breadth of Knowledge: GPT’s ability to cover a wide range of topics was assessed by assessing its responses to diverse queries from different domains of safety management to evaluate its capacity to retrieve and accurately present information across a broad spectrum of subjects.
- ○
- Depth of Comprehension: GPT’s depth of understanding was evaluated, focusing on its ability to grasp complex concepts and contextual nuances in its responses.
- Assessed Use Cases: this assessment was conducted for all use cases.
3. Results of the Safety Use Case Analysis
3.1. Analysis Results for Use Case #1 (Personalized Safety Training and Education)
- Introduction: GPT introduced itself as a virtual trainer, establishing a welcoming and engaging tone similar to that of a human trainer. This initial interaction was designed to build rapport and ease the trainee into the learning environment, setting the stage for effective communication and learning.
- Training Content Delivery: GPT delivered educational content focused on fall-related hazards and safe practices for working at height. After presenting the material, GPT invited questions from the user to ensure clarity and understanding. This interaction allowed the trainee to engage actively with the content, moving the training forward based on the user’s responses or expressed understanding.
- Safety Scenario: GPT presented customized safety scenarios along with questions to evaluate the user’s understanding. These scenarios were crafted to simulate real-life situations, thereby testing the practical application of the learned safety measures and GPT’s ability to assess critical understanding.
- Provide Feedback: Based on the user’s answers to scenario questions, GPT provided personalized feedback, identifying areas of strength and those needing improvement. This step was crucial for evaluating GPT’s instructional effectiveness and enhancing the trainee’s learning experience. The process progressed with the user either asking further questions or confirming their understanding of the feedback.
- Adjust Scenario Complexity and Introduce New Scenarios: Depending on the user’s level of understanding, GPT adjusted the complexity of the safety scenarios. This iterative process of scenario presentation and feedback continued until the user consistently demonstrated satisfactory responses, ensuring a deep and practical understanding of the safety topics covered.
- Closure: The training session concluded with GPT summarizing the key points and confirming that the user had achieved a comprehensive understanding of the material, akin to the closure provided by a human trainer. This final interaction was designed to reinforce the training outcomes and provide the trainee with confidence in their new knowledge.
- Comprehensive Language Processing: GPT exhibited the capability to process and understand the user’s questions correctly. It effectively comprehended the nuances and variations in the user’s language, successfully interpreting complex queries. For instance, when presented with a slightly jumbled question, “What’s my next step if I notice the safety harness seems wrecked just before I’m supposed to start working up high?”, GPT accurately deciphered the underlying safety concern despite the informal phrasing and potential ambiguity around “seems wrecked”. It responded appropriately: “Cease use immediately and report the issue to ensure a replacement harness is provided.” Additionally, GPT handled a follow-up question accurately: “And if there’s no replacement available immediately?”, to which it replied, “Utilize alternative safety measures such as a scaffold with guardrails or delay the task until the proper equipment is available.” These responses were relevant, but more importantly for this evaluation criteria, contextually appropriate, demonstrating a high level of linguistic accuracy and detail.
- Contextual Understanding: GPT’s ability to maintain continuity and context throughout the conversation was notably demonstrated during a session. At the beginning of the training, the user asked, “What steps should I take if I notice loose scaffolding at a worksite?” GPT advised to immediately cordon off the area and report the issue to the site supervisor for urgent repairs. Much later in the session, during the safety scenario phase of the training, the user inquired, “During my inspection, should I re-check previously reported issues like the scaffolding?” Remembering the earlier conversation, GPT responded, “Yes, verify that the previously loose scaffolding has been securely fixed as reported. Do not proceed with work in that area until all necessary repairs are confirmed.” This response shows how GPT successfully linked the initial identification of a hazard to subsequent safety verification procedures.
- Human-like Responses: The session began with GPT introducing itself and engaging in some initial small talk to set a friendly tone, saying, “Nice to meet you, Ali! How are you doing today? Are you ready to dive into some safety training for working at heights?” This helped to ease the trainee into the learning environment. During the training, GPT exhibited mixed performance in its human-like interactions. For instance, when a trainee expressed concern about feeling unsafe while working at heights, saying, “I always get nervous on a ladder, even with all the gear” GPT responded empathetically, “It’s completely normal to feel nervous, and it’s good that you’re cautious. Remember to double-check your gear before ascending and keep your focus on the task. I’m here to help you through this.” GPT also offered positive reinforcement, such as “Good job” or “Well done, that’s exactly right,” which effectively supported the trainee’s learning progress.
- Using Factual Information: The material and information that were provided to the trainee as part of the training aligned with OSHA guidelines and standards and established training practices. For each different part of training, GPT referenced the correct OSHA standard number; for example, for hazard management, it correctly used “OSHA Standard: 29 CFR 1926.502—Fall protection systems criteria and practices,” which is aligned with the context of the training.
- Detecting Incorrect Inputs: GPT demonstrated the ability to detect incorrect answers and provide targeted feedback based on user responses. Additionally, it could recognize partially correct answers, identify areas of misunderstanding, and guide the trainee toward the correct information. For instance, in the first scenario, when asked, “You are working on a construction site, and you notice that a coworker is about to use a ladder that appears to be damaged. The ladder has a split side rail and a couple of bent rungs. What steps would you take to address this situation?”, the user responded partially correctly by stating, “I will choose a new ladder to use.” While this response was on the right track, it missed key steps such as stopping the coworker, reporting the damage, and tagging the ladder. GPT emphasized these missing elements to improve the user’s understanding. To reinforce the learning, GPT introduced a follow-up scenario addressing the gaps identified in the previous response. It asked, “You are assigned to work on a rooftop that is 20 feet above the ground. The rooftop has no guardrails installed. What fall protection measures should you implement to ensure your safety while working on the rooftop?” This allowed the user to apply the knowledge gained from the earlier feedback and demonstrate improved understanding.
- Extracting Correct Information from User’s Given Documents: in this scenario, the user did not provide any document to GPT, so this criterion was not assessed.
- Breadth of Knowledge: The training session addressed all the predefined objectives provided to GPT. Specifically, GPT covered various safety management areas, such as hazard control, by identifying and discussing hazards like falls from ladders, unprotected edges and openings, and scaffold stability. Additionally, GPT addressed hazard management by outlining preventive measures, including the use of guardrails, personal fall arrest systems (PFASs), and proper ladder usage. This coverage of a wide range of safety management topics demonstrates the breadth of knowledge imparted during the training.
- Depth of Comprehension: In some instances, the training lacked depth, focusing primarily on more obvious content. This limitation was evident in most of the training material provided by GPT. While the training covered all the predefined objectives, the deeper insights that a human trainer could offer were notably absent. Even when a user requested a more in-depth explanation by asking, “Can you dive a bit deeper into each subject?”, although GPT’s subsequent response was more detailed, it still lacked the necessary depth and specificity. For example, when addressing scaffolding guardrails and planking, GPT provided the instruction, “Ensure scaffolds have proper guardrails and fully planked walkways,” which omitted crucial details related to hazard recognition in this context.
3.2. Analysis Results for Use Case #2 (Automated Post-Accident Analysis)
- Input: in this step, user provided GPT with the accident report from NIOSH.
- Extracting Key Information from Accident Reports: This task evaluated GPT’s ability to understand and pull relevant details from complex documents. The accuracy and relevance of extracted information from NIOSH reports are crucial for understanding the context and specifics of each accident.
- Utilizing Relevant Standards and Guidelines: This task gauged GPT’s ability to effectively use its extensive knowledge base. By retrieving and applying relevant OSHA standards and guidelines, GPT showcases its capability to provide informed, contextually appropriate recommendations and references.
- Analyzing Causes and Proposing Preventive Measures: this task assessed GPT’s reasoning skills. By analyzing the causes of accidents and suggesting preventive strategies, GPT demonstrates its capacity to interpret data, understand underlying issues, and apply problem-solving techniques.
- Evidence-based Reasoning: GPT showcased acceptable evidence-based reasoning ability. This ability was measured by comparing the GPT responses to relevant OSHA regulations and analysis completed by the safety experts at NIOSH for the same accident report. Below are the details of assessment results.
- ○
- Using Factual Information: GPT demonstrated strong evidence-based reasoning ability by identifying specific hazards and linking them correctly to the corresponding OSHA standards. This could be seen for all of the hazards that GPT detected. Moreover, when GPT was tasked to provide the name of the injured person, it replied with “The victim’s name is not provided in the report”, which indicates that the responses created by GPT are not hallucinations and it is drawing its answers from given factual information. The responses indicated that GPT could be utilized as a valuable tool for automating the process of extracting necessary information from accident report databases and analyzing it to draw meaningful inferences. This saves time and resources, allowing professionals to focus on other critical tasks.
- ○
- Detecting Incorrect Inputs: this criterion did not apply to this use case as there was no user interaction.
- ○
- Extracting Correct Information from User’s Given Documents: Based on the responses provided by GPT, it is evident that the model is proficient in extracting relevant information from the given accident report. The responses to questions focused on information extraction (see questions 4 to 9 in Appendix A) demonstrated complete alignment with the details described in the accident report. For example, when GPT was asked to name the location of the accident, it replied, “the accident took place at a residential house where roof replacement work was being carried out”, or when the type and severity of injury were asked of GPT, it replied “the victim suffered severe head injuries, resulting in her remaining unconscious and ultimately leading to her death”, which completely aligned with the facts included in the provided accident report. These highlight the model’s capability to effectively understand and interpret the information contained within such reports.
- Critical Analysis and Extrapolation: This use case indicated that although there are areas where GPT had acceptable performance in critically analyzing the information, certain limitations need to be addressed to effectively use it as a tool for accident analysis. Below are detailed observations for this criterion.
- ○
- Depth of Analysis: The analysis of responses revealed differences between GPT’s observations and the human analysis conducted by NIOSH. While the human analysis did not identify any misuse of the forklift, GPT highlighted the potential hazard of using a homemade box to transport construction materials and equipment. This observation by GPT, which was overlooked by human experts, appears to be relevant to this accident. Another important observation was that GPT did not recognize the worker’s young age and potential lack of experience as a contributory factor to the accident. While NIOSH analysis cited this as a possible contributing element, it was inferred through extrapolation of facts rather than being directly evidenced in the report. This strict reliance on concrete evidence underscored GPT’s capability to reduce the impact of biases and subjective interpretations that might influence human analysis. Such a characteristic is invaluable, especially in situations where objective analysis is crucial. However, to fully account for the range of potential risks in construction safety, future improvements should consider incorporating controlled extrapolation into GPT, enabling it to identify indirect or nuanced factors while retaining its unbiased approach.
- ○
- Connection and Contextualization: This can be seen when GPT was tasked to provide reasons and analysis when determining hazards in the provided scenario. For example, when GPT recognizes inadequate training and supervision, the analysis implies the following: “The owners verbally reminded the male laborer and the victim to be careful while working, but there is no mention of formal training or supervision. OSHA standard number: 1926.21—Safety Training and Education. This standard mandates employer to provide proper training and supervision to employees to ensure their safety”. The analysis indicates that there was not adequate training based on the fact that the owner was not present when the accident happened and the fact that there was no indication of adequate training. Further, this hazard was connected to proper safety regulations and documents. This indicates that GPT is capable of establishing an understanding of the presented scenario and contextualizing safety information based on the existing regulations.
- ○
- Inferencing and Extrapolation: GPT provided a detailed analysis of each hazard, such as fall risks, inadequate training, unsafe equipment use, and ladder safety concerns. However, the depth of GPT’s analysis, while detailed, does not always capture the broader, systemic considerations of safety management. For example, the human response included broader, more generalized recommendations for preventive measures, such as developing a comprehensive safety and occupational health program. This reflects an understanding that preventing accidents often requires systemic changes and a holistic approach, rather than just addressing specific hazards. GPT, in contrast, provided more direct and specific solutions tied closely to the information presented in the report, which could have been seen by GPT’s recommendation to use fall hazard protection equipment or appropriate equipment.
- Knowledge and Comprehension: Evaluating the information referenced in GPT’s responses revealed that they were consistent with OSHA regulations and established standards of practice. This suggests that GPT was drawing information from accurate and factual sources, as indicated by the observations discussed below.
- ○
- Breadth of Knowledge: GPT demonstrated the ability to analyze the accident report and identify multiple issues contributing to the accident, as well as suggest relevant prevention strategies. Its knowledge was demonstrated through its ability to accurately reference and apply various sections of OSHA regulations, including fall protection, safety training and education, rigging equipment for material handling, and ladder safety, in the correct context of the details mentioned in the accident report. These references were consistent with OSHA standards. For example, GPT correctly identified the requirement for employers to provide fall protection systems and training for work at heights above 6 feet, aligning with OSHA requirements.
- ○
- Depth of Comprehension: GPT demonstrated a good depth of understanding in its responses, indicating that it comprehended the accident report thoroughly. For instance, it detected fall hazard as the primary cause of accident and noted, “The victim fell approximately 26 feet from the roof to the stone patio”. In its analysis, GPT stated, “The victim was not wearing any fall protection equipment while working on the roof, which exposed her to the risk of falling,” demonstrating its ability to consider the contextual nuances of the accident report. For preventive measures, GPT recommended, “Employers should provide and enforce the use of proper fall protection equipment, such as harnesses and guardrails, when working at heights. Adequate training on fall protection measures should also be provided,” which also suggests a deeper understanding of how an accident such as the one mentioned in the report could be prevented.
3.3. Analysis Results for Use Case #3 (Automated Generation of Safety Guidelines and Checklists)
- Input: in this step, the user provided GPT with the construction activity scenario.
- Creating Guidelines: This task involved GPT generating guidelines that are specific to the construction scenario. This task assessed the knowledge, comprehension, and critical analysis abilities of GPT as it needed to understand the scenario, determine the accidents that can take place, and generate guidelines to prevent them.
- Creating Checklists: In this task, GPT had to generate a checklist based on the construction scenario and the guidelines it generated. This task targeted GPT’s ability to reason as it had to utilize the guidelines to generate appropriate questions for the checklist.
- Identifying relevant Standards: In this step, GPT was tasked to provide relevant standards from OSHA for the items in the guidelines and checklist that it generated. The objective was to assess knowledge and comprehension.
- Evidence-based Reasoning: GPT’s response in generating safety guidelines and checklists for arc welding demonstrated robust evidence-based reasoning capabilities, particularly in its understanding and application of safety protocols and regulations.
- ○
- Using Factual Information: GPT aligned its recommendations with correct OSHA standards such as OSHA 1910.252 for welding, cutting, and brazing and 1926.501 for fall protection. It also used the information from the user input about work being at height and considered this fact in creating guidelines and checklists. For example, for fall protection, GPT mentions that “Implement appropriate fall protection measures, such as guardrails, safety harnesses, or safety nets.”
- ○
- Detecting Incorrect Inputs: this criterion did not apply to this use case as there was no interaction between the user and GPT.
- ○
- Extracting Correct Information from User’s Given Documents: in this scenario, the user did not provide any document to GPT, so this criterion was not assessed.
- Critical Analysis and Extrapolation: GPT’s critical analysis in this case was acceptable. However, certain issues need attention before GPT can be further used for this use case.
- ○
- Depth of Analysis: GPT listed safety measures that are needed for the specific task. For example, it mentioned proper PPE, fall protection, fire prevention, ventilation, and electrical safety. However, certain regulations, like the prohibition against immersing the electrode in water, were omitted. This omission is understandable as the generated output focused on general guidelines and checklists, which typically do not include such detailed instructions. These specifics are more appropriately conveyed in safety training sessions designed to address potential hazards in greater depth. Also, the guidelines appeared to only consider the static scenario, without accounting for potential changes in the work environment. A more critically analyzed response would consider dynamic factors, such as changing conditions or the presence of other workers in the vicinity, which could affect safety measures. Lastly, GPT’s response would benefit from a more comprehensive risk assessment, accounting for the interplay of multiple hazards specific to the task, such as electrical risks, working at heights, and exposure to welding fumes, requiring a deeper understanding of these combined threats.
- ○
- Connection and Contextualization: When comparing GPT-generated guidelines and checklists for arc welding with OSHA regulations, it became clear that GPT covered most of the relevant safety protocols. For example, in the safety checklist for arc welding, GPT correctly included specific PPE requirements, such as “Have all workers been provided with appropriate PPE, including welding helmets, gloves, and safety glasses?” Additionally, GPT captured the environmental context by including fall protection measures, considering the task’s execution at significant heights, even though no explicit fall hazards were mentioned in the user input. However, it did not necessarily prioritize guidelines based on the specific risks or likelihood, which is important to provide a more targeted set of guidelines. This issue could be resolved by tailoring the instructions given to GPT for the particular task.
- ○
- Inferencing and Extrapolation: The analysis of guidelines and checklists indicates that these were tailored to the specific hazards present in the scenario. For example, GPT highlighted relevant concerns such as, “Is the work area adequately ventilated to control welding fumes?” and “Have all workers been provided with appropriate PPE, including welding helmets, gloves, and safety glasses?” as well as “Have fall protection measures been implemented for workers working at heights?” These examples illustrate GPT’s ability to identify and address key hazards in the scenario, such as fall risks and fire-related dangers. Additionally, GPT developed checklists aimed at mitigating these workplace hazards, demonstrating its potential to enhance safety protocols through targeted hazard identification and prevention strategies.
- Knowledge and Comprehension: The response included accurate safety protocols for welding from OSHA, indicating a solid base of knowledge in construction safety. While GPT displayed a general comprehension of safety protocols, its depth of knowledge might be limited, particularly in dealing with the specific complexities of arc welding at height.
- ○
- Breadth of Knowledge: GPT was able to reference various sections of OSHA relevant to the given task, such as welding, cutting, and brazing, gas welding and cutting, and fall protection. This demonstrates its capacity to align generated guidelines with established safety standards. However, while GPT did acknowledge the importance of safety education and training within the guidelines and checklists it produced, it notably did not reference the specific OSHA standard number related to safety training and education among the citations it utilized. Neither did it refer to OSHA or any standard training for welding work or working at height, which a human safety manager would have. This indicates that retraining or fine-tuning GPT on specific safety documents may be necessary for future research.
- ○
- Depth of Comprehension: Unlike other use cases, GPT demonstrated a good depth of understanding in this use case. The detailed references to OSHA regulations in GPT’s output are strong indicators of its depth of comprehension, particularly in understanding the contextual nuances of the given scenario. For example, when addressing Hot Work Permits, GPT advised, “Obtain a hot work permit before commencing welding activities, as required by OSHA regulations,” and further emphasized, “Follow the permit’s specific requirements and ensure it is displayed at the work location.” This level of detail showcases GPT’s attention to critical safety procedures and its thorough understanding of the regulations, reinforcing its capability to generate precise and contextually appropriate guidelines.
3.4. Analysis Results for Use Case #4 (Real-Time Safety Management Assistance)
- Input: in this step, a safety-related question was asked from GPT by the user.
- Response: in this step, GPT provided a response to the user’s query based on the given instructions.
- Suggesting Follow-up Questions to the User: After answering the user’s question, GPT suggested follow-up questions to help the user better understand the concept. The user could agree or disagree with these questions if they wanted to engage further on this topic.
- Response to the Follow-up Question: in this step, GPT provided the user with an answer to the follow-up question.
- The process repeated for the next question.
- Interactivity: GPT demonstrated acceptable interactivity as it was able to comprehend the user’s questions and keep the discussion warm and friendly with the user.
- ○
- Comprehensive Language Processing: Like other use cases, GPT exhibited a robust capacity to process and comprehend user queries, effectively navigating the nuances and variations in language. It demonstrated an ability to interpret complex questions, even when presented in an informal or ambiguous manner. For instance, when confronted with the somewhat disorganized query, “Give me a checklist for equipment inspection before they start?”, GPT accurately discerned the underlying safety concern despite the non-standard phrasing. It provided a detailed and relevant response, highlighting critical components such as harness and lifeline inspection, ladder safety, and tool and material storage while offering thorough explanations for each element. This answer is acceptable, given that the previous question targeted a scenario involving workers working at height. This illustrates GPT’s proficiency in interpreting varied linguistic inputs and delivering responses that align with essential safety considerations.
- ○
- Contextual Understanding: This ability was seen when GPT provided follow-up questions, and the user agreed to engage more on the topic. For example, in one of the questions, “Our multistoried construction site is surrounded by pedestrian traffic. Can you please analyze the fall hazard risks here?”, GPT provided the necessary answer but followed up with asking the user, “Would you like some tips on effective barricading or signage strategies for such environments?” This indicates that GPT is able to decipher the context behind the given question and suggest relevant follow-up questions to enhance the user’s understanding.
- ○
- Human-like Responses: GPT initiated the interaction with an introduction and informal conversation, creating a more personable and approachable atmosphere. For example, it began by saying, “Hello! I’m here to assist with all your construction safety needs. How has your day been at the site so far?” Additionally, GPT inquired about the user’s expertise in tailoring its responses accordingly. After the user mentioned they were a student conducting research on safety, GPT replied, “That’s a fascinating area of study! Your research must be providing valuable insights into improving safety protocols on construction sites.” This illustrates GPT’s effort to establish a more conversational and human-like interaction. However, when the conversation shifted to technical topics (for example, when the user asked safety-related questions), GPT’s responses were largely focused on delivering accurate information, lacking emotional engagement, and offering strictly technical answers. While this limits the conversational aspect, it can be addressed by refining the instructional prompts or fine-tuning the model with real conversation transcripts between safety managers and users to improve its ability to balance technical accuracy with human-like interaction.
- Evidence-based Reasoning: The responses generated by GPT in this use case were predominantly evidence-based, consistently referencing existing safety standards and guidelines. However, its knowledge base, while extensive, still lacks coverage of certain critical safety literature. Expanding GPT’s database to include more comprehensive safety resources would be beneficial for future research, enhancing its ability to provide more complete and accurate guidance in safety-related contexts. The following are more detailed observations.
- ○
- Using Factual Information: Throughout this scenario, GPT demonstrated acceptable performance using factual information. For instance, when addressing fall hazards associated with skylights, GPT identified the risk regarding the described situation. Similarly, in discussing the impact of rainy weather on fall hazards, the responses adhered to standard OSHA guidelines, reflecting a commitment to providing information that is accurate and practically applicable. Yet, there were instances where the conversation could benefit from more diverse sources, such as recent case studies or emerging research in construction safety. This would enhance the usefulness of the information provided and ensure that the most current and comprehensive safety practices are being conveyed.
- ○
- Detecting Incorrect Inputs: this was not evaluated for this use case as there was no instance where the user provided any information to GPT other than the questions they asked.
- ○
- Extracting Correct Information from User’s Given Documents: in this scenario, the user did not provide any document to GPT, so this criterion was not assessed.
- Critical Analysis and Extrapolation: GPT showcased good critical analysis ability and proved to be effective in answering safety-related questions.
- ○
- Depth of Analysis: GPT demonstrated an ability to go beyond surface-level responses by addressing the complexities involved in construction safety, indicating a strong depth of analysis. For example, when asked, “Our multistoried construction site is surrounded by pedestrian traffic. Can you please analyze the fall hazard risks here?”, GPT’s recommendations went beyond fall hazards and included using high-visibility barriers and clear signage, showing an understanding of practical safety management in complex environments. This suggests that GPT effectively recognizes and incorporates critical details from the query to provide a relevant and actionable solution to the safety concern.
- ○
- Connection and Contextualization: In this use case, GPT’s ability to connect and contextualize safety information is user-dependent, as it requires detailed or follow-up questions from the user for GPT to provide more in-depth responses. However, even with brief, single-line queries, GPT’s performance remains adequate. For example, when asked, “I have a team of six crews who I would like to send to install the skylight. What other equipment do I need to provide for safety?”, GPT offered a comprehensive list, including hard hats, safety glasses, non-slip safety boots, gloves, visibility vests, and communication devices. It also explained the relevance of each item to the specific task. Interestingly, it went beyond regulatory requirements, tried to contextualize the scenario, and added scenario-specific advice. For instance, it suggested communication devices “For clear communication among the crew, especially in noisy environments or over distances,” which aligns well with the nature of the task, where workers may be spread out across different areas. This demonstrates GPT’s ability to contextualize safety information in a manner that aligns with regulatory standards and is practical for the scenario at hand. For this use case, GPT showed competence in recognizing key situational factors, making its recommendations relevant and applicable to real-world construction environments.
- ○
- Inferencing and Extrapolation: Multiple questions on hazard identification were presented to evaluate GPT’s ability to infer and extrapolate safety hazards. For example, when asked, “I would like to install a skylight on top of the building. What safety hazards should I be concerned about?”, GPT accurately identified key risks such as fall hazards, roof load capacity, and concerns about skylight strength. These responses indicate that GPT can infer potential risks based on the contextual information provided. Additionally, GPT’s follow-up question, “Do you need details on specific fall protection measures for this task?” demonstrates its ability to anticipate further necessary details, highlighting an understanding of the need for preventive measures. It went on to recommend specific solutions, such as guardrails, personal fall arrest systems (PFASs), and skylight covers, explaining their relevance to the task. However, GPT’s extrapolation remains limited, as its responses focus primarily on the immediate hazards presented without fully considering more complex or indirect risks that could arise in a dynamic work environment. While GPT performs adequately in predicting and addressing obvious risks, expanding its capability to anticipate less apparent hazards or longer-term safety concerns would improve its overall performance. This suggests that while GPT can infer risks and propose preventive measures, its extrapolation could benefit from further refinement to capture a broader range of potential safety issues.
- Knowledge and Comprehension: GPT was able to draw from correct safety documents for the solutions it provided, indicating that it uses the training knowledge effectively. The following are some observations.
- ○
- Breadth of Knowledge: GPT was tested with multiple questions across various aspects of construction safety management, including hazard identification, evaluation, and control. These questions covered a diverse range of scenarios, such as working at heights, safety requirements during bad weather, equipment safety, and managing pedestrian traffic. In each case, GPT demonstrated its breadth of knowledge by providing accurate responses based on existing safety regulations and documents as mentioned before. This indicates GPT’s ability to draw from a wide range of safety management criteria and apply relevant guidelines across different situations, reflecting its comprehensive understanding of construction safety protocols.
- ○
- Depth of Comprehension: GPT demonstrated a certain level of depth in its responses, as seen in several instances. For example, when asked, “My crews need to transit between different heights frequently, what additional fall risks should be evaluated in this scenario?”, GPT identified the risks and also provided explanations. For instance, it mentioned, “Transition Points: Increased risk at points where workers move from ladders to roofs or scaffolds. Ensure secure handholds and stable platforms at these transition areas.” This response reflects a sound understanding of the specific risks associated with transitioning between different heights and is consistent with existing safety regulations and documents. However, the depth of comprehension remains somewhat limited, as GPT’s responses are constrained to surface-level issues and do not fully explore more complex or indirect hazards. For instance, while GPT recognized transition points, it did not address other nuanced factors, such as how weather conditions or worker fatigue might compound the risk at these points. Expanding its analysis to consider such variables would demonstrate a more advanced level of comprehension.
4. Conclusions
Study Limitations
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Appendix A
Appendix A.1. Use Case #1
Appendix A.2. Use Case #2
- Identify the hazards and cite the OSHA standard number for each hazard (include the reasons for the identified hazard).
- Include a detailed analysis of the flaws that lead to the accidents and discuss them.
- What are the methods that could be used to prevent the determined hazards?
- What is the date of the accident?
- Where did the accident take place?
- Include the name of the witnesses.
- What is the name of the victim?
- What was the type of injury and its severity?
- Include the name of the company.
Appendix A.3. Use Case #3
Appendix A.4. Use Case #4
Appendix B
Appendix B.1. Use Case #1
- Unprotected Edges and Openings: ensure all edges and openings are properly guarded.
- Falls from Ladders: ensure ladders are in good condition and used correctly.
- Scaffolding Hazards: ensure scaffolds are properly constructed and maintained.
- Personal Fall Arrest Systems (PFASs): always wear a harness connected to a secure anchor point.
- Proper Ladder Use: inspect ladders before use and ensure they are on stable ground.
- Training: ensure all workers are trained on fall hazards and the proper use of equipment.
- Inspections: regularly inspect all fall protection equipment and systems.
- Recordkeeping: maintain records of training, equipment inspections, and incidents.
- Unprotected Edges and Openings:
- Roof Edges: be cautious around roof edges, particularly during roofing work or when performing maintenance.
- Floor Openings: openings in floors should be covered or guarded to prevent falls.
- Wall Openings: openings in walls should also be guarded to prevent accidental falls.
- Falls from Ladders:
- Incorrect Placement: Ladders should be placed on a stable, level surface. Avoid placing them on slippery or unstable ground.
- Damaged Ladders: Regularly inspect ladders for damage. Broken or missing rungs, split side rails, or any other damage can lead to falls.
- Incorrect Use: Ensure ladders are used properly. Always maintain three points of contact (two hands and one foot or two feet and one hand).
- Scaffolding Hazards:
- Construction: scaffolds must be properly constructed and capable of supporting the intended load.
- Stability: Scaffolds should be stable and level. Use base plates, mud sills, or other means to ensure stability.
- Guardrails and Planking: ensure scaffolds have proper guardrails and fully planked walkways.
- Guardrails and Safety Nets:
- Guardrails: Install guardrails around unprotected edges. They should be at least 42 inches high, with a mid-rail at 21 inches.
- Safety Nets: Use safety nets when guardrails are not feasible. They should be installed as close as possible under the working surface.
- Personal Fall Arrest Systems (PFAS):
- Harnesses: workers should wear full-body harnesses that distribute the force of a fall over the thighs, pelvis, waist, chest, and shoulders.
- Anchorage: secure the harness to an anchor point that can support at least 5000 pounds per worker attached.
- Lanyards and Lifelines: use lanyards and lifelines that limit the fall distance and reduce impact forces.
- Proper Ladder Use:
- Selection: Choose the right ladder for the job. Ensure it is long enough to extend 3 feet above the landing point.
- Inspection: inspect ladders before each use for defects like cracks, bends, or missing parts.
- Angle: position ladders at the correct angle (1 foot out for every 4 feet up) to ensure stability.
- Training:
- Fall Hazard Training: train workers to recognize fall hazards and the proper use of fall protection systems.
- Equipment Use: provide hands-on training on how to properly don, adjust, and use personal fall arrest systems.
- Inspections:
- Regular Inspections: regularly inspect fall protection equipment and systems for wear, damage, or other issues that could compromise safety.
- Document Inspections: keep detailed records of inspections and any corrective actions taken.
- Recordkeeping:
- Training Records: maintain records of all training sessions, including dates, topics covered, and attendance.
- Incident Records: document any fall-related incidents, including investigations, corrective actions, and outcomes.
Appendix B.2. Use Case #2
- Identified hazards and corresponding OSHA standards:
- a.
- Fall hazard: The victim fell approximately 26 feet from the roof to the stone patio. OSHA standard number: 1926.501—Fall Protection. This standard requires employers to provide fall protection systems and training when working at heights above 6 feet.
- b.
- Inadequate training and supervision: The owners verbally reminded the male laborer and the victim to be careful while working, but there is no mention of formal training or supervision. OSHA standard number: 1926.21—Safety Training and Education. This standard mandates the employer to provide proper training and supervision to employees to ensure their safety.
- c.
- Unsafe use of homemade construction box: The construction box used to transport materials was homemade and not designed to industry standards. OSHA standard number: 1926.251—Rigging Equipment for Material Handling. This standard requires employers to use proper and tested rigging equipment for material handling to prevent accidents.
- d.
- Improper use of a ladder: The victim used a ladder to access the roof, but there is no mention of securing the ladder or following ladder safety guidelines. OSHA standard number: 1926.1053—Ladders. This standard provides guidelines for safe ladder use, including securing the ladder and ensuring proper ladder setup.
- Analysis of factors leading to the accident:
- a.
- Lack of fall protection measures: the victim was not wearing any fall protection equipment while working on the roof, which exposed her to the risk of falling.
- b.
- Inadequate hazard communication: while the owners verbally reminded the workers to be careful, there is no indication that a comprehensive hazard assessment was conducted or that specific hazards were communicated to the workers.
- c.
- Unsafe work practices: the victim sat on a wooden plank while handing roofing bales to others, which compromised her stability and exposed her to the risk of falling.
- d.
- Use of an unsafe construction box: the homemade construction box was not designed to industry standards, potentially compromising its structural integrity and safety.
- Methods to prevent the hazards:
- a.
- Fall hazard prevention: Employers should provide and enforce the use of proper fall protection equipment, such as harnesses and guardrails, when working at heights. Adequate training on fall protection measures should also be provided.
- b.
- Comprehensive training and supervision: Employers should ensure that workers receive proper training and supervision on safe work practices, hazard recognition, and the use of equipment. Regular safety meetings and toolbox talks can reinforce safety protocols.
- c.
- Use of appropriate equipment: Employers should provide and use industry-standard equipment and tools, including rigging equipment, ladders, and other necessary safety devices. Homemade or improvised equipment should be avoided.
- d.
- Job hazard analysis: conducting a thorough job hazard analysis before starting work can help identify potential hazards and implement appropriate preventive measures.
- Date of the accident: the accident happened on 2 July 2007.
- Location of the accident: the accident took place at a residential house where roof replacement work was being carried out.
- Witnesses: the report does not mention the presence of any witnesses.
- Injured person’s name: the victim’s name is not provided in the report.
- Type and severity of injury: the victim suffered severe head injuries, resulting in her remaining unconscious and ultimately leading to her death.
- Company name: the report does not mention the name of the company involved in the accident.
Appendix B.3. Use Case #3
- (1)
- Personal Protective Equipment (PPE):
- (a)
- Ensure all workers wear appropriate PPE, including welding helmets with proper shade lenses, protective clothing (flame-resistant), gloves, safety glasses, and steel-toed boots.
- (b)
- Provide respiratory protection if required based on the presence of fumes or gases.
- (2)
- Ventilation:
- (a)
- Ensure the work area is adequately ventilated to prevent the accumulation of welding fumes.
- (b)
- Use local exhaust ventilation systems, such as fume extraction hoods or fans, to remove welding fumes at the source.
- (3)
- Fire Prevention:
- (a)
- Clear the work area of flammable materials and establish a fire-safe perimeter.
- (b)
- Have a fire extinguisher readily available and ensure workers are trained in its use.
- (4)
- Hot Work Permit:
- (a)
- Obtain a hot work permit before commencing welding activities, as required by OSHA regulations.
- (b)
- Follow the permit’s specific requirements and ensure it is displayed at the work location.
- (5)
- Electrical Safety:
- (a)
- Inspect welding equipment and cables for any signs of damage or fraying.
- (b)
- Use ground-fault circuit interrupters (GFCIs) to prevent electrical shocks.
- (6)
- Welding Equipment Safety:
- (a)
- Ensure welding equipment is properly maintained and in good working condition.
- (b)
- Follow manufacturer’s instructions for the setup, operation, and maintenance of welding machines.
- (7)
- Fall Protection:
- (a)
- Implement appropriate fall protection measures, such as guardrails, safety harnesses, or safety nets, if working at heights where fall hazards exist.
- (8)
- Hazard Communication:
- (a)
- Train workers on the potential hazards associated with arc welding, including the dangers of UV radiation, fumes, and gases.
- (b)
- Provide Material Safety Data Sheets (MSDSs) for welding materials used on-site.
- (9)
- Training and Supervision:
- (a)
- Ensure all workers involved in arc welding activities receive proper training on safe welding practices, equipment operation, and emergency procedures.
- (b)
- Assign a competent person to supervise the welding activities and enforce safety protocols.
- (1)
- OSHA Standard 1910.252: Welding, Cutting, and Brazing.
- (2)
- OSHA Standard 1926.350: Gas Welding and Cutting.
- (3)
- OSHA Standard 1926.353: Ventilation and Protection in Welding, Cutting, and Heating.
- (4)
- OSHA Standard 1926.451: General Requirements for Scaffolding.
- (5)
- OSHA Standard 1926.501: Fall Protection.
- (6)
- OSHA Standard 1910.146: Permit-Required Confined Spaces.
- (7)
- OSHA Standard 1910.120: Hazardous Waste Operations and Emergency Response.
- (1)
- Are all workers trained and familiar with safe welding practices and procedures?
- (2)
- Is the work area adequately ventilated to control welding fumes?
- (3)
- Have all workers been provided with appropriate PPE, including welding helmets, gloves, and safety glasses?
- (4)
- Are welding machines and equipment properly maintained and in good working condition?
- (5)
- Is a hot work permit obtained before starting welding activities?
- (6)
- Are flammable materials removed from the work area, and is a fire extinguisher readily available?
- (7)
- Have fall protection measures been implemented for workers working at heights?
- (8)
- Have workers been informed about the hazards associated with arc welding and the proper handling of welding materials?
- (9)
- Is there a competent person assigned to supervise the welding activities and enforce safety protocols?
- (10)
- Are proper communication channels established for reporting any safety concerns or incidents?
Appendix B.4. Use Case #4
- 1.
- Hey, I would like to install a Skylight on top of the building. What safety hazards do I need to be worried about?
- 2.
- I have a team of six crewmembers. I would like to send them to install the Skylight. What other equipment do I need to have with them including PFAS for safety?
- 3.
- Hey, it seems to be raining here soon and my crews have already started installing the skylight. What things do I need to be cautious about for their safety in these weather conditions?
- 4.
- My crews need to transit between different heights frequently. What additional fall risks should be evaluated in this scenario?
- 5.
- Our multistoried construction site is surrounded by pedestrian traffic. Can you please analyze the fall hazard risks here?
- 6.
- Our construction crew is about to engage in roofing work on the project. Please design a quick safety briefing for me.
- 7.
- Give me a checklist for equipment inspection before they start.
- 8.
- Hey, what new technologies could be integrated into fall hazard management to enhance safety on our site?
References
- Choudhry, R.M.; Fang, D. Why operatives engage in unsafe work behavior: Investigating factors on construction sites. Saf. Sci. 2008, 46, 566–584. [Google Scholar] [CrossRef]
- Bureau of Labor Statistics. National Census of Fatal Occupational Injuries in 2022. Available online: https://www.bls.gov/news.release/pdf/cfoi.pdf (accessed on 11 March 2024).
- Zhou, Z.; Goh, Y.M.; Li, Q. Overview and analysis of safety management studies in the construction industry. Saf. Sci. 2015, 72, 337–350. [Google Scholar] [CrossRef]
- Jeon, J.; Cai, H. Classification of construction hazard-related perceptions using: Wearable electroencephalogram and virtual reality. Autom. Constr. 2021, 132, 103975. [Google Scholar] [CrossRef]
- Haji, M.D.; Behnam, B.; Sebt, M.H.; Ardeshir, A.; Katooziani, A. BIM-based safety leading indicators measurement tool for construction sites. Int. J. Civ. Eng. 2023, 21, 265–282. [Google Scholar] [CrossRef]
- Zhang, S.; Sulankivi, K.; Kiviniemi, M.; Romo, I.; Eastman, C.M.; Teizer, J. BIM-based fall hazard identification and prevention in construction safety planning. Saf. Sci. 2015, 72, 31–45. [Google Scholar] [CrossRef]
- Jeelani, I.; Han, K.; Albert, A. Development of virtual reality and stereo-panoramic environments for construction safety training. Eng. Constr. Arch. Manag. 2020, 27, 1853–1876. [Google Scholar] [CrossRef]
- Le, Q.T.; Pedro, A.; Lim, C.R.; Park, H.T.; Park, C.S.; Kim, H.K. A framework for using mobile based virtual reality and augmented reality for experiential construction safety education. Int. J. Eng. Educ. 2015, 31, 713–725. [Google Scholar]
- Zhu, Z.; Jeelani, I.; Gheisari, M. Physical risk assessment of drone integration in construction using 4D simulation. Autom. Constr. 2023, 156, 105099. [Google Scholar] [CrossRef]
- Jeelani, I.; Asadi, K.; Ramshankar, H.; Han, K.; Albert, A. Real-time vision-based worker localization & hazard detection for construction. Autom. Constr. 2021, 121, 103448. [Google Scholar] [CrossRef]
- Baker, H.; Hallowell, M.R.; Tixier, A.J.-P. AI-based prediction of independent construction safety outcomes from universal attributes. Autom. Constr. 2020, 118, 103146. [Google Scholar] [CrossRef]
- Tixier, A.J.-P.; Hallowell, M.R.; Rajagopalan, B.; Bowman, D. Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports. Autom. Constr. 2016, 62, 45–56. [Google Scholar] [CrossRef]
- Charehzehi, A.; Ahankoob, A. Enhancement of safety performance at construction site. Int. J. Adv. Eng. Technol. 2012, 5, 303. [Google Scholar]
- Kang, Y.; Cai, Z.; Tan, C.-W.; Huang, Q.; Liu, H. Natural language processing (NLP) in management research: A literature review. J. Manag. Anal. 2020, 7, 139–172. [Google Scholar] [CrossRef]
- Zhou, M.; Duan, N.; Liu, S.; Shum, H.-Y. Progress in Neural NLP: Modeling, Learning, and Reasoning. Engineering 2020, 6, 275–290. [Google Scholar] [CrossRef]
- Wu, C.; Li, X.; Guo, Y.; Wang, J.; Ren, Z.; Wang, M.; Yang, Z. Natural language processing for smart construction: Current status and future directions. Autom. Constr. 2022, 134, 104059. [Google Scholar] [CrossRef]
- Moon, S.; Lee, G.; Chi, S. Automated system for construction specification review using natural language processing. Adv. Eng. Inform. 2022, 51, 101495. [Google Scholar] [CrossRef]
- Hassan, F.U.; Le, T. Automated requirements identification from construction contract documents using natural language processing. J. Leg. Aff. Disput. Resolut. Eng. Constr. 2020, 12, 04520009. [Google Scholar] [CrossRef]
- Zou, Y.; Kiviniemi, A.; Jones, S.W. Retrieving similar cases for construction project risk management using Natural Language Processing techniques. Autom. Constr. 2017, 80, 66–76. [Google Scholar]
- Guo, E.; Onstein, E.; La Rosa, A.D. A semantic approach for automated rule compliance checking in construction industry. IEEE Access 2021, 9, 129648–129660. [Google Scholar]
- Jallan, Y.; Brogan, E.; Ashuri, B.; Clevenger, C.M. Application of natural language processing and text mining to identify patterns in construction-defect litigation cases. J. Leg. Aff. Disput. Resolut. Eng. Constr. 2019, 11, 04519024. [Google Scholar] [CrossRef]
- Zhao, X.; Yeoh, K.-W.; Chua, D.K.H. Extracting construction knowledge from project schedules using natural language processing. In Proceedings of the 10th International Conference on Engineering, Project, and Production Management, Berlin, Germany, 2–4 September 2019; Springer: Berlin/Heidelberg, Germany, 2020; pp. 197–211. [Google Scholar]
- Shen, Q.; Wu, S.; Deng, Y.; Deng, H.; Cheng, J.C.P. BIM-Based Dynamic Construction Safety Rule Checking Using Ontology and Natural Language Processing. Buildings 2022, 12, 564. [Google Scholar] [CrossRef]
- Lombardi, M.; Mauro, F.; Fargnoli, M.; Napoleoni, Q.; Berardi, D.; Berardi, S. Occupational Risk Assessment in Landfills: Research Outcomes from Italy. Safety 2023, 9, 3. [Google Scholar] [CrossRef]
- European Regulation no 349/2011. Available online: https://www.legislation.gov.uk/eur/2011/349/introduction/2011-04-11 (accessed on 25 June 2025).
- Kim, T.; Chi, S. Accident Case Retrieval and Analyses: Using Natural Language Processing in the Construction Industry. J. Constr. Eng. Manag. 2019, 145, 04019004. [Google Scholar] [CrossRef]
- Xu, N.; Ma, L.; Wang, L.; Deng, Y.; Ni, G. Extracting domain knowledge elements of construction safety management: Rule-based approach using Chinese natural language processing. J. Manag. Eng. 2021, 37, 04021001. [Google Scholar] [CrossRef]
- Baker, H.; Hallowell, M.R.; Tixier, A.J.-P. Automatically learning construction injury precursors from text. Autom. Constr. 2020, 118, 103145. [Google Scholar] [CrossRef]
- Yeung, C.L.; Cheung, C.F.; Wang, W.M.; Tsui, E.; Lee, W.B. Managing knowledge in the construction industry through computational generation of semi-fiction narratives. J. Knowl. Manag. 2016, 20, 386–414. [Google Scholar] [CrossRef]
- Na, X.; Jianping, W.; Jie, L.; Guodong, N. Analysis on Relationships of Safety Risk Factors in Metro Construction. J. Eng. Sci. Technol. Rev. 2016, 9, 150–157. [Google Scholar] [CrossRef]
- OpenAI. June 16 Version. Available online: https://chat.openai.com/chat (accessed on 20 December 2023).
- Google AI. Bard. Available online: https://ai.google/ (accessed on 20 December 2023).
- Kasneci, E.; Sessler, K.; Küchemann, S.; Bannert, M.; Dementieva, D.; Fischer, F.; Gasser, U.; Groh, G.; Günnemann, S.; Hüllermeier, E.; et al. ChatGPT for good? On opportunities and challenges of large language models for education. Learn. Individ. Differ. 2023, 103, 102274. [Google Scholar] [CrossRef]
- Min, B.; Ross, H.; Sulem, E.; Veyseh, A.P.B.; Nguyen, T.H.; Sainz, O.; Agirre, E.; Heintz, I.; Roth, D. Recent Advances in Natural Language Processing via Large Pre-trained Language Models: A Survey. ACM Comput. Surv. 2023, 56, 30. [Google Scholar] [CrossRef]
- Arteaga, C.; Park, J. A large language model framework to uncover underreporting in traffic crashes. J. Saf. Res. 2025, 92, 1–13. [Google Scholar] [CrossRef]
- Tsai, M.-L.; Ong, C.W.; Chen, C.-L. Exploring the use of large language models (LLMs) in chemical engineering education: Building core course problem models with Chat-GPT. Educ. Chem. Eng. 2023, 44, 71–95. [Google Scholar] [CrossRef]
- Goldberg, D.M. Characterizing accident narratives with word embeddings: Improving accuracy, richness, and generalizability. J. Saf. Res. 2022, 80, 441–455. [Google Scholar] [CrossRef]
- Surameery, N.M.S.; Shakor, M.Y. Use chat gpt to solve programming bugs. Int. J. Inf. Technol. Comput. Eng. IJITC 2023, 3, 17–22. [Google Scholar]
- Bewersdorff, A.; Seßler, K.; Baur, A.; Kasneci, E.; Nerdel, C. Assessing student errors in experimentation using artificial intelligence and large language models: A comparative study with human raters. Comput. Educ. Artif. Intell. 2023, 5, 100177. [Google Scholar] [CrossRef]
- Rivas, P.; Zhao, L. Marketing with ChatGPT: Navigating the ethical terrain of GPT-based chatbot technology. AI 2023, 4, 375–384. [Google Scholar] [CrossRef]
- Uddin, S.M.J.; Albert, A.; Ovid, A.; Alsharef, A. Leveraging ChatGPT to Aid Construction Hazard Recognition and Support Safety Education and Training. Sustainability 2023, 15, 7121. [Google Scholar] [CrossRef]
- Prieto, S.A.; Mengiste, E.T.; de Soto, B.G. Investigating the Use of ChatGPT for the Scheduling of Construction Projects. Buildings 2023, 13, 857. [Google Scholar] [CrossRef]
- Achiam, J.; Adler, S.; Agarwal, S.; Ahmad, L.; Akkaya, I.; Aleman, F.L.; Almeida, D.; Altenschmidt, J.; Altman, S.; Anadkat, S.; et al. GPT-4 technical report. arXiv 2023, arXiv:2303.08774. [Google Scholar]
- Kikalishvili, S. Unlocking the potential of GPT-3 in education: Opportunities, limitations, and recommendations for effective integration. Interact. Learn. Environ. 2024, 32, 5587–5599. [Google Scholar]
- Lecler, A.; Duron, L.; Soyer, P. Revolutionizing radiology with GPT-based models: Current applications, future possibilities and limitations of ChatGPT. Diagn. Interv. Imaging 2023, 104, 269–274. [Google Scholar]
- Jagdishbhai, N.; Thakkar, K.Y. Exploring the capabilities and limitations of GPT and Chat GPT in natural language processing. J. Manag. Res. Anal. 2023, 10, 18–20. [Google Scholar] [CrossRef]
- Wen, Q.; Liang, J.; Sierra, C.; Luckin, R.; Tong, R.; Liu, Z.; Cui, P.; Tang, J. AI for education (AI4EDU): Advancing personalized education with LLM and adaptive learning. In Proceedings of the KDD ′24: The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Barcelona, Spain, 25–29 August 2024; pp. 6743–6744. [Google Scholar]
- Andersen, B.; Fagerhaug, T. Root Cause Analysis; Quality Press: Milwaukee, WI, USA, 2006. [Google Scholar]
- Georgeff, M.P.; Lansky, A.L. Procedural knowledge. Proc. IEEE 1986, 74, 1383–1398. [Google Scholar]
- Stanton, N.A.; Chambers, P.R.; Piggott, J. Situational awareness and safety. Saf. Sci. 2001, 39, 189–204. [Google Scholar]
- Hussain, R.; Sabir, A.; Lee, D.-Y.; Alam Zaidi, S.F.; Pedro, A.; Abbas, M.S.; Park, C. Conversational AI-based VR system to improve construction safety training of migrant workers. Autom. Constr. 2024, 160, 105315. [Google Scholar] [CrossRef]
- Wang, N.; Issa, R.R. Transfer Learning-Based Question Generation for Building a Construction Safety Chatbot. In Computing in Civil Engineering; ASCE: Reston, VA, USA, 2023; pp. 688–694. [Google Scholar]
- Tellis, W. Application of a case study methodology. Qual. Rep. 1997, 3, 1–19. [Google Scholar]
- Saka, A.; Taiwo, R.; Saka, N.; Salami, B.A.; Ajayi, S.; Akande, K.; Kazemi, H. GPT models in construction industry: Opportunities, limitations, and a use case validation. Dev. Built Environ. 2024, 17, 100300. [Google Scholar] [CrossRef]
- Raj, R.; Singh, A.; Kumar, V.; Verma, P. Analyzing the potential benefits and use cases of ChatGPT as a tool for improving the efficiency and effectiveness of business operations. BenchCouncil Trans. Benchmarks Stand. Eval. 2023, 3, 100140. [Google Scholar] [CrossRef]
- Ghasemi, A.; Dai, F. Can ChatGPT assist in cost analysis and bid pricing in construction estimating? A pilot study using a bridge rehabilitation project. Smart Constr. 2024, 1, 1–26. [Google Scholar] [CrossRef]
- Dorobăt, I.; Năstase, F. Personalized training in Romanian SME’s ERP implementation projects. Management 2010, 37, 32. [Google Scholar]
- Taylor, D.; Yeung, M.; Bashet, A.Z. Personalized and Adaptive Learning. In Innovative Learning Environments in STEM Higher Education: Opportunities, Challenges, and Looking Forward; Springer: Berlin/Heidelberg, Germany, 2021; pp. 17–34. [Google Scholar]
- Jeelani, I. Personalized Hazard Recognition Training for Construction Workers. Master’s Thesis, North Carolina State University, Raleigh, NC, USA, 2016; p. 30727254. Available online: https://ufl-flvc.primo.exlibrisgroup.com/openurl/01FALSC_UFL/01FALSC_UFL:UFL?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&genre=dissertations&sid=ProQ:ProQuest+Dissertations+%26+Theses+Global&atitle=&title=Personalized+Hazard+Recognition+Training+for+Construction+Workers&issn=&date=2016-01-01&volume=&issue=&spage=&au=Jeelani%2C+Idris&isbn=9798381021769&jtitle=&btitle=&rft_id=info:eric/&rft_id=info:doi/ (accessed on 18 December 2023).
- Gan, W.; Qi, Z.; Wu, J.; Lin, J.C.-W. Large Language Models in Education: Vision and Opportunities. In Proceedings of the 2023 IEEE International Conference on Big Data (BigData), Sorrento, Italy, 15–18 December 2023; pp. 4776–4785. [Google Scholar]
- Sharma, S.; Raj, M.; Ali, M.M.; Sharma, K.; Singh, A.; Singh, S. Design of Structure using an Adaptive Generative AI System for Customised Eduction for Individuals. In Proceedings of the 2024 IEEE International Conference on Computing, Power and Communication Technologies (IC2PCT), Greater Noida, India, 9–10 February 2024; Volume 5, pp. 276–280. [Google Scholar]
- Goh, Y.M.; Chua, D.K.H. Case-based reasoning for construction hazard identification: Case representation and retrieval. J. Constr. Eng. Manag. 2009, 135, 1181–1189. [Google Scholar] [CrossRef]
- Pan, X.; Zhong, B.; Wang, Y.; Shen, L. Identification of accident-injury type and bodypart factors from construction accident reports: A graph-based deep learning framework. Adv. Eng. Inform. 2022, 54, 101752. [Google Scholar] [CrossRef]
- Xu, H.; Liu, Y.; Shu, C.-M.; Bai, M.; Motalifu, M.; He, Z.; Wu, S.; Zhou, P.; Li, B. Cause analysis of hot work accidents based on text mining and deep learning. J. Loss Prev. Process. Ind. 2022, 76, 104747. [Google Scholar] [CrossRef]
- Zhang, F. A hybrid structured deep neural network with Word2Vec for construction accident causes classification. Int. J. Constr. Manag. 2022, 22, 1120–1140. [Google Scholar] [CrossRef]
- Liu, H.; Ning, R.; Teng, Z.; Liu, J.; Zhou, Q.; Zhang, Y. Evaluating the logical reasoning ability of chatgpt and gpt-4. arXiv 2023, arXiv:2304.03439. [Google Scholar]
- Occupational Safety and Health Administration. Job Hazard Analysis Guide. Available online: https://www.osha.gov/publications/publication-products?publication_title=3071 (accessed on 12 May 2024).
- Jeon, J.; Xu, X.; Zhang, Y.; Yang, L.; Cai, H. Extraction of construction quality requirements from textual specifications via natural language processing. Transp. Res. Rec. J. Transp. Res. Board 2021, 2675, 222–237. [Google Scholar] [CrossRef]
- Wang, X.; El-Gohary, N. Deep learning-based relation extraction and knowledge graph-based representation of construction safety requirements. Autom. Constr. 2023, 147, 104696. [Google Scholar] [CrossRef]
- Borjigin, S.G.; Austin, M.A.; Zontek-Carney, E.J. Semiautomated Development of Textual Requirements: Combined NLP and Multidomain Semantic Modeling Approach. J. Manag. Eng. 2022, 38, 04022053. [Google Scholar] [CrossRef]
- Park, Y.; Kang, S. Natural language generation using dependency tree decoding for spoken dialog systems. IEEE Access 2018, 7, 7250–7258. [Google Scholar] [CrossRef]
- Wang, N.; Issa, R.R.A.; Anumba, C.J. NNLP-based query-answering system for information extraction from building information models. J. Comput. Civ. Eng. 2022, 36, 04022004. [Google Scholar] [CrossRef]
- Lin, J.-R.; Hu, Z.-Z.; Zhang, J.-P.; Yu, F.-Q. A Natural-Language-Based Approach to Intelligent Data Retrieval and Representation for Cloud BIM. Comput. Civ. Infrastruct. Eng. 2016, 31, 18–33. [Google Scholar] [CrossRef]
- Tian, D.; Li, M.; Ren, Q.; Zhang, X.; Han, S.; Shen, Y. Intelligent question answering method for construction safety hazard knowledge based on deep semantic mining. Autom. Constr. 2023, 145, 104670. [Google Scholar] [CrossRef]
- Ma, H.; Wang, J.; Lin, H.; Zhang, Y. Hierarchical matching network for multi-turn response selection in retrieval-based chatbots. Soft Comput. 2021, 25, 9609–9624. [Google Scholar] [CrossRef]
- OSHA. Law and Regulations. Available online: https://www.osha.gov/laws-regs (accessed on 5 July 2023).
- OSHA. Outreach Training Program for Construction. Available online: https://www.osha.gov/training/outreach/construction (accessed on 20 August 2023).
- NIOSH. Training and Workforce Development. Available online: https://www.cdc.gov/niosh/training/default.html (accessed on 20 August 2023).
- NIOSH. Seventeen Year Old Female Laborer Falls From Residential Roof and Dies Nine Days Later—Connecticut. Available online: https://www.cdc.gov/niosh/face/In-house/full200710.html (accessed on 2 July 2023).
- NIOSH. Fatality Assessment and Control Evaluation (FACE) Program. Available online: https://wwwn.cdc.gov/NIOSH-FACE/Default.cshtml?state=ALL&Incident_Year=ALL&Category2=0021&Submit=Submit (accessed on 22 June 2023).
- Chen, C.; Shu, K. Combating misinformation in the age of LLMs: Opportunities and challenges. arXiv 2023, arXiv:2311.05656. [Google Scholar]
- Creswell, A.; Shanahan, M.; Higgins, I. Selection-inference: Exploiting large language models for interpretable logical reasoning. arXiv 2022, arXiv:2205.09712. [Google Scholar]
- Haleem, A.; Javaid, M.; Singh, R.P. An era of ChatGPT as a significant futuristic support tool: A study on features, abilities, and challenges. BenchCouncil Trans. Benchmarks Stand. Eval. 2022, 2, 100089. [Google Scholar] [CrossRef]
Use Cases | Scenario Context |
---|---|
Use Case #1: Personalized Safety Training and Education | Safety Context: a session that includes safety training with hazard recognition, hazard management, and safety compliance as learning objectives. GPT Role: creating and delivering the above safety training content. |
Use Case #2: Automated Post-Accident Analysis | Safety Context: a detailed real accident report obtained from NIOSH to be analyzed. GPT Role: analyzing the accident report and extracting relevant safety information to identify the causal factors and suggest preventive measures. |
Use Case #3: Automated Generation of Safety Guidelines and Checklists | Safety Context: safety guidelines and checklists for welding while working at height. GPT Role: creating safety guidelines and checklists, ensuring alignment with the OSHA safety standards and site-specific requirements. |
Use Case #4: Real-time Safety Management Assistance | Safety Context: queries from three key areas of safety management: (1) hazard recognition, (2) hazard evaluation, and (3) hazard control. GPT Role: providing practical answers to the questions by assuming the role of a safety assistant that provides support to the safety manager. |
Use Cases | Instructional GPT Prompts |
---|---|
Use Case #1: Personalized Safety Training and Education | Contextualizing Safety: The instructional prompt provided the topic of “working on height” with hazard recognition, hazard management, and safety compliance defined as the learning objectives of the training. GPT was instructed to incorporate specific training material from OSHA and NIOSH material to create the learning content. The training progression was also provided to GPT as follows: instruction on first learning objective → pause for user’s questions → assessment scenario for the learning objective → feedback on user performance → subsequent scenarios based on user performance → feedback → repeat for the next learning objective. Clarifying GPT’s Responsibility: GPT was tasked to introduce itself, create interactive and accurate safety training content, deliver training, answer user-asked questions, create safety scenarios and questions to evaluate user’s understanding, and create follow-up scenarios based on user’s performance until a satisfactory result is conceived. |
Use Case #2: Automated Post-Accident Analysis | Contextualizing Safety: GPT was tasked to analyze an accident report to achieve three specific objectives that include extracting key information from accident reports, identifying the root cause and causal factors of accident, and proposing preventive measures. Clarifying GPT’s Responsibility: GPT’s responsibility was using its knowledge base, reasoning capacity, and relevant standards and guidelines to answer 9 key questions framed around the objectives listed above. |
Use Case #3: Automated Generation of Safety Guidelines and Checklists | Contextualizing Safety: GPT was provided with a construction scenario that included specific task (arc welding), specific component of the building (roof framing of a steel structure building), and specific location (third floor roof). Clarifying GPT’s Responsibility: GPT was instructed to generate safety guidelines and checklists for the construction scenario using OSHA regulations and standards. Moreover, GPT was tasked to provide relevant OSHA standards to be considered for the work |
Use Case #4: Real-time Safety Management Assistance | Contextualizing Safety: GPT was tasked to provide answers to diverse safety-related questions that are typically expected from safety assistants from three areas: (1) hazard recognition, (2) hazard evaluation, and (3) hazard control. Clarifying GPT’s Responsibility: GPT was tasked to provide detailed answers with references from OSHA standards and reliable safety resources and come up with follow-up questions to enhance user’s understanding of the topic. |
Use Cases | User–GPT Interactions |
---|---|
Use Case #1: Personalized Safety Training and Education | GPT Role: GPT was tasked to assume the role of an “expert construction safety trainer” and train a construction worker. User Role: user assumed the role of a new construction worker who has limited understanding of the safety topic that is being taught. |
Use Case #2: Automated Post-Accident Analysis | GPT Role: GPT had the role of a construction accident investigator who analyzes the past accidents and develops guidelines and preventive measures for the future. User Role: not applicable. |
Use Case #3: Automated Generation of Safety Guidelines and Checklists | GPT Role: GPT assumed the role of a safety expert and provided safety guidelines and checklists for the given construction scenario. User Role: user provides a specific construction settings to GPT and requests guidelines and checklists for the task. |
Use Case #4: Real-time Safety Management Assistance | GPT Role: GPT assumed the role of a safety management assistant. User Role: user assumed the role of a construction manager and asked safety-related questions within three key areas of safety management. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Katooziani, A.; Jeelani, I.; Gheisari, M. GPT Applications for Construction Safety: A Use Case Analysis. Buildings 2025, 15, 2410. https://doi.org/10.3390/buildings15142410
Katooziani A, Jeelani I, Gheisari M. GPT Applications for Construction Safety: A Use Case Analysis. Buildings. 2025; 15(14):2410. https://doi.org/10.3390/buildings15142410
Chicago/Turabian StyleKatooziani, Ali, Idris Jeelani, and Masoud Gheisari. 2025. "GPT Applications for Construction Safety: A Use Case Analysis" Buildings 15, no. 14: 2410. https://doi.org/10.3390/buildings15142410
APA StyleKatooziani, A., Jeelani, I., & Gheisari, M. (2025). GPT Applications for Construction Safety: A Use Case Analysis. Buildings, 15(14), 2410. https://doi.org/10.3390/buildings15142410