Predictive Maintenance MCP: An Open-Source Framework for Bridging Large Language Models and Industrial Condition Monitoring via the Model Context Protocol
Featured Application
Abstract
1. Introduction
- Governed perimeter for PdM. The LLM is constrained to explicit MCP resources and tool invocations, limiting unsupported inferences and enabling controlled access to plant data.
- Deterministic tool chain. Established analyses are exposed as typed server-side tools, improving reproducibility compared to unconstrained text-only interaction.
- Persistent artifacts for the chain of evidence. Each tool execution produces inspectable reports and metadata that record inputs, parameters, and results, supporting auditability and post hoc review.
- MCP standardization as an integration mechanism. The PoC provides an open-source MCP reference implementation tailored to vibration-based condition monitoring, intended as a baseline for community-driven extensions toward industrial deployment.
2. Background
2.1. Model Context Protocol (MCP)
- Resources are endpoints for direct and structured access to data and context, including local files, databases, and technical documents;
- Tools are server-side functions with typed inputs and outputs that perform deterministic operations such as signal analysis, normative calculations and database queries;
2.2. Vibration and Envelope Analysis for Rolling Bearing Diagnostics
2.3. Vibration Severity Assessment
2.4. Anomaly Detection with Semi-Supervised Models
- One-Class SVM (OC-SVM) which learns a decision boundary enclosing the normal region in feature space with radial basis function kernel and controlling the expected fraction of outliers through parameter . At inference, segments outside the learned boundary are flagged as anomalous [49].
- guiding the user in selecting the baseline and parameters while avoiding arbitrary assumptions;
- interpreting numerical outputs with conservative language;
- linking novelty detection outcomes to subsequent physical verification such as FFT and envelope analysis and reporting, reducing the risk of conclusions not supported by evidence.
3. System Architecture
- FFT analysis and spectral component identification;
- Envelope analysis for rolling element bearing diagnostics;
- Vibration severity assessment according to ISO 20816-3;
- Anomaly detection based on machine learning models trained on reference baselines;
- Generation of reports and persistent artifacts for auditing and reuse.
4. Implemented Capabilities
4.1. Resources: Data and Context Access
- Vibration signals in common formats, with optional metadata;
- Technical documentation useful for extracting machine and component parameters to support diagnosis.
4.2. Tools
- Frequency Domain Analysis. Performs FFT with configurable options and automatically identifies dominant and harmonic components, producing peak and indicator tables useful for initial spectral characterization.
- Envelope Analysis for Rolling Bearing Defects. Implements band-pass filtering, envelope extraction using the Hilbert transform, and an envelope spectrum, allowing comparison between observed peaks and characteristic frequencies.
- Vibration Severity Assessment. Calculates the broadband normative indicator given by the RMS of the speed and determines the severity zone, making explicit the required assumptions such as the machine group and installation and support conditions [29].
- Anomaly Detection. Builds features through sliding windows, applies standardization and PCA, and trains classical models such as OC-SVM and LOF on healthy baselines. The output includes scores and anomaly labels by segment and aggregates that can be used as a screening module.
- Documentation Utility. Supports the extraction of identifiers and parameters from manuals and the calculation of characteristic defect frequencies from catalog data, which can be used as a reference for spectral interpretation (e.g., overlay on the envelope spectrum).
- Report Generation. For the main analyses, tools are available that generate interactive HTML reports (graphs, peak tables, markers) with execution metadata. The reports are designed as inspectable and reusable technical output, complementary to the textual summary produced by the assistant.
5. Use Cases and Workflows
5.1. Demonstration Data and Provenance
5.2. Use Case 1: FFT Analysis for Spectral Baseline
- “Generate FFT report for real_train/baseline_1.csv”.
5.3. Use Case 2: Envelope Analysis for BPFI and BPFO Identification
- “Generate envelope report for real_train/InnerRaceFault_vload_1.csv”
5.4. Use Case 3: Evaluation of Vibration Severity According to ISO 20816-3
- “Evaluate real_train/baseline_1.csv against ISO 20816-3”
5.5. Use Case 4: Complete Diagnosis via Guided Workflow
5.6. Use Case 5: Anomaly Detection on Features in the Time Domain
5.7. Use Case 6: Reading Manuals and Calculating Characteristic Frequencies
6. Discussion
- The demonstration package is limited and does not represent typical plant heterogeneity. Therefore, the workflows should be considered as reproducible examples and not as field-validated procedures.
- Analyses such as envelope analysis, peak picking, and anomaly detection depend on parameterized choices. In the PoC, these choices are explicit and traceable, but not yet standardized for machine families and failure modes.
- Extraction from manuals and catalogs, and the automatic derivation of characteristic frequencies, requires consolidation to avoid propagation of errors in spectral readings.
- The current architecture is oriented towards offline analysis and report generation. Extensions to streaming, real-time constraints, edge deployment, and integration with SCADA and CMMS require additional components.
- The use of LLMs in maintenance contexts requires caution because the outputs can influence operational decisions with impacts on people, assets, and production continuity. The system assumes that the LLM is solely for operator support. It coordinates deterministic tools and synthesizes computed results, while decision-making remains human [32]. In real-world deployments, additional checks and mitigations against prompt injection or untrusted input are necessary. Furthermore, ISO functions support computation and classification but are not a substitute for the correct selection of the machine class and measurement conditions [29].
7. Conclusions
Funding
Data Availability Statement
Conflicts of Interest
References
- Ali, M.I.; Lai, N.S.; Abdulla, R. Predictive maintenance of rotational machinery using deep learning. Int. J. Electr. Comput. Eng. (IJECE) 2024, 14, 1112. [Google Scholar] [CrossRef]
- Zhou, J.; Yang, J.; Qian, Q.; Qin, Y. A comprehensive survey of machine remaining useful life prediction approaches based on pattern recognition: Taxonomy and challenges. Meas. Sci. Technol. 2024, 35, 062001. [Google Scholar] [CrossRef]
- Randall, R.B. Vibration-Based Condition Monitoring: Industrial, Aerospace and Automotive Applications; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar] [CrossRef]
- Mikołajewska, E.; Mikołajewski, D.; Mikołajczyk, T.; Paczkowski, T. Generative AI in AI-Based Digital Twins for Fault Diagnosis for Predictive Maintenance in Industry 4.0/5.0. Appl. Sci. 2025, 15, 3166. [Google Scholar] [CrossRef]
- Ma, S.; Flanigan, K.A.; Bergés, M. State-of-the-Art Review: The Use of Digital Twins to Support Artificial Intelligence-Guided Predictive Maintenance. arXiv 2024. [Google Scholar] [CrossRef]
- Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of machine learning to machine fault diagnosis: A review and roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
- Surucu, O.; Gadsden, S.A.; Yawney, J. Condition Monitoring using Machine Learning: A Review of Theory, Applications, and Recent Advances. Expert Syst. Appl. 2023, 221, 119738. [Google Scholar] [CrossRef]
- Lei, Y.; Li, N.; Guo, L.; Li, N.; Yan, T.; Lin, J. Machinery health prognostics: A systematic review from data acquisition to RUL prediction. Mech. Syst. Signal Process. 2018, 104, 799–834. [Google Scholar] [CrossRef]
- Hendriks, J.; Dumond, P.; Knox, D. Towards better benchmarking using the CWRU bearing fault dataset. Mech. Syst. Signal Process. 2022, 169, 108732. [Google Scholar] [CrossRef]
- Farooq, U.; Ademola, M.; Shaalan, A. Comparative Analysis of Machine Learning Models for Predictive Maintenance of Ball Bearing Systems. Electronics 2024, 13, 438. [Google Scholar] [CrossRef]
- Giraudo, L.; Di Maggio, L.G.; Giorio, L.; Delprete, C. Dynamic Multibody Modeling of Spherical Roller Bearings with Localized Defects for Large-Scale Rotating Machinery. Sensors 2025, 25, 2419. [Google Scholar] [CrossRef]
- Alzubaidi, L.; Zhang, J.; Humaidi, A.J.; Al-Dujaili, A.; Duan, Y.; Al-Shamma, O.; Santamaría, J.; Fadhel, M.A.; Al-Amidie, M.; Farhan, L. Review of deep learning: Concepts, CNN architectures, challenges, applications, future directions. J. Big Data 2021, 8, 53. [Google Scholar] [CrossRef] [PubMed]
- Di Maggio, L.G. Intelligent Fault Diagnosis of Industrial Bearings Using Transfer Learning and CNNs Pre-Trained for Audio Classification. Sensors 2023, 23, 211. [Google Scholar] [CrossRef]
- Eang, C.; Lee, S. Predictive Maintenance and Fault Detection for Motor Drive Control Systems in Industrial Robots Using CNN-RNN-Based Observers. Sensors 2024, 25, 25. [Google Scholar] [CrossRef] [PubMed]
- Yang, X.; Fang, C.; Liao, Y.; Yang, J.; Gryllias, K.; Chronopoulos, D. Deep Generative Models in Condition and Structural Health Monitoring: Opportunities, Limitations and Future Outlook. arXiv 2025. [Google Scholar] [CrossRef]
- DI MAGGIO, L.G.; Brusa, E.; Delprete, C. Zero-Shot Generative AI for Rotating Machinery Fault Diagnosis: Synthesizing Highly Realistic Training Data via Cycle-Consistent Adversarial Networks. Appl. Sci. 2023, 13, 12458. [Google Scholar] [CrossRef]
- Brown, T.B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language Models are Few-Shot Learners. arXiv 2020, arXiv:2005.14165. [Google Scholar] [CrossRef]
- OpenAI. GPT-4 Technical Report. arXiv 2023. [Google Scholar] [CrossRef]
- Wu, J.; Gan, W.; Chen, Z.; Wan, S.; Yu, P.S. Multimodal Large Language Models: A Survey. arXiv 2023, arXiv:2311.13165. [Google Scholar] [CrossRef]
- Lynch, K.; Lorenzi, F.; Sheehan, J.; Kabakci-Zorlu, D.; Eck, B. FMEA Builder: Expert Guided Text Generation for Equipment Maintenance. arXiv 2024, arXiv:2411.05054. [Google Scholar] [CrossRef]
- Yigin, B.; Celik, M. A Prescriptive Model for Failure Analysis in Ship Machinery Monitoring Using Generative Adversarial Networks. J. Mar. Sci. Eng. 2024, 12, 493. [Google Scholar] [CrossRef]
- Reddicharla, N.; Ali, M.S. Innovating Oil and Gas Field Operations - Harnessing the Power of Generative Ai for Supporting Workforce Towards Achieving Autonomous Operations. In Proceedings of the ADIPEC, Abu Dhabi, United Arab Emirates, 4–7 November 2024; p. p. D011S020R005. [Google Scholar] [CrossRef]
- Lewis, P.; Perez, E.; Piktus, A.; Petroni, F.; Karpukhin, V.; Goyal, N.; Küttler, H.; Lewis, M.; Yih, W.t.; Rocktäschel, T.; et al. Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. In Proceedings of the Advances in Neural Information Processing Systems; Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M.F., Lin, H., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2020; Volume 33, pp. 9459–9474. [Google Scholar]
- Moran, G.E.; Aragam, B. Towards Interpretable Deep Generative Models via Causal Representation Learning. arXiv 2025. [Google Scholar] [CrossRef]
- Nuti, L.; Veneri, G.; Porciani, D.; Catalano, L.; Iasparro, D.; Lauriola, M.; Capanni, M.; Vitale, M. Agentic AI on Customer Dashboards in Remote Monitoring and Diagnostic Services. In Proceedings of the ADIPEC, Abu Dhabi, United Arab Emirates, 3–6 November 2025; p. D021S080R004. [Google Scholar] [CrossRef]
- Kolt, N. Governing AI Agents. arXiv 2025, arXiv:2501.07913. [Google Scholar] [CrossRef]
- Hou, X.; Zhao, Y.; Wang, S.; Wang, H. Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions. arXiv 2025. [Google Scholar] [CrossRef]
- Ray, P.P. A Survey on Model Context Protocol: Architecture, State-of-the-art, Challenges and Future Directions. TechRxiv 2025. [Google Scholar] [CrossRef]
- ISO 20816-3:2022(en); Mechanical vibration—Measurement and Evaluation of Machine Vibration Part 3: Industrial Machinery with a Power Rating Above 15 kW and Operating Speeds Between 120 r/min and 30 000 r/min. ISO: London, UK, 2022.
- Di Maggio, L.G.; Brusa, E.; Delprete, C. Novelty Detection in Rotating Machinery: Assessment of Unsupervised Machine Learning Models for Medium-Sized Industrial Bearings. In Proceedings of the 2025 International Conference on Control, Automation and Diagnosis (ICCAD), Barcelona, Spain, 1–3 July 2025; pp. 1–7. [Google Scholar] [CrossRef]
- Di Maggio, L.G. Predictive Maintenance MCP Server: An Open-Source Framework for Integrating Large Language Models with Predictive Maintenance and Fault Diagnosis Workflows; Zenodo: Geneva, Switzerland, 2025. [Google Scholar] [CrossRef]
- Di Maggio, L.G. Toward Autonomous LLM-Based AI Agents for Predictive Maintenance: State of the Art, Challenges, and Future Perspectives. Appl. Sci. 2025, 15, 11515. [Google Scholar] [CrossRef]
- Deng, H.; Namoano, B.; Zheng, B.; Khan, S.; Ahmet Erkoyuncu, J. From Prediction to Prescription: Large Language Model Agent for Context-Aware Maintenance Decision Support. PHM Soc. Eur. Conf. 2024, 8, 10. [Google Scholar] [CrossRef]
- Wang, Y.; Yu, Y.; Sun, K.; Lei, P.; Zhang, Y.; Zio, E.; Xia, A.; Li, Y. RmGPT: A Foundation Model With Generative Pre-trained Transformer for Fault Diagnosis and Prognosis in Rotating Machinery. IEEE Internet Things J. 2025, 12, 41562–41573. [Google Scholar] [CrossRef]
- Tao, L.; Liu, H.; Ning, G.; Cao, W.; Huang, B.; Lu, C. LLM-based Framework for Bearing Fault Diagnosis. arXiv 2024. [Google Scholar] [CrossRef]
- Qaid, H.A.A.M.; Zhang, B.; Li, D.; Ng, S.K.; Li, W. FD-LLM: Large Language Model for Fault Diagnosis of Machines. arXiv 2024. [Google Scholar] [CrossRef]
- He, H.; Huang, J.; Li, Q.; Wang, X.; Zhang, F.; Yang, K.; Meng, L.; Chu, F. MaintAGT:Sim2Real-Guided Multimodal Large Model for Intelligent Maintenance with Chain-of-Thought Reasoning. arXiv 2024. [Google Scholar] [CrossRef]
- Sun, Y.; Zhang, Q.; Bao, J.; Lu, Y.; Liu, S. Empowering digital twins with large language models for global temporal feature learning. J. Manuf. Syst. 2024, 74, 83–99. [Google Scholar] [CrossRef]
- Zhang, X.; Sun, W.; Chen, K.; Jiang, R. A multimodal expert system for the intelligent monitoring and maintenance of transformers enhanced by multimodal language large model fine-tuning and digital twins. IET Collab. Intell. Manuf. 2024, 6, e70007. [Google Scholar] [CrossRef]
- Ferdousi, R.; Hossain, M.A.; Yang, C.; Saddik, A.E. DefectTwin: When LLM Meets Digital Twin for Railway Defect Inspection. arXiv 2024. [Google Scholar] [CrossRef]
- Kok, I.; Demirci, O.; Ozdemir, S. When IoT Meet LLMs: Applications and Challenges. arXiv 2024. [Google Scholar] [CrossRef]
- Mahr, F.; Angeli, G.; Sindel, T.; Schmidt, K.; Franke, J. A Reference Architecture for Deploying Large Language Model Applications in Industrial Environments. In Proceedings of the 2024 IEEE 30th International Symposium for Design and Technology in Electronic Packaging (SIITME), Sibiu, Romania, 16–18 October 2024; pp. 19–23. [Google Scholar] [CrossRef]
- Friha, O.; Amine Ferrag, M.; Kantarci, B.; Cakmak, B.; Ozgun, A.; Ghoualmi-Zine, N. LLM-Based Edge Intelligence: A Comprehensive Survey on Architectures, Applications, Security and Trustworthiness. IEEE Open J. Commun. Soc. 2024, 5, 5799–5856. [Google Scholar] [CrossRef]
- Dashdamirli, N. Integration of Large Language Models with microcontrollers to optimize industrial automation processes. InterConf 2024, 46, 500–507. [Google Scholar] [CrossRef]
- Smith, W.A.; Randall, R.B. Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study. Mech. Syst. Signal Process. 2015, 64–65, 100–131. [Google Scholar] [CrossRef]
- Brusa, E.; Bruzzone, F.; Delprete, C.; Di Maggio, L.G.; Rosso, C. Health indicators construction for damage level assessment in bearing diagnostics: A proposal of an energetic approach based on envelope analysis. Appl. Sci. 2020, 10, 8131. [Google Scholar] [CrossRef]
- Abboud, D.; Antoni, J.; Sieg-Zieba, S.; Eltabach, M. Envelope analysis of rotating machine vibrations in variable speed conditions: A comprehensive treatment. Mech. Syst. Signal Process. 2017, 84, 200–226. [Google Scholar] [CrossRef]
- Brusa, E.; Delprete, C.; DI MAGGIO, L.G. Eigen-spectrograms: An interpretable feature space for bearing fault diagnosis based on artificial intelligence and image processing. Mech. Adv. Mater. Struct. 2022, 33, 4639–4651. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Breunig, M.M.; Kriegel, H.P.; Ng, R.T.; Sander, J. LOF: Identifying density-based local outliers. In Proceedings of the Proceedings of the 2000 ACM SIGMOD international conference on Management of data, Dallas TX, USA, 16–18 May 2000; pp. 93–104. [Google Scholar] [CrossRef]
- The Mathworks Inc. Rolling Element Bearing Fault Diagnosis Dataset. GitHub Repository. 2023. Available online: https://github.com/mathworks/RollingElementBearingFaultDiagnosis-Data (accessed on 28 January 2026).





| Topic | Key Points | Representative Sources |
|---|---|---|
| Enterprise LLM assistants for maintenance decision support | Pros: Natural language access to procedures, maintenance records, and operational context; reported productivity gains in an internal study; conversational access can streamline dashboard workflows. Cons: Risk of incorrect and untrustworthy answers on uncontextualized industrial data; strong dependence on data quality and quantity and secure enterprise integration. | [22,25] |
| Context-aware, tool-using prescriptive agents | Pros: Agentic LLM for context-aware maintenance decision support via tool usage; case study shows fault detection without extra training or fine-tuning with performance comparable to baselines. Cons: The prediction-to-prescription transition is still described as not yet mature. | [33] |
| Domain-specialized LLMs for PdM | Pros: Foundation models and adaptation pipelines for rotating machinery show few-shot, cross-condition and cross-dataset potential; approaches include feature textualization or numeric encoding; a multimodal maintenance LLM reports improved accuracy vs generalist LLMs in curated tests. Cons: Requires domain data and dedicated signal-to-text or encoding pipelines; evaluation is often performed on constructed datasets and conditions. | [34,35,36,37] |
| Digital Twins (DT) based on Generative AI for diagnosis and PdM | Pros: Generative AI supports DT-based simulation and synthetic data generation; DT and LLM integration can enhance temporal feature learning and multimodal reasoning in industrial scenarios; DT and LLM pipelines are proposed for inspection tasks with good reported performance in case studies. Cons: Challenges highlighted include computational complexity and data security; DT maturity gaps and missing components for fully automated PdM remain. | [4,5,38,39,40] |
| LLM-assisted documentation for maintenance (e.g., FMEA drafting) | Pros: Expert-guided generation of structured maintenance artifacts; empirical analysis reports that models can generate a substantial portion of key FMEA content. Cons: Requires expert-in-the-loop supervision for correctness and completeness. | [20] |
| Governance, safety, and deployment architectures | Pros: Governing frameworks frame delegation, oversight and liability risks for AI agents; surveys and reference architectures discuss industrial deployment along edge–fog–cloud and trust aspects; edge constraints and challenges are explicitly discussed. Cons: Works emphasize governance gaps and the need for new technical infrastructure; recurring constraints include resource limits, security, and operational deployment complexity. | [26,41,42,43,44] |
| Capability | Status | Output |
|---|---|---|
| FFT analysis with peak identification | Implemented | HTML report, peak tables |
| Envelope analysis for bearings | Implemented | Envelope spectrum report |
| ISO 20816-3 severity assessment | Implemented | Severity zone report |
| Feature-based anomaly detection (OC-SVM/LOF) | Implemented | Scores, report |
| Manual parsing and bearing catalog integration | Prototype | Demonstrative utilities |
| Streaming and edge deployment and SCADA and CMMS integration | Planned | Not in current PoC |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Di Maggio, L.G. Predictive Maintenance MCP: An Open-Source Framework for Bridging Large Language Models and Industrial Condition Monitoring via the Model Context Protocol. Appl. Sci. 2026, 16, 2812. https://doi.org/10.3390/app16062812
Di Maggio LG. Predictive Maintenance MCP: An Open-Source Framework for Bridging Large Language Models and Industrial Condition Monitoring via the Model Context Protocol. Applied Sciences. 2026; 16(6):2812. https://doi.org/10.3390/app16062812
Chicago/Turabian StyleDi Maggio, Luigi Gianpio. 2026. "Predictive Maintenance MCP: An Open-Source Framework for Bridging Large Language Models and Industrial Condition Monitoring via the Model Context Protocol" Applied Sciences 16, no. 6: 2812. https://doi.org/10.3390/app16062812
APA StyleDi Maggio, L. G. (2026). Predictive Maintenance MCP: An Open-Source Framework for Bridging Large Language Models and Industrial Condition Monitoring via the Model Context Protocol. Applied Sciences, 16(6), 2812. https://doi.org/10.3390/app16062812
