You are currently viewing a new version of our website. To view the old version click .
Machines
  • This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
  • Article
  • Open Access

11 November 2025

Chain-Based Outlier Detection: Interpretable Theories and Methods for Complex Data Scenarios †

,
,
,
and
1
School of Artificial Intelligence, Beijing Normal University, Beijing 100875, China
2
Guangdong Provincial/Zhuhai Key Laboratory of IRADS, and Department of Computer Science, BNU-HKBU United International College, Zhuhai 519087, China
3
Hong Kong Baptist University, Hong Kong
4
School of Mechanical Engineering, Wuhan University of Science and Technology, Wuhan 430081, China
This article belongs to the Section Machines Testing and Maintenance

Abstract

Outlier detection is a critical task in the intelligent operation and maintenance (O&M) of transportation equipment, as it helps ensure the safety and reliability of systems like high-speed trains, aircraft, and intelligent vehicles. Nearest neighbor-based detectors generally offer good interpretability, but often struggle with complex data scenarios involving diverse data distributions and various types of outliers, including local, global, and cluster-based outliers. Moreover, these methods typically rely on predefined contamination, which is a critical parameter that directly determines detection accuracy and can significantly impact system reliability in O&M environments. In this paper, we propose a novel chain-based theory for outlier detection with the aim to provide an interpretable and transparent solution for fault detection. We introduce two methods based on this theory: Cascaded Chain Outlier Detection (CCOD) and Parallel Chain Outlier Detection (PCOD). Both methods identify outliers through sudden increases in chaining distances, with CCOD being more sensitive to local data distributions, while PCOD offers higher computational efficiency. Experimental results on synthetic and real-world datasets demonstrate the superior performance of our methods compared to existing state-of-the-art techniques, with average improvements of 11.3% for CCOD and 14.5% for PCOD.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.