Skip to Content

3,555 Results Found

  • Article
  • Open Access
17 Citations
4,693 Views
21 Pages

A Probabilistic Transformation of Distance-Based Outliers

  • David Muhr,
  • Michael Affenzeller and
  • Josef Küng

The scores of distance-based outlier detection methods are difficult to interpret, and it is challenging to determine a suitable cut-off threshold between normal and outlier data points without additional context. We describe a generic transformation...

  • Article
  • Open Access
28 Citations
4,504 Views
16 Pages

How the Outliers Influence the Quality of Clustering?

  • Agnieszka Nowak-Brzezińska and
  • Igor Gaibei

30 June 2022

In this article, we evaluate the efficiency and performance of two clustering algorithms: AHC (Agglomerative Hierarchical Clustering) and KMeans. We are aware that there are various linkage options and distance measures that influence the clus...

  • Article
  • Open Access
15 Citations
5,096 Views
29 Pages

27 October 2022

Trajectory outlier detection is one of the fundamental data mining techniques used to analyze the trajectory data of the Global Positioning System. A comprehensive literature review of trajectory outlier detectors published between 2000 and 2022 led...

  • Article
  • Open Access
1 Citations
433 Views
20 Pages

4 December 2025

In data mining and exploration, outliers are specific and infrequent data that require special attention, as they may reveal potentially hazardous information. Detecting outliers can support, e.g., identification fraudulent credit card usage or unaut...

  • Article
  • Open Access
2 Citations
3,522 Views
20 Pages

19 January 2023

Outliers often occur during data collection, which could impact the result seriously and lead to a large inference error; therefore, it is important to detect outliers before data analysis. Gamma distribution is a popular distribution in statistics;...

  • Article
  • Open Access
7 Citations
5,575 Views
27 Pages

Qualitative Data Clustering to Detect Outliers

  • Agnieszka Nowak-Brzezińska and
  • Weronika Łazarz

7 July 2021

Detecting outliers is a widely studied problem in many disciplines, including statistics, data mining, and machine learning. All anomaly detection activities are aimed at identifying cases of unusual behavior compared to most observations. There are...

  • Article
  • Open Access
8 Citations
4,546 Views
27 Pages

Exploration of Outliers in If-Then Rule-Based Knowledge Bases

  • Agnieszka Nowak-Brzezińska and
  • Czesław Horyń

29 September 2020

The article presents both methods of clustering and outlier detection in complex data, such as rule-based knowledge bases. What distinguishes this work from others is, first, the application of clustering algorithms to rules in domain knowledge bases...

  • Article
  • Open Access
15 Citations
3,834 Views
21 Pages

k-Means+++: Outliers-Resistant Clustering

  • Adiel Statman,
  • Liat Rozenberg and
  • Dan Feldman

27 November 2020

The k-means problem is to compute a set of k centers (points) that minimizes the sum of squared distances to a given set of n points in a metric space. Arguably, the most common algorithm to solve it is k-means++ which is easy to implement and provid...

  • Article
  • Open Access
4 Citations
5,207 Views
32 Pages

Outliers in Semi-Parametric Estimation of Treatment Effects

  • Gustavo Canavire-Bacarreza,
  • Luis Castro Peñarrieta and
  • Darwin Ugarte Ontiveros

Outliers can be particularly hard to detect, creating bias and inconsistency in the semi-parametric estimates. In this paper, we use Monte Carlo simulations to demonstrate that semi-parametric methods, such as matching, are biased in the presence of...

  • Communication
  • Open Access
28 Citations
3,870 Views
7 Pages

We examine the presence of outliers and time-varying jumps in the returns of four major cryptocurrencies (Bitcoin, Ethereum, Ripple, Dogecoin, Litecoin), and a broad cryptocurrency index (CCI30). The results indicate that only Bitcoin returns are con...

  • Article
  • Open Access
3,287 Views
19 Pages

Filtering Link Outliers in Vehicle Trajectories by Spatial Reasoning

  • Junli Liu,
  • Miaomiao Pan,
  • Xianfeng Song,
  • Jing Wang,
  • Kemin Zhu,
  • Runkui Li,
  • Xiaoping Rui,
  • Weifeng Wang,
  • Jinghao Hu and
  • Venkatesh Raghavan

Vehicle trajectories derived from Global Navigation Satellite Systems (GNSS) are used in various traffic applications based on trajectory quality analysis for the development of successful traffic models. A trajectory consists of points and links tha...

  • Article
  • Open Access
7 Citations
1,758 Views
24 Pages

Empirical Evaluation of the Relative Range for Detecting Outliers

  • Dania Dallah,
  • Hana Sulieman,
  • Ayman Al Zaatreh and
  • Firuz Kamalov

7 July 2025

Outlier detection plays a key role in data analysis by improving data quality, uncovering data entry errors, and spotting unusual patterns, such as fraudulent activities. Choosing the right detection method is essential, as some approaches may be too...

  • Article
  • Open Access
20 Citations
5,118 Views
30 Pages

Cumulative Sum Chart Modeled under the Presence of Outliers

  • Nasir Abbas,
  • Mu’azu Ramat Abujiya,
  • Muhammad Riaz and
  • Tahir Mahmood

18 February 2020

Cumulative sum control charts that are based on the estimated control limits are extensively used in practice. Such control limits are often characterized by a Phase I estimation error. The presence of these errors can cause a change in the location...

  • Article
  • Open Access
11 Citations
4,961 Views
14 Pages

An MEF-Based Localization Algorithm against Outliers in Wireless Sensor Networks

  • Dandan Wang,
  • Jiangwen Wan,
  • Meimei Wang and
  • Qiang Zhang

7 July 2016

Precise localization has attracted considerable interest in Wireless Sensor Networks (WSNs) localization systems. Due to the internal or external disturbance, the existence of the outliers, including both the distance outliers and the anchor outliers...

  • Article
  • Open Access
253 Views
29 Pages

22 February 2026

Multicollinearity and outliers are common challenges in multiple linear regression, often adversely affecting the properties of least squares estimators. To address these issues, several robust estimators have been developed to handle multicollineari...

  • Article
  • Open Access
2 Citations
3,025 Views
12 Pages

Cross-Validation for Lower Rank Matrices Containing Outliers

  • Sergio Arciniegas-Alarcón,
  • Marisol García-Peña and
  • Wojtek J. Krzanowski

Several statistical techniques for analyzing data matrices use lower rank approximations to these matrices, for which, in general, the appropriate rank must first be estimated depending on the objective of the study. The estimation can be conducted b...

  • Article
  • Open Access
612 Views
20 Pages

20 June 2025

Both supervised and unsupervised machine learning algorithms are often based on regression to the mean. However, the mean can easily be biased by unevenly distributed data, i.e., outlier records. Batch normalization methods address this problem to so...

  • Article
  • Open Access
2 Citations
3,221 Views
19 Pages

Generating Alerts from Breathing Pattern Outliers

  • Chloé Benmussa,
  • Jessica R. Cauchard and
  • Zohar Yakhini

22 August 2022

Analysing human physiological data allows access to the health state and the state of mind of the subject individual. Whenever a person is sick, having a panic attack, happy or scared, physiological signals will be different. In terms of physiologica...

  • Communication
  • Open Access
4 Citations
2,120 Views
9 Pages

A Study of Outliers in GNSS Clock Products

  • Kamil Maciuk,
  • Inese Varna and
  • Karolina Krzykowska-Piotrowska

25 January 2024

Time is an extremely important element in the field of GNSS positioning. In precise positioning with a single-centimetre accuracy, satellite clock corrections are used. In this article, the longest available data set of satellite clock corrections of...

  • Article
  • Open Access
3 Citations
1,844 Views
12 Pages

19 September 2022

The interest in large or extreme outliers in arrays of empirical information is caused by the wishes of users (with whom the author worked): specialists in medical and zoo geography, mining, the application of meteorology in fishing tasks, etc. The f...

  • Article
  • Open Access
2 Citations
3,960 Views
15 Pages

7 February 2019

This paper presents a method for outlier detection in structured music corpora. Given a music collection organised into groups of songs, the method discovers contrast patterns which are significantly infrequent in a group. Discovered patterns identif...

  • Article
  • Open Access
4 Citations
2,486 Views
17 Pages

21 December 2022

The airborne array position and orientation measurement system (array POS) is a key device for high-resolution multi-dimensional real-time imaging motion compensation of military reconnaissance mapping. Abnormal values will appear in array POS inerti...

  • Article
  • Open Access
6 Citations
4,919 Views
23 Pages

4 November 2021

High-resolution point cloud data acquired with a laser scanner from any platform contain random noise and outliers. Therefore, outlier detection in LiDAR data is often necessary prior to analysis. Applications in agriculture are particularly challeng...

  • Article
  • Open Access
430 Views
24 Pages

COVAS: Highlighting the Importance of Outliers in Classification Through Explainable AI

  • Sebastian Roth,
  • Adrien Cerrito,
  • Samuel Orth,
  • Ulrich Hartmann and
  • Daniel Friemert

Understanding the decision-making behavior of machine learning models is essential in domains where individual predictions matter, such as medical diagnosis or sports analytics. While explainable artificial intelligence (XAI) methods such as SHAP pro...

  • Article
  • Open Access
11 Citations
6,836 Views
16 Pages

Trend-Residual Dual Modeling for Detection of Outliers in Low-Cost GPS Trajectories

  • Xiaojian Chen,
  • Tingting Cui,
  • Jianhong Fu,
  • Jianwei Peng and
  • Jie Shan

1 December 2016

Low-cost GPS (receiver) has become a ubiquitous and integral part of our daily life. Despite noticeable advantages such as being cheap, small, light, and easy to use, its limited positioning accuracy devalues and hampers its wide applications for rel...

  • Article
  • Open Access
14 Citations
5,396 Views
10 Pages

A New Methodology Based on Imbalanced Classification for Predicting Outliers in Electricity Demand Time Series

  • Francisco Javier Duque-Pintor,
  • Manuel Jesús Fernández-Gómez,
  • Alicia Troncoso and
  • Francisco Martínez-Álvarez

14 September 2016

The occurrence of outliers in real-world phenomena is quite usual. If these anomalous data are not properly treated, unreliable models can be generated. Many approaches in the literature are focused on a posteriori detection of outliers. However, a n...

  • Article
  • Open Access
3 Citations
2,031 Views
13 Pages

Research on the Self-Repairing Model of Outliers in Energy Data Based on Regional Convergence

  • Nan Li,
  • Xunwen Zhao,
  • Hailin Mu,
  • Yimeng Li,
  • Jingru Pang,
  • Yuqing Jiang,
  • Xin Jin and
  • Zhenwei Pei

18 September 2020

The need for the statistical stability of data is increasing nowadays as the data resource has become a more and more important production factor. In this study, a set of general identification and correction models are established for data outlier m...

  • Article
  • Open Access
3 Citations
3,185 Views
15 Pages

3 October 2024

Detecting anomalies in engine and machinery data during ship operations is crucial for maintaining the safety and efficiency of the vessel. We conducted experiments using device data from the maritime industry, consisting of time series records from...

  • Article
  • Open Access
670 Views
29 Pages

17 December 2025

In the presence of multicollinearity, the ordinary least squares (OLS) estimators, aside from BLUE (best linear unbiased estimator), lose efficiency and fail to achieve minimum variance. In addition, these estimators are highly sensitive to outliers...

  • Article
  • Open Access
25 Citations
4,384 Views
21 Pages

A New Closed-Form Solution for Acoustic Emission Source Location in the Presence of Outliers

  • Zilong Zhou,
  • Yichao Rui,
  • Jing Zhou,
  • Longjun Dong,
  • Lianjun Chen,
  • Xin Cai and
  • Ruishan Cheng

8 June 2018

The accuracy of an acoustic emission (AE) source location is always corrupted by outliers due to the complexity of engineering practice. To this end, a preconditioned closed-form solution based on weight estimation (PCFWE) is proposed in this study....

  • Feature Paper
  • Article
  • Open Access
149 Citations
13,580 Views
15 Pages

The Internet of Things (IoT) has gained significant recognition to become a novel sensing paradigm to interact with the physical world in this Industry 4.0 era. The IoTs are being used in many diverse applications that are part of our life and is gro...

  • Article
  • Open Access
23 Citations
2,654 Views
21 Pages

Observability-Constrained Resampling-Free Cubature Kalman Filter for GNSS/INS with Measurement Outliers

  • Bingbo Cui,
  • Wu Chen,
  • Duojie Weng,
  • Xinhua Wei,
  • Zeyu Sun,
  • Yan Zhao and
  • Yufei Liu

18 September 2023

Integrating global navigation satellite systems (GNSSs) with inertial navigation systems (INSs) has been widely recognized as an ideal solution for autonomous vehicle navigation. However, GNSSs suffer from disturbances and signal blocking inevitably,...

  • Review
  • Open Access
34 Citations
5,488 Views
30 Pages

15 August 2020

Due to the significant increase of the number of wind-based electricity generation systems, it is important to have accurate information on their operational characteristics, which are typically obtained by processing large amounts of measurements fr...

  • Article
  • Open Access
1 Citations
2,559 Views
12 Pages

7 May 2022

Lifelogs are generated in our daily lives and contain useful information for health monitoring. Nowadays, one can easily obtain various lifelogs from a wearable device such as a smartwatch. These lifelogs could include noise and outliers. In general,...

  • Article
  • Open Access
8 Citations
5,580 Views
26 Pages

k-Center Clustering with Outliers in Sliding Windows

  • Paolo Pellizzoni,
  • Andrea Pietracaprina and
  • Geppino Pucci

31 January 2022

Metric k-center clustering is a fundamental unsupervised learning primitive. Although widely used, this primitive is heavily affected by noise in the data, so a more sensible variant seeks for the best solution that disregards a given number z of poi...

  • Article
  • Open Access
3 Citations
4,002 Views
13 Pages

Decomposition of Dynamical Signals into Jumps, Oscillatory Patterns, and Possible Outliers

  • Elena Barton,
  • Basad Al-Sarray,
  • Stéphane Chrétien and
  • Kavya Jagan

16 July 2018

In this note, we present a component-wise algorithm combining several recent ideas from signal processing for simultaneous piecewise constants trend, seasonality, outliers, and noise decomposition of dynamical time series. Our approach is entirely ba...

  • Article
  • Open Access
2 Citations
2,768 Views
18 Pages

15 October 2019

Here we propose an online method to explore the multiway nature of urban spaces data for outlier detection based on higher-order singular value tensor decomposition. Our proposal has two sequential steps: (i) the offline modeling step, where we model...

  • Article
  • Open Access
1,447 Views
15 Pages

In modern fishery management, fishing vessel trajectory data are used to monitor and analyze fishing vessel activities. However, trajectory data are often of low quality, probably due to environmental factors, equipment failures, signal loss and oper...

  • Proceeding Paper
  • Open Access
1 Citations
1,442 Views
10 Pages

Most real time series exhibit certain characteristics that make the choice of model and its specification difficult. The objective of this study is to address the problem of parameter estimation and the accuracy of forecasts k-steps ahead in non-stat...

  • Article
  • Open Access
3 Citations
2,866 Views
26 Pages

20 October 2023

The moderate resolution imaging spectroradiometer (MODIS) calculates the leaf area index (LAI) for each pixel without incorporating the temporal correlation information, leading to a higher sensitivity for the LAI that produces uncertainties in obser...

  • Article
  • Open Access
18 Citations
5,828 Views
17 Pages

Outliers Detection Models in Shewhart Control Charts; an Application in Photolithography: A Semiconductor Manufacturing Industry

  • Ishaq Adeyanju Raji,
  • Muhammad Hisyam Lee,
  • Muhammad Riaz,
  • Mu’azu Ramat Abujiya and
  • Nasir Abbas

Shewhart control charts with estimated control limits are widely used in practice. However, the estimated control limits are often affected by phase-I estimation errors. These estimation errors arise due to variation in the practitioner’s choic...

  • Article
  • Open Access
8 Citations
4,146 Views
46 Pages

18 December 2023

This article investigates the applicability domain (AD) of machine learning (ML) models trained on high-dimensional data, for the prediction of the ideal gas enthalpy of formation and entropy of molecules via descriptors. The AD is crucial as it desc...

  • Article
  • Open Access
2 Citations
2,154 Views
9 Pages

Can Machine Learning Algorithms Contribute to the Initial Screening of Hip Prostheses and Early Identification of Outliers?

  • Khashayar Ghadirinejad,
  • Stephen Graves,
  • Richard de Steiger,
  • Nicole Pratt,
  • Lucian B. Solomon,
  • Mark Taylor and
  • Reza Hashemi

26 June 2024

Registries have significant roles in assessing the comparative performance of devices. Ideally, early identification of outliers should use a time-to-event outcome while reducing the confounding effects of other components in the device and patient c...

  • Article
  • Open Access
14 Citations
5,069 Views
28 Pages

Handling Multicollinearity and Outliers in Logistic Regression Using the Robust Kibria–Lukman Estimator

  • Adewale F. Lukman,
  • Suleiman Mohammed,
  • Olalekan Olaluwoye and
  • Rasha A. Farghali

30 December 2024

Logistic regression models encounter challenges with correlated predictors and influential outliers. This study integrates robust estimators, including the Bianco–Yohai estimator (BY) and conditionally unbiased bounded influence estimator (CE),...

  • Article
  • Open Access
1 Citations
714 Views
15 Pages

29 April 2025

This paper concerns a class of robust factorization models of low-rank matrix recovery, which have been widely applied in various fields such as machine learning and imaging sciences. An 1-loss robust factorized model incorporating the ...

  • Article
  • Open Access
24 Citations
6,311 Views
17 Pages

Time-Pattern Profiling from Smart Meter Data to Detect Outliers in Energy Consumption

  • William Hurst,
  • Casimiro A. Curbelo Montañez and
  • Nathan Shone

3 September 2020

Smart meters have become a core part of the Internet of Things, and its sensory network is increasing globally. For example, in the UK there are over 15 million smart meters operating across homes and businesses. One of the main advantages of the sma...

  • Article
  • Open Access
17 Citations
3,337 Views
19 Pages

Detection of Outliers in Time Series Power Data Based on Prediction Errors

  • Changzhi Li,
  • Dandan Liu,
  • Mao Wang,
  • Hanlin Wang and
  • Shuai Xu

4 January 2023

The primary focus of smart grid power analysis is on power load forecasting and data anomaly detection. Efficient and accurate power load prediction and data anomaly detection enable energy companies to develop reasonable production and scheduling pl...

  • Article
  • Open Access
18 Citations
4,567 Views
23 Pages

4 April 2018

In multi-target tracking, the outliers-corrupted process and measurement noises can reduce the performance of the probability hypothesis density (PHD) filter severely. To solve the problem, this paper proposed a novel PHD filter, called Student’s t m...

  • Article
  • Open Access
4 Citations
3,413 Views
18 Pages

6 April 2021

In this paper, we propose a general method to detect outliers from contaminated estimates of various image estimation applications. The method does not require any prior knowledge about the purpose, theory or hardware of the application but simply re...

  • Article
  • Open Access
25 Citations
11,662 Views
14 Pages

In order to detect outliers in temperature time series data for improving data quality and decision-making quality related to design and operation, we proposed an algorithm based on sliding window prediction. Firstly, the time series are segmented ba...

of 72