- Article
Pointwise Sharp Moderate Deviations for a Kernel Density Estimator
- Siyu Liu,
- Xiequan Fan,
- Haijuan Hu and
- Paul Doukhan
Let
Let
The nature of the kernel density estimator (KDE) is to find the underlying probability density function (p.d.f) for a given dataset. The key to training the KDE is to determine the optimal bandwidth or Parzen window. All the data points share a fixed...
Kernel smoothers are often used in Lagrangian particle dispersion simulations to estimate the concentration distribution of tracer gasses, pollutants etc. Their main disadvantage is that they suffer from the curse of dimensionality, i.e., they conver...
This paper studies the asymptotic normality for the kernel deconvolution estimator when the noise distribution is logarithmic chi-square; both identical and independently distributed observations and strong mixing observations are considered. The dep...
The weathered bedrock aquifer in the Jurassic coalfield of northern Shaanxi Province is a direct water-bearing aquifer, and accurately predicting its water-bearing properties is essential for preventing and controlling water hazards in mining operati...
This paper develops a method to obtain multivariate kernel functions for density estimation problems in which the density function is defined on compact support. If domain-specific knowledge requires certain conditions to be satisfied at the boundary...
The management of individual weights in broiler farming is not only crucial for increasing farm income but also directly linked to the revenue growth of integrated broiler companies, necessitating prompt resolution. This paper proposes a model to est...
Bright beams are commonly represented by sampled data in the numerical algorithms used to simulate their properties. However, in these calculations and the analyses of their outputs, the beam’s density is sometimes required and must be calculat...
This paper studies probability density estimation on the Siegel space. The Siegel space is a generalization of the hyperbolic space. Its Riemannian metric provides an interesting structure to the Toeplitz block Toeplitz matrices that appear in the co...
Conventional Euler deconvolution is widely used for interpreting profile, grid, and ungridded potential field data. The Tensor Euler deconvolution applies additional constraints to the Euler solution using all gravity vectors and the full gravity gra...
There are three main problems for classical kernel density estimation in its application: boundary problem, over-smoothing problem of high (low)-density region and low-efficiency problem of large samples. A new improved model of multivariate adaptive...
We present an unsupervised method to detect anomalous time series among a collection of time series. To do so, we extend traditional Kernel Density Estimation for estimating probability distributions in Euclidean space to Hilbert spaces. The estimate...
Kernel density estimation (KDE) is widely adopted to show the overall crime distribution and at the same time obscure exact crime locations due to the confidentiality of crime data in many countries. However, the confidential level of crime locationa...
A comprehensive and accurate wind power forecast assists in reducing the operational risk of wind power generation, improves the safety and stability of the power system, and maintains the balance of wind power generation. Herein, a hybrid wind power...
Randomized response models aim to protect respondent privacy when sampling sensitive variables but consequently compromise estimator efficiency. We propose a new sampling method, titled joint scrambling, which preserves all true responses while prote...
At present, the total length of accident blackspot accounts for 0.25% of the total length of the road network, while the total number of accidents that occurred at accident black spots accounts for 25% of the total number of accidents on the road net...
This paper proposes a wind power probabilistic model (WPPM) using the reflection method and multi-kernel function kernel density estimation (KDE). With the increasing penetration of renewable energy sources (RESs) into power systems, several probabil...
The uncertainty of wind power brings many challenges to the operation and control of power systems, especially for the joint operation of multiple wind farms. Therefore, the study of the joint probability density function (JPDF) of multiple wind farm...
In this paper, a pixel-based background modeling method, which uses nonparametric kernel density estimation, is proposed. To reduce the burden of image storage, we modify the original KDE method by using the first frame to initialize it and update it...
This paper presents a development of a statistical model of typhoon genesis, tracks based on kernel density estimation and a generalized additive model (GAM). Modeling of typhoon activity is ultimately beneficial to the people living in coastal zones...
The navigational safety of ships on waterways plays a crucial role in ensuring the operational efficiency of ports. Ship anomalous behavior detection is an important method of water traffic surveillance that can effectively identify abnormal ship beh...
Fault monitoring is often employed for the secure functioning of industrial systems. To assess performance and enhance product quality, statistical process control (SPC) charts such as Shewhart, CUSUM, and EWMA statistics have historically been utili...
Bipolar disorder is a severe mood disorder and is one of the top 20 causes of disability in the world. Although there have been numerous studies based on machine learning models for the detection of bipolar disorder patients, these works have limitat...
With the development of civil aviation in China, airspace congestion has become more and more serious and has gradually spread from airport terminal areas to en route networks. Traditionally, most prediction methods that obtain traffic flow data are...
Based on quantile regression (QR) and kernel density estimation (KDE), a framework for probability density forecasting of short-term wind speed is proposed in this study. The empirical mode decomposition (EMD) technique is implemented to reduce the n...
Accurate broiler weight estimation in commercial farms is hindered by noisy scale data and multi-broiler occupancy. To address this challenge, we propose a KDE-based framework enhanced with systematic preprocessing, including coefficient of variation...
An urban, commercial central district is often regarded as the heart of a city. Therefore, quantitative research on commercial central districts plays an important role when studying the development and evaluation of urban spatial layouts. However, c...
In this paper, we present the kernel density estimation (KDE)-based parallelized ultrasound entropy imaging and apply it for hepatic steatosis characterization. A KDE technique was used to estimate the probability density function (PDF) of ultrasound...
The current methods of classifying plant disease images are mainly affected by the training phase and the characteristics of the target dataset. Collecting plant samples during different leaf life cycle infection stages is time-consuming. However, th...
Accurate assessment and prediction of water shortage risk are essential prerequisites for the rational allocation and risk management of water resources. However, previous water shortage risk assessment models based on copulas have strict requirement...
30 December 2010
Concentrations of four heavy metals (Cr, Cu, Ni, and Zn) were measured at 1,082 sampling sites in Changhua county of central Taiwan. A hazard zone is defined in the study as a place where the content of each heavy metal exceeds the corresponding cont...
Efficiency in the operation of distribution networks is one of the commonly recognised goals of the Smart Grid aspect. Novel approaches are needed to assess the level of energy loss and reliability in electricity distribution. Transmission of electri...
Numerous studies have utilized remote sensing techniques to analyze seismic data in active areas. Point density techniques, widely used in remote sensing, examine the spatial distribution of point clouds related to specific variables. Applying these...
The accurate modeling of the charging behaviors for electric vehicles (EVs) is the basis for the charging load modeling, the charging impact on the power grid, orderly charging strategy, and planning of charging facilities. Therefore, an accurate joi...
Kernel density estimation (KDE) is a commonly used method for spatial point pattern analysis, but it is computationally demanding when analyzing large datasets. GPU-based parallel computing has been adopted to address such computational challenges. T...
Few-shot image classification aims to recognize novel classes from only a handful of labeled examples, a challenge in domains where data collection is costly or impractical. Existing solutions often rely on meta learning, fine tuning, or data augment...
Reliable and accurate daily runoff predictions are critical to water resource management and planning. Probability density predictions of daily runoff can provide decision-makers with comprehensive information by quantifying the uncertainty of foreca...
Taxi mobility data plays an important role in understanding urban mobility in the context of urban traffic. Specifically, the taxi is an important part of urban transportation, and taxi trips reflect human behaviors and mobility patterns, allowing us...
Long-term monitoring of radon (222Rn) and thoron (220Rn) radioactive gases has been used in earthquake forecasting. Seismic activity before earthquakes raises the levels of these gases, causing abnormalities in the baseline values of radon and thoron...
A novel ensemble algorithm based on kernel density estimation (KDE) is proposed to forecast distributed generation (DG) from renewable energy sources (RES). The proposed method relies solely on publicly available historical input variables (e.g., met...
Incipient fault detection in a hydraulic system is a challenge in the condition monitoring community. Existing research mainly monitors abnormal working conditions in hydraulic systems by separately detecting the key working parameter, which often ca...
A maritime route is used by sea transportation vessels to access the trading ports, and route design standards for the safety of maritime traffic have been established in various countries and organizations. However, no quantitative safety verificati...
With the continued social and economic development of northern China, landscape fragmentation has placed increasing pressure on the ecological system of the Beijing-Tianjin-Hebei (BTH) region. To maintain the integrity of ecological processes under t...
In the cities of post-industrialized countries, renovation is the main part of building construction activity and has a major urban impact. Measuring this ongoing phenomenon and its distribution is of great usefulness for municipality urban planning...
This study addresses the problems of multi-source data redundancy, insufficient feature capture timing, and delayed risk warning in the prediction of gas concentration in fully mechanized coal-mining operations by constructing a three-pronged technic...
Optical music recognition (OMR) refers to converting musical scores into digitized information using electronics. In recent years, few types of OMR research have involved numbered musical notation (NMN). The existing NMN recognition algorithm is diff...
Current research on automated driving systems focuses on Level 4 automated driving (AD) in specific operational design Domains (ODD). Measurement data from customer fleet operation are commonly used to extract scenarios and ODD features (road infrast...
Convolutional neural networks (CNN) have achieved excellent results in the field of image recognition that classifies objects in images. A typical CNN consists of a deep architecture that uses a large number of weights and layers to achieve high perf...
This paper introduces a new real-time method based on a combination of kernel density estimators and pyramid histogram of oriented gradients for identifying a point of interest along the stem of seedlings suitable for stem–stake coupling, also...
Volunteer-contributed geographic data (VGI) is an important source of geospatial big data that support research and applications. A major concern on VGI data quality is that the underlying observation processes are inherently biased. Detecting observ...
of 24