Article

Antimony Ore Identification Method for Small Sample X-Ray Images with Random Distribution

1 State Key Laboratory of Coking Coal Resources Green Exploitation, China University of Mining and Technology, Xuzhou 221116, China
2 School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
3 Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou 221116, China
* Author to whom correspondence should be addressed.
Minerals 2025, 15(5), 483; https://doi.org/10.3390/min15050483
Submission received: 1 April 2025 / Revised: 28 April 2025 / Accepted: 2 May 2025 / Published: 5 May 2025
(This article belongs to the Special Issue Recent Advances in Ore Comminution)

Abstract
The performance of image processing is crucial for accurately sorting antimony ore, yet several challenges persist. Existing image segmentation methods struggle with X-ray ore images that contain high noise and interference. Additionally, traditional classification methods primarily utilize single physical properties, such as the R-value, leading to low accuracy. To address the segmentation issues, this paper proposes an improved method based on concave point detection. This involves obtaining a binary image of antimony ore through adaptive threshold segmentation, extracting the ore contour, and detecting concave points using vector-angle analysis. The influence of interfering concave points is minimized with the three-line method, while noise points are reduced through morphological operations based on area calculations. This results in accurate segmentation of the adherent antimony ore. For classification, this paper introduces a training method that combines transfer learning with shallow partial initialization. Transfer learning is employed to mitigate the challenge of limited antimony ore datasets when using deep learning models. The pre-trained model is then partially reinitialized according to a tailored strategy. Finally, fine-tuning is performed on the antimony ore dataset to achieve optimal results. Experimental results show that the proposed antimony ore segmentation method achieves accurate segmentation (96.27% correct segmentation rate). The proposed classification model training method effectively releases redundant parameters of the pre-trained model and achieves better classification performance on the target dataset (86.76% accuracy). Both methods are superior to traditional methods.

1. Introduction

In ore production, sorting is an indispensable step, and the efficacy of ore separation ultimately dictates whether the raw ore can be fully exploited [1]. In recent years, dual-energy X-ray sorting methods have received increasing attention [2,3]. Most research has focused on sorting based on the physical characteristics of the ore itself [4]. However, there is little research on the segmentation of adherent antimony ores, which is essential in the ore sorting process. Moreover, traditional mechanism-based classification methods, represented by the R-value, use only the physical properties of the ore as a single feature and are easily affected by thickness effects, beam hardening effects, and afterglow effects, resulting in low classification accuracy.
In the realm of image segmentation, techniques leveraging concave point detection prove highly effective for segmenting adherent round objects [5,6,7,8]. This method primarily exploits the prior knowledge that concave points inevitably appear at the adhesion points of round-like objects [9,10,11,12,13,14]. For instance, Yao et al. [15] employed a concavity detection algorithm for segmenting rice, whereas Song et al. [16] introduced a technique that integrates concavity detection with the watershed method for segmenting adherent cells. Concave point detection-based segmentation is fast, making it suitable for real-time tasks, and achieves good results on smooth-surfaced objects such as cells. Although antimony ore is also round-like, its physical properties produce many small interfering concave points on its surface. Directly applying concave point detection will detect these small concave points, which, if left unprocessed, lead to significant over-segmentation.
In the field of image classification, convolutional neural networks (e.g., U-Net, VGG, ResNet) are widely used. They can effectively extract and utilize the hidden features in the target dataset by automatically performing feature extraction and self-learning, thereby reducing the influence of interference factors [17,18]. However, deep learning methods have two limitations. First, training deep learning models for ore classification relies on a large amount of data, but owing to the variety of ore types and external factors, it is difficult to obtain image data of antimony ore. Second, deep learning models suffer from long training times, numerous parameters, and difficulty in optimization [19]. Transfer learning can compensate for the insufficient antimony ore image data and slow model convergence on the target dataset. Generally, pre-trained models obtained through network-based deep transfer learning [20,21,22] have good feature extraction capabilities that can be effectively applied to downstream tasks. However, during transfer, differences in data distribution may arise between the source domain and the target domain. This implies that pre-trained models, trained on the source dataset, may not fully capture all features of the target dataset and, in extreme cases, may even suffer negative transfer.
Pruning is a commonly used deep learning model compression method. According to the criterion applied, pruning can be divided into importance-based, reconstruction-based, loss-gradient-change-based, and similarity-based pruning. This paper focuses on importance-based and similarity-based pruning. Li et al. [23] pruned based on the importance of convolutional kernels, calculating the absolute value of each kernel's weights as the evaluation criterion, pruning several kernels with smaller absolute weight values, and then retraining to restore network accuracy, achieving a good compression ratio. Liu et al. [24] evaluated the importance of feature map channels by adding a scaling factor in the batch normalization layer and removing channels with higher sparsity. Lin et al. [25] argued that the rank of a feature map describes the independence of its vectors and the richness of its feature information: the larger the rank, the stronger the independence and the richer the information. They pruned the parameters of a convolutional layer based on the rank of the feature map after convolution. Importance-based pruning focuses on the importance of convolutional kernels and feature maps for the entire task, whereas similarity-based pruning focuses on the similarity of convolutional kernels and feature maps within the same layer. He et al. [26] proposed a pruning criterion based on the geometric median of filters, selecting and pruning the filters with the smallest sum of Euclidean distances to the other filters in the same layer, thus removing redundant filters. In the existing literature, researchers have paid more attention to single-criterion pruning. Although some results have been achieved, few methods integrate multiple criteria, especially with dynamically adjusted pruning strategies.
To solve the problems mentioned above, the main contributions of this paper are as follows. To address the existing problems in antimony ore image segmentation, this paper proposes a segmentation method for adherent antimony ore based on concave point detection. The method includes four parts: an adaptive threshold segmentation and image binarization module, a contour extraction module using Suzuki's algorithm [27], a concave point detection module based on vector angles, and a concave point matching module based on auxiliary lines of concave point connections [28]. To address the problems of insufficient data and long model training times in current ore image classification methods, a model training approach that integrates transfer learning with shallow partial initialization of the model is introduced.

2. Materials and Methods

2.1. Materials

In this study, antimony ore was used for experimental verification. Antimony ore samples were collected from industrial partners in the Shenyang mineral processing zone, Liaoning Province, China, a region known for its polymetallic deposits. To ensure the effectiveness of the experiment, a standardized preparation scheme was implemented, including jaw crushing (particle size < 10 mm) and rotary splitting, to achieve mineralogical uniformity, which is essential for consistent X-ray imaging. According to economic feasibility, and considering process integrity, the samples were divided into three categories: (1) concentrate (>25% antimony content), representing economically viable ore; (2) medium-grade ore (0.5%–5% antimony), a secondary economic material that requires mineral processing; and (3) tailings (<0.05% antimony), representing the waste stream.
A schematic diagram of a typical X-ray-based antimony ore sorting device is shown in Figure 1. The sorting process is as follows: The original antimony ore is crushed into small stones of uniform size, and then transported by conveyor belt to the pseudo dual energy X-ray identification device. The computer uses high- and low-energy images provided by the X-rays to determine the grade of the antimony ore and transmits its coordinates to an air valve. The air valve adjusts the air pressure based on these coordinates to blow the antimony ore into different sorting bins. In this process, the image processing algorithm’s ability to accurately segment the position of the antimony ore from the X-ray images and correctly classify the ore is essential for ensuring effective sorting.

2.2. Design of X-Ray Antimony Ore Image Segmentation Method

Before achieving model classification, it is necessary to ensure that the adherent antimony ores in the collected image data are correctly segmented. This paper employs an X-ray-based antimony ore image segmentation method, which includes a contour extraction module, a concave point detection module, and a concave point matching module. The specific process is shown in Figure 2. The contour extraction module primarily involves image binarization, noise processing, and contour extraction. The concave point detection module mainly determines whether a contour point is a concave point by calculating the concavity of the contour points using the vector method. The concave point matching module is the main contribution of this section. It reduces the probability of over-segmentation by drawing auxiliary lines connecting concave points and determining the relative positions of these auxiliary lines and the antimony ore contours, thereby avoiding the interference of ore boundaries and the influence of concave points on matching.
In the contour extraction module, adaptive threshold segmentation is first used to binarize the image and perform an initial segmentation of the adherent ore regions. To remove noise points in the antimony ore images, morphological dilation and erosion are applied sequentially to eliminate noise inside and outside the ore. Finally, the method proposed by Suzuki et al. [27] is used to extract the contour from the denoised binary image.
In the concave point detection module, the concavity of a contour point is calculated using the vector method. For each contour point, two vectors u and v pointing from the point to its neighboring contour points are constructed; their angles to the positive x-axis, θ(u_x, u_y) and θ(v_x, v_y), are computed with Equation (1), and the difference between these results gives the angular difference θ_1. The angle θ(x, y) derived from Equation (1) is measured counterclockwise from the positive direction of the x-axis, over the range [0, 2π), i.e., from 0° to 360°.
$$
\theta(x, y) =
\begin{cases}
\arctan(y/x), & x > 0,\ y \ge 0 \\
\arctan(y/x) + \pi, & x < 0 \\
\arctan(y/x) + 2\pi, & x > 0,\ y < 0 \\
\pi/2, & x = 0,\ y > 0 \\
3\pi/2, & x = 0,\ y < 0
\end{cases}
\tag{1}
$$
Since θ_1 is the difference of two angles in [0, 2π), it can fall anywhere in (−2π, 2π); the result is therefore mapped with Equation (2) to obtain the actual angle value α:
$$
\alpha =
\begin{cases}
\theta_1 + 2\pi, & \theta_1 \in (-2\pi, -\pi) \\
\theta_1, & \theta_1 \in [-\pi, \pi] \\
\theta_1 - 2\pi, & \theta_1 \in (\pi, 2\pi)
\end{cases}
\tag{2}
$$
We can intuitively determine the convexity or concavity of points on the contour, as well as its degree, from the value of α. A contour point is considered a concave point only when α ∈ [0, θ_t]. Repeated experiments showed that θ_t = 3π/4 achieves a good concave point detection effect on our antimony ore samples, so this paper adopts this threshold; different samples may call for different threshold choices.
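The concave point test of Equations (1) and (2) can be sketched in Python as follows. This is a minimal sketch, not the paper's implementation: the contour representation (a list of (x, y) points) and the neighbour offset `step` are illustrative assumptions, and `atan2` is used as a compact equivalent of the piecewise definition in Equation (1).

```python
import math

THETA_T = 3 * math.pi / 4  # concave point threshold used in the paper

def angle_0_2pi(x, y):
    """Equation (1): angle of vector (x, y) from the positive x-axis, in [0, 2*pi).
    atan2 covers the same quadrant cases as the piecewise definition."""
    a = math.atan2(y, x)
    return a if a >= 0 else a + 2 * math.pi

def wrap_alpha(t1):
    """Equation (2): map an angle difference in (-2*pi, 2*pi) into [-pi, pi]."""
    if t1 > math.pi:
        return t1 - 2 * math.pi
    if t1 < -math.pi:
        return t1 + 2 * math.pi
    return t1

def concave_points(contour, step=3, theta_t=THETA_T):
    """Flag contour points whose vector angle alpha lies in [0, theta_t].
    `contour` is a list of (x, y) points; `step` (the neighbour offset used to
    build the vectors u and v) is an illustrative assumption, not a value
    taken from the paper."""
    hits = []
    n = len(contour)
    for i, (px, py) in enumerate(contour):
        ax, ay = contour[(i - step) % n]   # neighbour behind -> vector u
        bx, by = contour[(i + step) % n]   # neighbour ahead  -> vector v
        t1 = angle_0_2pi(ax - px, ay - py) - angle_0_2pi(bx - px, by - py)
        alpha = wrap_alpha(t1)
        if 0 <= alpha <= theta_t:
            hits.append(i)
    return hits
```

The sign convention of α (and hence which side counts as "concave") depends on the contour's traversal direction, so in practice the orientation returned by the contour extractor must be fixed first.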
This part's main contribution is the concave point matching module. It constructs auxiliary lines parallel to the line connecting a pair of concave points and determines the positions of these auxiliary lines relative to the antimony ore contour, avoiding the influence of boundary interference on concave point matching and thus reducing the probability of over-segmentation. The auxiliary lines are parallel line segments with the same length as the concave point connection. Therefore, we only need the concave point coordinates, the slope k of the concave point connection, and the distance d from the connection to the auxiliary lines to compute the offsets Δx and Δy and ultimately obtain the auxiliary lines. Given the endpoint coordinates (r_0, c_0) and (r_1, c_1) of the concave point connection, we first calculate the slope k using Equation (3):
$$
k = \frac{c_1 - c_0}{r_1 - r_0}
\tag{3}
$$
The offsets Δx and Δy between the concave point coordinates and the endpoint coordinates of the auxiliary line follow from the slope and the distance:
$$
\Delta x = \frac{kd}{\sqrt{1 + k^2}}
\tag{4}
$$
$$
\Delta y = \frac{\Delta x}{k}
\tag{5}
$$
Translating the endpoint coordinates of the concave point connection, (r_0, c_0) and (r_1, c_1), by ±Δx and ±Δy along the horizontal and vertical axes gives the endpoint coordinates of the auxiliary lines P_0, P_1, N_0, N_1 on both sides of the connection:
$$
P_0 = (r_0 + \Delta x,\ c_0 + \Delta y)
\tag{6}
$$
$$
P_1 = (r_1 + \Delta x,\ c_1 + \Delta y)
\tag{7}
$$
$$
N_0 = (r_0 - \Delta x,\ c_0 - \Delta y)
\tag{8}
$$
$$
N_1 = (r_1 - \Delta x,\ c_1 - \Delta y)
\tag{9}
$$
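Equations (3)–(9) can be collected into one small helper. This is a sketch under assumptions: (r, c) coordinates are used as in the text, and the handling of vertical (undefined slope) and horizontal (k = 0) connections is an assumed special case the equations leave implicit.

```python
import math

def auxiliary_endpoints(p0, p1, d):
    """Endpoints of the two auxiliary lines parallel to the concave point
    connection p0-p1 at offset distance d (Equations (3)-(9))."""
    (r0, c0), (r1, c1) = p0, p1
    if r1 == r0:                     # vertical connection: slope undefined (assumed handling)
        dx, dy = d, 0.0
    else:
        k = (c1 - c0) / (r1 - r0)    # Equation (3)
        if k == 0:                   # horizontal connection (assumed handling)
            dx, dy = 0.0, d
        else:
            dx = k * d / math.sqrt(1 + k * k)  # Equation (4)
            dy = dx / k                         # Equation (5)
    P0, P1 = (r0 + dx, c0 + dy), (r1 + dx, c1 + dy)   # Equations (6)-(7)
    N0, N1 = (r0 - dx, c0 - dy), (r1 - dx, c1 - dy)   # Equations (8)-(9)
    return (P0, P1), (N0, N1)
```

For example, with p0 = (0, 0), p1 = (3, 4), and d = 5, the slope is k = 4/3, giving Δx = 4 and Δy = 3, so P_0 = (4, 3) and N_0 = (−4, −3).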
Connecting the endpoints P_0 and P_1 gives the auxiliary line segment L_P; similarly, connecting N_0 and N_1 gives L_N. By applying the above method to each candidate segmentation line, the true segmentation lines of adherent antimony ore can be found among the candidates. Algorithm 1 is described in detail below:
Algorithm 1: Calculate candidate segmentation lines.
Input: The set of concave points C on the contour, and the contour itself.
Output: Collection of segmentation lines L_list.
Step 1. For the set of concave points C = {p_1, p_2, …, p_n}, form all pairs {p_1, p_2}, {p_1, p_3}, …, {p_i, p_j}, …, {p_{n−1}, p_n}.
Step 2. Calculate the distance for each pair, D = {d_{1,2}, d_{1,3}, …, d_{i,j}, …, d_{n−1,n}}, where d_{i,j} = |p_i − p_j|.
Step 3. Find the smallest value d_{k,q} in the set D and take the line connecting p_k and p_q as a candidate segmentation line.
Step 4. Remove all d_{k,t} and d_{t,q} from D, where t = 1, 2, …, n, and remove p_k and p_q from C.
Step 5. Repeat Steps 3–4 until fewer than 2 elements remain in C.
Step 6. For each candidate segmentation line L do
Plot its auxiliary lines L_siblings at distance d;
If L_siblings are both inside the contour then
Add L to L_list.
End if
End for
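Steps 1–5 of Algorithm 1 amount to greedy nearest-pair matching of concave points. A self-contained sketch (the point format and the Euclidean distance metric are assumptions; the auxiliary-line filter of Step 6 is omitted here):

```python
from itertools import combinations
import math

def match_concave_pairs(points):
    """Greedy matching (Algorithm 1, Steps 1-5): repeatedly join the two
    closest unmatched concave points until fewer than two remain."""
    pts = dict(enumerate(points))
    # Step 1-2: all pairs and their distances
    dists = {(i, j): math.dist(points[i], points[j])
             for i, j in combinations(range(len(points)), 2)}
    pairs = []
    while len(pts) >= 2:
        # Step 3: closest remaining pair becomes a candidate segmentation line
        (i, j), _ = min(dists.items(), key=lambda kv: kv[1])
        pairs.append((pts[i], pts[j]))
        # Step 4: drop the matched points and every distance involving them
        del pts[i], pts[j]
        dists = {k: v for k, v in dists.items() if i not in k and j not in k}
    return pairs
```

Each returned pair would then be kept only if both of its auxiliary lines lie inside the ore contour, as in Step 6.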

2.3. A Method for Ore Classification Incorporating Transfer Learning and Model Shallow Part Initialization

The schematic diagram of the proposed antimony ore classification model training method is shown in Figure 3. It consists of the following steps: 1. model pre-training; 2. partial reinitialization of the pre-trained model; 3. fine-tuning of the partially initialized model on the target dataset. First, the model is pre-trained on a larger dataset. Next, following the method proposed in Ref. [29], the model's sub-modules are divided into multiple dependency groups G according to their dependency relationships. These dependency groups are then ranked using magnitude- and similarity-based importance criteria, and the less important groups undergo shallow partial reinitialization. In this process, a parameter is introduced to balance the magnitude and similarity indicators when allocating the partial reinitialization parameter ratio.

2.3.1. Model Pre-Training

First, it is necessary to obtain a pre-trained model from a large dataset. Pre-trained models typically require a large amount of data to learn rich feature representations, and the size of the dataset directly affects the performance of the model. Additionally, the dataset should contain diverse samples so that the model can learn a wide range of features and patterns, which helps the model adapt better to the task of mineral classification. While meeting the requirements for dataset size and diversity, the higher the relevance of the dataset to the target task, the better the performance of the pre-trained model usually is.

2.3.2. Partial Reinitialization of the Pre-Trained Model

In this paper, we adopt the same notation as He et al. [26]. For a neural network with L layers, let N_i and N_{i+1} represent the number of input and output channels of the i-th convolutional layer, whose kernels have size K × K. The j-th convolution kernel of the i-th layer (1 ≤ j ≤ N_{i+1}) can be represented as F_{i,j} ∈ ℝ^{N_i × K × K}, so the set of convolution kernels of the i-th layer can be represented by the weight tensor W^(i) ∈ ℝ^{N_{i+1} × N_i × K × K}. For simplicity, this paper flattens the weight values of each convolution kernel F_{i,j} into a one-dimensional vector x_j ∈ ℝ^{M_i}; that is, the i-th convolutional layer contains N_{i+1} convolution kernel vectors, each of size M_i = N_i × K × K.
To release more parameters in the shallow layers for learning features that better fit the target dataset, this paper needs to adopt a model pruning approach to find convolutional kernels that can be partially reinitialized. The goal of the algorithm is to iteratively select and partially reinitialize unimportant groups of convolutional kernels. This way, after partial reinitialization, the convolutional kernels that have not been reinitialized can still extract features from the source dataset as effectively as possible, minimizing the loss of accuracy in the partially reinitialized model. This paper uses amplitude and similarity measures, two widely used metrics, to assess the similarity of convolutional kernels.
In the field of model pruning, researchers have long observed that parameters with smaller weights can be safely pruned without affecting the model's performance, a principle that also applies to pruning convolutional kernels. Li et al. [23] demonstrated through extensive experiments that convolutional kernels with smaller norms have less impact on the model's final classification results than kernels with larger norms, and thus can be removed. This paper likewise uses the norm of the convolutional kernels as a measure of their importance. With M_i the size of a single flattened kernel vector as defined above, the p-norm of a kernel vector x_j ∈ ℝ^{1 × M_i} can be calculated by Equation (10).
$$
\|x_j\|_p = \left( \sum_{m=1}^{M_i} |x_m|^p \right)^{1/p}
\tag{10}
$$
While convolutional kernels with smaller amplitudes can be safely removed as redundant parameters, those with larger amplitudes may also be redundant. He et al. [26,27,28,30] proposed that similarity can also serve as an evaluation metric for the importance of convolutional kernels, in addition to amplitude. Since convolutional kernels with high similarity can extract similar image features, one of them can be safely removed without affecting the model’s performance. Therefore, this paper comprehensively uses both amplitude and similarity as evaluation metrics for the importance of convolutional kernels. Methods for calculating convolutional kernel similarity typically include Euclidean similarity and cosine similarity. This paper selects cosine similarity, as shown in Equation (11), as the evaluation metric.
$$
D_{\cos}(x, y) = 1 - \frac{\sum_{m=1}^{M_i} x_m y_m}{\sqrt{\sum_{m=1}^{M_i} x_m^2} \times \sqrt{\sum_{m=1}^{M_i} y_m^2}}
\tag{11}
$$
D_cos(x, y) is the cosine distance measure, x and y are the two convolution kernel parameter vectors to be compared, and M_i is as defined above.
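Equations (10) and (11) can be sketched directly on flattened kernel vectors; plain Python lists stand in here for the kernel tensors of a real network:

```python
import math

def kernel_p_norm(x, p=1):
    """Equation (10): p-norm of a flattened convolution kernel vector."""
    return sum(abs(v) ** p for v in x) ** (1.0 / p)

def cosine_distance(x, y):
    """Equation (11): cosine distance between two flattened kernel vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    nx = math.sqrt(sum(a * a for a in x))
    ny = math.sqrt(sum(b * b for b in y))
    return 1.0 - dot / (nx * ny)
```

Kernels with a small p-norm, or one kernel of a pair with near-zero cosine distance, are the candidates marked for partial reinitialization.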
In this paper, an iterative approach is used to prune convolutional kernels. During each iteration, after measuring the importance of convolutional kernels using Equation (10) or Equation (11), the algorithm selectively reinitializes the least important convolutional kernels in a structured manner. This means that the algorithm not only reinitializes the least important convolutional kernels, but also reinitializes a portion of the parameters corresponding to the subsequent connected convolutional kernels, resulting in a final actual reinitialization rate P r that is higher than the set value P .
To maximize the release of redundant parameters in the convolutional kernels using both amplitude and similarity as metrics for convolutional kernel similarity, this paper uses an initialization rate P to determine when to use amplitude or similarity indicators. The ultimate goal of partially reinitializing the convolutional kernels is to achieve a proportion of free parameters in the shallow layer L s h a l l o w that is greater than or equal to the given initialization rate P through iterative partial reinitialization. The formula for the initialization rate is shown in Equation (12).
$$
P = 1 - \frac{f_{flop}(W')}{f_{flop}(W)}
\tag{12}
$$
f_{flop}(W') represents the number of parameters of the trimmed model, and f_{flop}(W) represents the number of parameters of the pre-trained model.
In the initial stage, this paper uses an amplitude-based importance criterion for reinitialization. After a certain degree of reinitialization, it switches to a similarity-based criterion for partial initialization of the model. This method is chosen because after removing convolutional kernels with smaller amplitudes, those with larger amplitudes may be similar to each other. These similar convolutional kernels can be redundant, as they can be mutually replaceable in feature extraction and processing. Given a magnitude pruning ratio w m a g , approximately P × w m a g proportion of parameters are pruned using the amplitude-based criterion during the algorithm’s execution, while the remaining P × ( 1 w m a g ) are partially reinitialized using the similarity criterion. In this paper, w m a g is set to 0.5 to make both pruning criteria equally effective.
The detailed algorithm for pre-trained model reinitialization is shown in Algorithm 2. Given a pre-trained model M_p, a target initialization rate P, and a magnitude pruning ratio w_mag, the algorithm iteratively selects unimportant convolutional kernels, obtains their corresponding dependency groups G, and partially reinitializes all sub-modules in those groups. After calculating the new reinitialization rate P′, the algorithm compares P′ with the target initialization rate P; if the criterion is not met, the initialization operation continues until P′ is greater than or equal to P, resulting in the final partially initialized model.
Algorithm 2: Pre-trained model reinitialization
Input: Dataset D_s, pre-trained model M_p, given initialization rate P, magnitude pruning ratio w_mag.
Output: Model after reinitialization M_r.
Step 1. Initialization: W′ ← W, P′ ← 0, list of trimmed parameters P_list ← {}.
Step 2. Find all submodules W_i in the shallow module and their corresponding dependency groups G = {G_1, G_2, …, G_i}.
Step 3. While P′ < P do
If P′ ≤ P × w_mag then
For j in 1, 2, …, i do
Calculate the amplitudes {M_1, M_2, …, M_i} of the convolutional kernels of each sub-module according to Equation (10), find the smallest amplitude M_min, and add M_min together with the parameters corresponding to it in its dependency group G to P_list.
End for
Else
For j in 1, 2, …, i do
Calculate the similarity matrix S = [s_uv] between the sub-modules according to Equation (11), where s_uv denotes the similarity between the u-th and v-th kernel vectors.
Sort the entries of S in descending order: s_1, s_2, …, s_n. If the maximum s_1 = s_xy, mark M_x to be cropped and remove the entries M_{y,k} (k = 1, 2, …, n) and M_{t,y} (t = 1, 2, …, m) from the candidates, so that terms related to M_y are not also cropped. Add M_x and the parameters corresponding to it in the dependency group G to P_list.
End for
End if
Update the reinitialization rate P′ according to Equation (12).
End while
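The two-phase loop of Algorithm 2 can be sketched on a toy "model" of flattened kernel vectors. This is a simplified sketch, not the paper's implementation: dependency-group bookkeeping and the actual weight reinitialization are omitted, the L1 norm stands in for the amplitude criterion, and a greedy search over surviving pairs stands in for the similarity-matrix step.

```python
import math

def reinit_schedule(kernels, target_rate, w_mag=0.5):
    """Select kernels to free (reinitialize): by amplitude (Equation (10))
    until the freed ratio reaches target_rate * w_mag, then by pairwise
    cosine similarity (Equation (11)) until it reaches target_rate."""
    total = sum(len(k) for k in kernels)
    freed, free_idx = 0, set()

    def l1(v):
        return sum(abs(x) for x in v)

    def cos_sim(a, b):
        na = math.sqrt(sum(x * x for x in a)) or 1.0
        nb = math.sqrt(sum(x * x for x in b)) or 1.0
        return sum(x * y for x, y in zip(a, b)) / (na * nb)

    while freed / total < target_rate and len(free_idx) < len(kernels):
        alive = [i for i in range(len(kernels)) if i not in free_idx]
        if freed / total < target_rate * w_mag:
            # Phase 1: amplitude criterion - free the smallest-norm kernel
            pick = min(alive, key=lambda i: l1(kernels[i]))
        else:
            # Phase 2: similarity criterion - free one kernel of the most
            # similar surviving pair (its twin can replace it)
            best, pick = -2.0, alive[0]
            for a in alive:
                for b in alive:
                    if a < b:
                        s = cos_sim(kernels[a], kernels[b])
                        if s > best:
                            best, pick = s, a
            # the twin (b of the best pair) stays, as in Algorithm 2
        free_idx.add(pick)
        freed += len(kernels[pick])
    return free_idx
```

With w_mag = 0.5 as in the paper, roughly half the freed budget comes from each criterion.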

2.3.3. Reinitialized Model Training

After obtaining the reinitialized model M_r with some of the shallow parameters released, the shallow parameters are copied one by one into a completely reinitialized model M_n, yielding a target model whose shallow layers are partially initialized and whose deep layers are fully initialized. The model M_n is then trained on the target dataset D_t to obtain the final fine-tuned model M_d. Note that in the traditional deep learning training framework, model parameters are updated at the granularity of complete convolutional layer weights W^(i). Therefore, during training, this paper preserves the feature extraction capability learned on the original dataset D_s by saving the uncropped shallow parameters before each parameter update and restoring them afterwards.
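The retain-and-restore trick can be sketched as follows; flat parameter lists and a plain gradient step stand in for a real optimizer, and the mask layout is an illustrative assumption:

```python
def train_step_with_restore(params, grads, retained_mask, lr=0.01):
    """One update in which the whole layer is updated, then the retained
    (uncropped) shallow parameters are restored to their pre-update values,
    so that only the reinitialized parameters actually learn."""
    saved = list(params)                                   # snapshot before the update
    params = [p - lr * g for p, g in zip(params, grads)]   # full-layer update
    return [s if keep else p                               # restore retained entries
            for p, s, keep in zip(params, saved, retained_mask)]
```

For example, with `params = [1.0, 2.0]`, `grads = [1.0, 1.0]`, `retained_mask = [True, False]`, and `lr = 0.1`, the first (retained) parameter stays 1.0 while the second becomes 1.9.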

3. Results and Discussion

3.1. Experiments on an Improved Antimony Ore Segmentation Method Based on Concave Point Detection

All simulation experiments in this paper are run on a computer equipped with an Intel i9-12900K CPU and a GeForce RTX 3090 GPU. The image segmentation algorithm is verified using Python 3.8.8 and OpenCV 4.5.5, and the deep learning image classification algorithm using the PyTorch 1.12 library.
To verify the effect of the proposed algorithm, antimony ore experiments are carried out; the relevant sample parameters are shown in Table 1. This paper selects 119 single-channel, 512 × 5632, 16-bit grayscale images.
To better illustrate the problem, this article compares concave point matching based on the three-line method with simple concave point matching and the watershed algorithm, where the latter two methods rely on distance transformation. All three use exactly the same input dataset. A few images are selected as typical adhesion images to illustrate the segmentation effect. As shown in Figure 4, the watershed algorithm cannot separate antimony ore with a large adhesion area, and even in cases where it can, the segmentation line is sometimes inaccurate. Where concave points arise from the antimony ore edges, simple concave point matching over-segments, while the three-line method proposed in this paper handles these cases well.
To compare the performance of the segmentation algorithms more intuitively, the under-segmentation rate, over-segmentation rate, and accuracy rate are used as performance indices for the segmentation of adherent antimony ore. The segmentation accuracy rate is defined as:
$$
P = \frac{R}{M} \times 100\%
$$
where P represents the segmentation accuracy, R the number of correctly segmented particles, and M the total number of particles.
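In code, the accuracy formula is a one-liner; the particle counts below are hypothetical, chosen only to illustrate the calculation, and are not the paper's tallies:

```python
def segmentation_accuracy(correct, total):
    """Segmentation accuracy P = R / M * 100%."""
    return 100.0 * correct / total

# Illustrative counts only (hypothetical, not the paper's data):
print(round(segmentation_accuracy(118, 122), 2))  # 96.72
```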
Table 2 quantifies the proposed framework's superiority: it achieves 96.27% segmentation accuracy, a 3.73 percentage point improvement over the watershed algorithm (92.54%) and a 7.46 percentage point gain over simple concave matching (88.81%). Critically, it eliminates over-segmentation artifacts (0% incidence) while maintaining an under-segmentation rate (3.73%) equivalent to the baseline method.

3.2. Experimental Validation of Transfer Learning with Shallow Partial Initialization

3.2.1. Experimental Dataset

To select a suitable pre-training dataset, this paper tests the accuracy of the ore classification model without transfer learning and with pre-training on different datasets [31,32,33]; the experimental results are as follows.
From Table 3, it can be seen that pre-training on different datasets yields different ore classification results, with pre-training on the ImageNet dataset performing best. Therefore, this paper chooses the ImageNet pre-trained model as the base model M_p for the subsequent reinitialization.
The antimony ore dataset is divided into a training set and a test set at a ratio of 0.8:0.2, as shown in Table 4 below.
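The stated 0.8:0.2 split can be sketched with the standard library; the seed value here is an arbitrary assumption added for reproducibility:

```python
import random

def split_dataset(samples, train_frac=0.8, seed=42):
    """Shuffle and split samples into train/test at the given ratio."""
    items = list(samples)
    random.Random(seed).shuffle(items)  # deterministic shuffle for reproducibility
    cut = int(len(items) * train_frac)
    return items[:cut], items[cut:]
```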

3.2.2. Deep Learning Model

This experiment uses VGG16 [34] as the deep learning model for classification and ResNet18 [35] as the comparison model. VGG16 contains 16 weight layers: 13 convolutional layers {L_c1, L_c2, …, L_c13} and 3 fully connected layers {L_l1, L_l2, L_l3}. The convolutional layers are divided into multiple repeated units, each containing two or three convolutional blocks using small 3 × 3 convolution kernels, followed by a max pooling layer. After the convolutional layers come three fully connected layers, of which the first two have 4096 neurons each, while the number of neurons in the last depends on the number of classes in the classification task. On the VGG16 model, the first three convolutional layers L_shallow = {L_c1, L_c2, L_c3} are used as the target shallow region that is partially reinitialized. The ResNet18 model includes 18 weight layers: 17 convolutional layers {L_c1, L_c2, …, L_c17} and one fully connected layer L_l1. Its convolutional layers are divided into multiple repeated units mixing pooling layers with convolution kernels of size 3 × 3 and 7 × 7. The first three convolutional layers are likewise taken as the shallow part L_shallow = {L_c1, L_c2, L_c3}.

3.2.3. Analysis of the Effectiveness of the Classification Algorithm

To better illustrate the effectiveness and generality of the proposed antimony ore model training method, which incorporates transfer learning and initialization of the shallow part of the model, the method is applied to the VGG16 model and compared with the ResNet18 model, and the accuracy of the two models on the antimony ore dataset is evaluated. The accuracies obtained for different given initialization rates are listed in Table 5, where D_V represents the accuracy of the VGG16 model and D_R the accuracy of the ResNet18 model. In Figure 5, the accuracy of the VGG16 model is plotted as a line chart, where each value is averaged over five experiments.

3.3. Industrial Application Analysis

Although existing ore sorting systems have achieved a certain level of automation and intelligence, they still face significant challenges and limitations in practical applications. These challenges are primarily reflected in two areas: (1) the lack of functionality for algorithm tuning, and (2) the need for improved operation control optimization. To address these limitations, this paper presents an X-ray-based ore sorting system, built on the hardware platform of a company in Shenyang, as illustrated in Figure 6. The newly developed system offers an intuitive user interface that not only includes traditional features such as user management, production statistics, and real-time monitoring, but also incorporates advanced capabilities like algorithm parameter management and operational parameter optimization.

3.3.1. Segmentation Parameter Settings

The segmentation settings interface, as shown in Figure 7, provides a range of parameter selection options, including the threshold algorithm, concave point threshold, distance (d), erosion kernel size, dilation kernel size, auxiliary line color, and segmentation line color. These parameters play a critical role in the performance of the image segmentation algorithm, as they directly influence the segmentation quality and, consequently, the accuracy of ore sorting. Additionally, the system tracks the segmentation accuracy under various parameter configurations, offering users valuable insights to assess the effectiveness of different parameter settings and optimize the sorting process.

3.3.2. Classification Model Management

To ensure the X-ray ore sorting system achieves accurate classification based on the distinct characteristics of different ores, the system features a robust model management interface, as shown in Figure 8. On the left side of the interface, the currently selected model, such as the VGG16 model, is displayed in a schematic form, allowing users to quickly understand the model’s structure and key features. The parameter selection area on the right offers a range of training parameters, including optimizer selection, learning rate, weight decay, and momentum decay rate adjustments. These parameters directly influence the model’s training efficiency and final performance.
Additionally, the model management panel logs detailed training information, such as the session serial number, operator, build time, models used, datasets (training and testing), and the corresponding accuracy metrics, providing valuable insights for performance evaluation.
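The training parameters exposed by the management panel correspond to standard optimizer arguments. A hedged PyTorch sketch with illustrative default values (not the system's recorded settings), using a stand-in module in place of the full classification network:

```python
import torch
import torch.nn as nn

model = nn.Linear(10, 2)  # stand-in for the VGG16 classification network

# SGD exposes learning rate, weight decay, and momentum directly.
optimizer = torch.optim.SGD(model.parameters(), lr=1e-3,
                            momentum=0.9, weight_decay=5e-4)

# An Adam alternative, where the betas act as momentum decay rates.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3,
                             betas=(0.9, 0.999), weight_decay=5e-4)
print(optimizer.defaults["lr"])
```

The management interface effectively serializes these choices per training session, alongside the dataset and accuracy metadata it logs.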

3.3.3. Real-Time Monitoring

The real-time processing results of the X-ray sorting machine’s image processing algorithm are shown in Figure 9. In the upper-left corner, the original video stream from the X-ray sorting machine is displayed, while the lower-left corner shows the real-time segmentation and classification results. The image is annotated with green and red pixel points, as well as semi-transparent yellow and red masks, to represent the segmentation lines, contours, identified tailings, and identified concentrates within the ore. On the right side, a list displays the coordinates of the center points for the segmented ore images, along with their corresponding classification results.

4. Conclusions

To improve existing antimony ore image processing methods, this paper proposes a series of solutions. To address the over-segmentation and under-segmentation problems in antimony ore segmentation, an improved concave-point-detection segmentation method is proposed to achieve accurate segmentation of adherent antimony ore. The method mainly uses morphological operations, together with the three-wire method proposed in this paper for concave point matching, which successfully removes the interference that fine concave points on the antimony ore contour cause in image segmentation. To address the low performance of deep learning applied to X-ray-image-based antimony ore classification, this paper proposes a training method that integrates transfer learning with re-initialization of the shallow part of the model, improving antimony ore classification accuracy. A series of comparative experiments verifies the effectiveness of the introduced methods: the proposed segmentation method achieves accurate segmentation (96.27% correct segmentation rate, Table 2), and the proposed classification model training method effectively releases some redundant parameters of the pre-trained model and achieves better classification performance on the target dataset (86.76% accuracy, Table 5). Both methods outperform traditional methods.
X-ray ore sorting system software, tailored for on-site applications, has been successfully developed. This system offers a range of features, including algorithm management, equipment control, real-time monitoring, historical data analysis, and log recording. Practical implementation at beneficiation sites has demonstrated that the system effectively addresses existing challenges in ore sorting, significantly enhancing process efficiency and enabling better management of the entire ore sorting operation.

Author Contributions

L.W.: model conception, supervision; C.D.: formula derivation, design of experiments; H.H.: data generation, data analysis; H.W.: paper editing, model optimization; W.D.: result analysis. All authors have read and agreed to the published version of the manuscript.

Funding

This work was financially supported by the National Natural Science Foundation of China (52304309), Key Research and Development Project of Liaoning Province (2024JH2/102400026), Basic Research Program of Jiangsu (BK20241636) and the Fundamental Research Funds for the Central Universities of China (2024QN11074).

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to the patent protection period.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Qin, X.; Deng, J.; Lai, H.; Zhang, X. Beneficiation of Antimony Oxide Ore: A Review. Russ. J. Non-Ferr. Met. 2017, 58, 321–329. [Google Scholar] [CrossRef]
  2. Jung, D.; Choi, Y. Systematic Review of Machine Learning Applications in Mining: Exploration, Exploitation, and Reclamation. Minerals 2021, 11, 148. [Google Scholar] [CrossRef]
  3. Bhuiyan, I.U.; Mouzon, J.; Hedlund, J.; Forsberg, F.; Sjödahl, M.; Forsmo, S.P.E. Consideration of X-Ray Microtomography to Quantitatively Determine the Size Distribution of Bubble Cavities in Iron Ore Pellets. Powder Technol. 2013, 233, 312–318. [Google Scholar] [CrossRef]
  4. Von Ketelhodt, L.; Bergmann, C. Dual Energy X-Ray Transmission Sorting of Coal. J. South. Afr. Inst. Min. Metall. 2010, 110, 371–378. [Google Scholar]
  5. Otsu, N. A Threshold Selection Method from Gray-Level Histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
  6. Wang, Y. Overview of Image Segmentation Methods Based on Deep Learning; SPIE: Bellingham, WA, USA, 2024; p. 13184. [Google Scholar]
  7. Rosenfeld, A. The Max Roberts Operator Is a Hueckel-Type Edge Detector. IEEE Trans. Pattern Anal. Mach. Intell. 1981, PAMI-3, 101–103. [Google Scholar] [CrossRef]
  8. Khan, J.F.; Bhuiyan, S.M.A.; Adhami, R.R. Image Segmentation and Shape Analysis for Road-Sign Detection. IEEE Trans. Intell. Transport. Syst. 2011, 12, 83–96. [Google Scholar] [CrossRef]
  9. Gao, W.; Zhang, X.; Yang, L.; Liu, H. An Improved Sobel Edge Detection. In Proceedings of the 2010 3rd International Conference on Computer Science and Information Technology, Chengdu, China, 9–11 July 2010; Volume 5, pp. 67–71. [Google Scholar] [CrossRef]
  10. Er-sen, L.; Shu-long, Z.; Bao-shan, Z.; Yong, Z.; Chao-gui, X.; Li-hua, S. An Adaptive Edge-Detection Method Based on the Canny Operator. In Proceedings of the 2009 International Conference on Environmental Science and Information Application Technology, Wuhan, China, 4–5 July 2009; Volume 1, pp. 465–469. [Google Scholar] [CrossRef]
  11. Pham, D.L.; Xu, C.; Prince, J.L. Current methods in medical image segmentation. Annu. Rev. Biomed. Eng. 2000, 2, 315. [Google Scholar] [CrossRef]
  12. Tremeau, A.; Borel, N. A Region Growing and Merging Algorithm to Color Segmentation. Pattern Recognit. 1997, 30, 1191–1203. [Google Scholar] [CrossRef]
  13. Chandra, J.N.; Supraja, B.S.; Bhavana, V. A Survey on Advanced Segmentation Techniques in Image Processing Applications. In Proceedings of the 2017 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), Computational Intelligence and Computing Research (ICCIC), Coimbatore, India, 14–16 December 2017; pp. 1–5. [Google Scholar] [CrossRef]
  14. Chien, S.Y.; Huang, Y.W.; Chen, L.G. Predictive Watershed: A Fast Watershed Algorithm for Video Segmentation. IEEE Trans. Circuits Syst. Video Technol. 2003, 13, 453–461. [Google Scholar] [CrossRef]
  15. Yao, Y.; Wu, W.; Yang, T.; Liu, T.; Chen, W.; Chen, C.; Li, R.; Zhou, T.; Sun, C.; Zhou, Y.; et al. Head Rice Rate Measurement Based on Concave Point Matching. Sci. Rep. 2017, 7, 41353. [Google Scholar] [CrossRef] [PubMed]
  16. Song, H.; Zhao, Q.; Liu, Y. Splitting Touching Cells Based on Concave-Point and Improved Watershed Algorithms. Front. Comput. Sci. Sel. Publ. Chin. Univ. 2014, 8, 156–162. [Google Scholar] [CrossRef]
  17. Jin, C.; Wang, K.; Han, T.; Lu, Y.; Liu, A.; Liu, D. Segmentation of Ore and Waste Rocks in Borehole Images Using the Multi-Module Densely Connected U-Net. Comput. Geosci. 2022, 159, 105018. [Google Scholar] [CrossRef]
  18. Cai, Y.; Xu, D.; Shi, H. Rapid Identification of Ore Minerals Using Multi-Scale Dilated Convolutional Attention Network Associated with Portable Raman Spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2022, 267, 120607. [Google Scholar] [CrossRef]
  19. Qiu, J.; Zhang, Y.; Fu, C.; Yang, Y.; Ye, Y.; Wang, R.; Tang, B. Study on Photofluorescent Uranium Ore Sorting Based on Deep Learning. Miner. Eng. 2024, 206, 108523. [Google Scholar] [CrossRef]
  20. Huang, J.-T.; Li, J.; Yu, D.; Deng, L.; Gong, Y. Cross-Language Knowledge Transfer Using Multilingual Deep Neural Network with Shared Hidden Layers. In Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Vancouver, BC, Canada, 26–31 May 2013; pp. 7304–7308. [Google Scholar] [CrossRef]
  21. Garg, S.; Singh, P. Transfer Learning Based Lightweight Ensemble Model for Imbalanced Breast Cancer Classification. IEEE/ACM Trans. Comput. Biol. Bioinf. 2023, 20, 1529–1539. [Google Scholar] [CrossRef]
  22. Zhou, W.; Wang, H.; Wan, Z. Ore Image Classification Based on Improved CNN. Comput. Electr. Eng. 2022, 99, 107819. [Google Scholar] [CrossRef]
  23. Li, H.; Kadav, A.; Durdanovic, I.; Samet, H.; Graf, H.P. Pruning Filters for Efficient ConvNets. arXiv 2016, arXiv:1608.08710. [Google Scholar]
  24. Liu, Z.; Li, J.; Shen, Z.; Huang, G.; Yan, S.; Zhang, C. Learning Efficient Convolutional Networks through Network Slimming. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2755–2763. [Google Scholar] [CrossRef]
  25. Lin, M.; Ji, R.; Wang, Y.; Zhang, Y.; Zhang, B.; Tian, Y.; Shao, L. HRank: Filter Pruning Using High-Rank Feature Map. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 1526–1535. [Google Scholar] [CrossRef]
  26. He, Y.; Liu, P.; Wang, Z.; Hu, Z.; Yang, Y. Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 4335–4344. [Google Scholar] [CrossRef]
  27. Suzuki, S.; Abe, K. Topological Structural Analysis of Digitized Binary Images by Border Following. Comput. Vis. Graph. Image Process. 1985, 30, 32–46. [Google Scholar] [CrossRef]
  28. Skala, V. Point-in-Convex Polygon and Point-in-Convex Polyhedron Algorithms with O(1) Complexity Using Space Subdivision. AIP Conf. Proc. 2016, 1738, 480034. [Google Scholar] [CrossRef]
  29. Fang, G.; Ma, X.; Song, M.; Bi Mi, M.; Wang, X. DepGraph: Towards Any Structural Pruning. In Proceedings of the 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 16091–16101. [Google Scholar] [CrossRef]
  30. He, Y.; Liu, P.; Zhu, L.; Yang, Y. Filter Pruning by Switching to Neighboring CNNs With Good Attributes. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 8044–8056. [Google Scholar] [CrossRef] [PubMed]
  31. Nesteruk, S.; Agafonova, J.; Pavlov, I.; Gerasimov, M.; Dimitrov, D.; Kuznetsov, A.; Kadurin, A.; Latyshev, N.; Plechov, P. MineralImage5k: A Benchmark for Zero-Shot Raw Mineral Visual Recognition and Description. Comput. Geosci. 2023, 178, 105414. [Google Scholar] [CrossRef]
  32. Kermany, D.S.; Liang, H.; Dong, J.; Pei, J.; Zhu, J.; Hewett, S.; Dong, J.; Fu, X.; Huu, V.A.N.; Zhang, E.D.; et al. Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell 2018, 172, 1122–1131.e9. [Google Scholar] [CrossRef]
  33. Deng, J.; Dong, W.; Socher, R.; Li, L.-J.; Li, K.; Fei-Fei, L. ImageNet: A Large-Scale Hierarchical Image Database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; pp. 248–255. [Google Scholar] [CrossRef]
  34. Pu, Y.; Apel, D.B.; Szmigiel, A.; Chen, J. Image Recognition of Coal and Coal Gangue Using a Convolutional Neural Network and Transfer Learning. Energies 2019, 12, 1735. [Google Scholar] [CrossRef]
  35. Ma, L.; Hu, Y.; Meng, Y.; Li, Z.; Chen, G. Multi-Plant Disease Identification Based on Lightweight ResNet18 Model. Agronomy 2023, 13, 2702. [Google Scholar] [CrossRef]
Figure 1. Antimony ore sorting device.
Figure 2. X-ray antimony ore image segmentation method [27].
Figure 3. Training method combining transfer learning and shallow partial initialization.
Figure 4. Comparison of segmentation results of different algorithms.
Figure 5. VGG16 classification accuracy changes with clipping rate.
Figure 6. System working principle.
Figure 7. Segmentation settings.
Figure 8. Model management.
Figure 9. Real-time monitoring.
Table 1. Parameters of samples.
Types of Antimony Ore | Weight | Grade
High-grade ore | 1.91 kg | 26.83%
Medium-grade ore 1 | 1.7 kg | 4.74%
Medium-grade ore 2 | 4.8 kg | 0.556%
Tailings | 17.9 kg | 0.024%
Table 2. The accuracy rate of different algorithms.
Method | Under-Segmentation (%) | Over-Segmentation (%) | Right Segmentation (%)
Simple concave point matching | 3.73 | 7.46 | 88.81
Watershed algorithm | 5.22 | 2.24 | 92.54
Algorithm of this article | 3.73 | 0 | 96.27
Table 3. Migration effect of different datasets.
Dataset | Accuracy | Transfer Effect
- | 85.762 | -
Mineral | 85.456 | Negative
Chest X-Ray | 85.672 | Negative
Cifar10 | 85.852 | Positive
ImageNet | 86.43 | Positive
Table 4. Antimony ore dataset.
Classes | Training Image Number | Test Image Number
Concentrate | 952 | 239
Tailings | 3464 | 867
Table 5. Classification accuracy of antimony ore between VGG16 and ResNet18.
P | 0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | 0.6 | 0.7
P_r | 0 | 0.20 | 0.36 | 0.52 | 0.64 | 0.74 | 0.84 | 0.90
D_V | 86.43 ± 0.57 | 86.61 ± 0.53 | 86.38 ± 0.39 | 86.76 ± 0.21 | 86.70 ± 0.31 | 86.63 ± 0.23 | 86.68 ± 0.42 | 86.50 ± 0.52
D_R | 85.37 ± 0.39 | 85.44 ± 0.67 | 85.58 ± 0.37 | 85.37 ± 0.28 | 85.09 ± 0.46 | 85.35 ± 0.52 | 85.33 ± 0.47 | 85.12 ± 0.49