Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information

Zhu, Hongda; Zhao, Sihai

doi:10.3390/s25103023

Open AccessArticle

Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information

by

Hongda Zhu

^1,2

and

Sihai Zhao

^1,*

¹

School of Mechanical and Electrical Engineering, China University of Mining & Technology-Beijing, Beijing 100083, China

²

Huadian Coal Industry Group Digital Intelligence Technology Co., Ltd., Beijing 102488, China

^*

Author to whom correspondence should be addressed.

Sensors 2025, 25(10), 3023; https://doi.org/10.3390/s25103023

Submission received: 11 April 2025 / Revised: 5 May 2025 / Accepted: 8 May 2025 / Published: 10 May 2025

(This article belongs to the Section Intelligent Sensors)

Download

Browse Figures

Review Reports Versions Notes

Abstract

To address the challenges posed by significant parallax, dynamic changes in monitoring camera positions, and the need for rapid wide-field image stitching in underground coal mine tunneling faces, this paper proposes a fast image stitching method for tunneling face images based on navigation and positioning data. First, using a pixel-based calculation approach, the tunneling face scene is partitioned into the cutting section and the ground, enhancing the reliability of scene segmentation. Then, the spatial distance between the camera and the cutting plane is computed based on the tunneling machine’s navigation and positioning data, and a plane-induced homography model is employed to efficiently determine the dynamic transformation matrix of the cutting section. Finally, the Dual-Homography Warping (DHW) method is applied to achieve fast panoramic image stitching of the tunneling face. Comparative experiments with three classical stitching methods, SURF, SIFT, and BRISK, demonstrate that the proposed method reduces stitching time by 60%. Field experiments in underground environments verify that this method can generate a complete panoramic stitched image of the tunneling face, providing an unobstructed perspective beyond the machine body and cutting head to clearly observe the shovel plate and surrounding ground conditions, significantly enhancing the visibility and convenience of remote operation.

Keywords:

tunneling face; navigation; positioning; image stitching

1. Introduction

As the front line of coal mining operations, tunneling faces present significant safety risks. The implementation of intelligent remote control systems has become crucial for enhancing operational safety [1,2]. These systems rely heavily on visual data, where images and videos serve as core perception tools, enabling remote monitoring and control in coal mines. A critical challenge lies in monitoring key areas such as the cutting cross-section and the ground near the shovel plate. Traditional camera setups, dispersed across machinery [3,4,5], only capture fragmented views, failing to provide a comprehensive visual representation of these regions. This limitation hampers intuitive remote operation and impedes the practical deployment of intelligent systems [6]. Therefore, stitching together images captured by multiple cameras, especially by merging the image of the cutting face with the ground image of the scoop (shovel) board area, enables workers to observe not only the cutting face but also, as if “looking through” the obstructions of the machine body and the cutting head, the images of the scoop board and the ground. This technology can provide powerful support for the intelligentization of roadheading (tunneling/driving) operations.

Traditional image stitching methods like SURF, SIFT, and BRISK all rely on detecting key points in an image, extracting distinctive features from them, and matching these features across images to align and stitch them together. SURF (Speeded-Up Robust Features) and SIFT (Scale-Invariant Feature Transform) both focus on detecting key points that are invariant to scale, rotation, and illumination changes, making them robust for matching features in various conditions. BRISK (Binary Robust Invariant Scalable Keypoints), on the other hand, is a faster alternative that uses binary descriptors for feature matching, offering efficient performance while maintaining robustness in feature detection. These methods all aim to find overlapping areas between images and seamlessly combine them into a single, larger image by using matched key points and geometric transformations.

Building on these classic algorithms, many research groups today have expanded their work on image stitching. Ren Wei proposed installing a multi-lens panoramic camera at the front end of a roadheader in an underground coal mine to acquire large-field-of-view and high-resolution images of the roadheading face. However, due to obstructions from the machine body and the cutting head, the camera is unable to observe the ground area near the scoop board, resulting in blind spots in the field of view [7]. To obtain complete images of both the cutting face and the ground, it is necessary to install cameras on both sides and in the middle of the roadheader for image stitching. Nevertheless, significant installation position deviations among these three cameras make it difficult to avoid large parallax issues during stitching [8]. To address the problem of large parallax image stitching, Gao et al. proposed the DHW (Dual-Homography Warping) method, which segments the scene image into front and back planes and employs separate transformation matrices for each plane, combined with weighting factors for stitching alignment [9]. However, this method relies on scenarios where the camera and scene positions are relatively fixed. During coal mine tunneling, the operational equipment carrying the camera is constantly moving, causing the distance between the front and back planes in the scene image to dynamically change, making the traditional DHW method difficult to apply. For dynamic scenes within coal mine roadways, Zhang Kailong proposed utilizing online calibration of camera extrinsic parameters based on image feature points, combined with a 3D model of the equipment, to dynamically segment the front and back planes of the image [10]. However, the actual working conditions in coal mine roadways are complex, with poor lighting conditions and severe dust interference, making it difficult to obtain sufficient and high-quality feature points, thereby reducing the reliability of plane segmentation. Zhang Xuhui et al. proposed using image enhancement, feature point matching, and optimal seamline methods to calculate the transformation matrices for the front and back planes and address misalignment between them, overcoming the impact of low lighting and heavy dust underground. However, their stitching process takes nearly one second, which is difficult to meet the dynamic demands of practical applications [11].

Currently popular stitching algorithms often face issues like slow feature point matching and limitations in fixed scene applications. However, during the tunneling process, both the scene and depth of field change in real time, which makes these traditional algorithms unsuitable. To address the specific requirements of large parallax, dynamic changes in the positions of monitoring cameras, and rapid stitching of large-field-of-view images at tunneling faces in underground coal mines, this paper proposes a rapid stitching method for tunneling face images based on navigation and positioning information. Initially, based on the known coordinates of the cutting face and the ground demarcation line within the tunnel coordinate system, their corresponding pixel positions in the camera image are calculated, and the tunneling face image is subsequently divided into two planes accordingly. Furthermore, utilizing the navigation and positioning information of the roadheader, the distance between the camera and the cutting face is derived, and a dynamic transformation matrix for the cutting face is computed based on a plane-induced homography model. Finally, combined with the DHW method, rapid stitching of the cutting face images is achieved.

2. Segmentation and Registration of Tunneling Face

2.1. Calculation of Image Dynamic Segmentation Line

The navigation and positioning device for the roadheader consists of two parts: a laser guidance device suspended from the tunnel roof at the rear end of the roadheader and a pose measurement device installed on the roadheader. The origin O_L of the tunnel coordinate system is located on the laser guidance device, with the X, Y, and Z directions of the coordinate system representing the lateral, tunneling, and height directions of the tunnel, respectively. The entire navigation and positioning device provides the heading angle, pitch angle, and roll angle of the roadheader body, as well as the coordinates of the roadheader in the tunnel coordinate system [12].

To obtain surveillance images of the tunneling face, three cameras are installed on the left and right sides at the front end of the roadheader and near the driver’s position in the middle, respectively. As shown in Figure 1, the multi-camera system covers the entire cutting face and ground area through its spatially distributed fields of view.

In an actual tunnel, the boundary between the ground and the cutting face approximates an ideal straight line, and the coordinates of the endpoints P₁ and P₂ of this segmentation line are known in the tunnel coordinate system O_L. If we calculate the imaging pixels of P₁ and P₂ in the cameras, the line connecting these pixels can serve as the segmentation line between the cutting face and the ground in the image.

As shown in Figure 2, taking the middle camera as an example, a calibration board is installed at the tunnel heading, and the coordinates of each corner point on the calibration board in the tunnel coordinate system can be obtained in advance through measurement. Let the coordinate of any corner point in the tunnel coordinate system be

P_{tar_in_OL}

. Meanwhile, based on the camera’s intrinsic parameters, the geometric parameters of the calibration board itself, and the pixel coordinates corresponding to the image of this corner point captured by the middle camera, its coordinate in the middle camera coordinate system can be calculated as

P_{tar_in_OCm}

.

P_{tar_in_OCm} = {(R_{Cm}^{O_{t}})}^{- 1} [{(R_{O_{t}}^{O_{L}})}^{- 1} \cdot (P_{tar_in_OL} - T_{O_{t}}^{O_{L}}) - T_{Cm}^{O_{t}}]

(1)

where

R_{O_{t}}^{O_{L}}

and

T_{O_{t}}^{O_{L}}

represent the rotation matrix and translation vector from the pose measurement device coordinate system O_t to the tunnel coordinate system O_L, which can be obtained by inverse calculation using the heading angle, roll angle, pitch angle, and spatial position coordinates provided by the roadheader navigation and positioning system.

R_{Cm}^{O_{t}}

and

T_{Cm}^{O_{t}}

represent the rotation matrix and translation vector from the middle camera coordinate system O_Cm to the pose measurement device coordinate system O_t, which can be calibrated by combining multiple corner point coordinates and Equation (1).

For the endpoints P₁ and P₂ of the segmentation line between the cutting face and the ground, their coordinates in the tunnel coordinate system are P_{L_1} and P_{L_2}, respectively, while their coordinates P_{Cm_1} and P_{Cm_2} in the middle camera coordinate system can be obtained using Equation (2).

\{\begin{matrix} P_{Cm_1} = R_{O_{t}}^{Cm} (R_{O_{L}}^{O_{t}} \cdot P_{L_1} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{Cm} \\ P_{Cm_2} = R_{O_{t}}^{Cm} (R_{O_{L}}^{O_{t}} \cdot P_{L_2} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{Cm} \end{matrix}

(2)

By combining the camera’s intrinsic parameters, we can further obtain the corresponding pixel coordinates p_m1 and p_m2 of these two points in the image captured by the middle camera with the following equation.

\{\begin{matrix} p_{m 1} = \frac{1}{P_{Cm 1_Z}} \cdot K_{Cm} \cdot P_{Cm 1} \\ p_{m 2} = \frac{1}{P_{Cm 2_Z}} \cdot K_{Cm} \cdot P_{Cm 2} \end{matrix}

(3)

where

K_{Cm}

is the intrinsic matrix of the middle camera, while

P_{Cm 1_Z}

and

P_{Cm 2_Z}

represent the distances of points P₁ and P₂, respectively, along the Z-axis direction of the middle camera. The line connecting the pixel points p_m1 and p_m2 serves as the line between the cutting face and the ground in the tunneling heading image.

This method is also applicable to the images captured by the left and right cameras. On the one hand, it can obtain the rotation matrix

R_{Lm}^{O_{t}}

and translation vector

T_{Lm}^{O_{t}}

from the left camera coordinate system O_Lm to the pose measurement device coordinate system O_t, as well as the rotation matrix

R_{Rm}^{O_{t}}

and translation vector

T_{Rm}^{O_{t}}

from the right camera coordinate system O_Rm to the pose measurement device coordinate system O_t. On the other hand, it can also calculate the segmentation lines between the cutting face and the ground in the left and right camera images, as shown in Figure 2. p_r1, p_r2 and p_l1, p_l2 are the corresponding pixel points of points P₁ and P₂ in the right and left camera images, respectively, and their connecting lines represent the segmentation lines between the cutting face and the ground in the corresponding images.

Compared to other methods that utilize image feature points to divide the front and back planes, this method directly calculates and generates the segmentation lines between the cutting face and the ground based on the coordinates of the segmentation line endpoints, thereby avoiding reliance on image quality and significantly improving the reliability of the segmentation of the tunneling heading image.

2.2. Stitching of Segmented Images

After completing the segmentation of the cutting face and the ground within the front-facing images, this section employs the DHW method for these two planes. It separately transforms the images captured by the left and center cameras into the perspective of the right camera through matrix transformation, and then stitches them with the image captured by the right camera.

2.2.1. Stitching of Ground Plane

As shown in Figure 1, the heights of the left, center, and right cameras remain basically unchanged relative to the roadway ground. Therefore, once the camera installation positions are fixed, the homography matrices

H_{l_ground}^{r_ground}

and

H_{c_ground}^{r_ground}

for perspective transformation from the left and center cameras to the right camera are also fixed. These matrices can be pre-calibrated by setting multiple feature points in the common ground area of the cameras and performing feature point matching.

2.2.2. Stitching of Cutting Face

Unlike the ground, the distance between the cutting face and the cameras varies with the movement of the roadheader, leading to dynamic adjustments in the transformation matrices between images captured by different cameras according to this distance. To address the dynamically changing transformation matrices, this paper adopts a plane-induced homography model, which is rapidly calculated by integrating the internal and external parameters of the cameras, the pose transformation matrices between the coordinate systems of each camera, the normal vector of the cutting face in the camera coordinate system, and the distance from the camera to the cutting face [13].

Taking the left camera as an example, the homography matrix

H_{Lm_wall}^{Rm_wall}

of the cutting face portion within its image relative to the cutting face portion in the right camera’s image can be calculated by Equation (3).

H_{Lm_wall}^{Rm_wall} = K_{Lm} \cdot R_{Lm}^{Rm} (I + \frac{T_{Lm}^{Rm} \cdot n_{wall}^{Rm}}{d_{wall}^{Rm}}) \cdot K_{Rm}^{- 1}

(4)

where

K_{Lm}

and

K_{Rm}

are the intrinsic matrices of the left and right cameras, respectively;

R_{Lm}^{Rm}

and

T_{Lm}^{Rm}

are the rotation matrix and translation matrix from the left camera to the right camera, respectively; I is the identity matrix;

n_{wall}^{Rm}

is the normal vector of the cutting face in the right camera coordinate system; and

d_{wall}^{Rm}

is the distance from the cutting face to the origin of the right camera coordinate system.

Based on the coordinate system transformation relationship,

R_{Lm}^{Rm}

and

T_{Lm}^{Rm}

can be obtained according to the positional relationships between the left and right camera coordinate systems and the pose measurement device coordinate system provided in Section 2.1.

\{\begin{matrix} R_{Lm}^{Rm} = {(R_{Rm}^{O_{t}})}^{- 1} \cdot R_{Lm}^{O_{t}} \\ T_{Lm}^{Rm} = {(R_{Rm}^{O_{t}})}^{- 1} \cdot T_{Lm}^{O_{t}} + T_{O_{t}}^{Rm} \end{matrix}

(5)

d_{wall}^{Rm}

and

n_{wall}^{Rm}

can be determined based on several feature points (such as P₁, P₂, etc.) on the cutting face and the coordinate values of the right camera in the roadway coordinate system O_L [13,14]. The specific calculation steps are as follows: As shown in Figure 2, four feature points P₁, P₂, P₃, and P₄ are selected on the cutting face (P₁ and P₂ are the endpoints of the intersection line between the cutting face and the ground, and P₃ and P₄ are two feature points on the cutting face directly above P₁ and P₂ at a height h). Their coordinates in the roadway coordinate system O_L are known, denoted as P_{L_1}, P_{L_2}, P_{L_3}, and P_{L_4}, respectively. By combining the rotation matrix

R_{O_{t}}^{R m}

and translation vector

T_{O_{t}}^{R m}

between the pose measurement device and the right camera obtained through calibration in Section 2.1, as well as the pose parameters provided by the navigation and positioning system, their coordinates in the right camera coordinate system can be calculated.

\{\begin{matrix} P_{R m_1} (x_{1}, y_{1}, z_{1}) = R_{O_{t}}^{R m} [R_{O_{L}}^{O_{t}} (P_{L_1} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{R m}] \\ P_{R m_2} (x_{2}, y_{2}, z_{2}) = R_{O_{t}}^{R m} [R_{O_{L}}^{O_{t}} (P_{L_2} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{R m}] \\ P_{R m_3} (x_{3}, y_{3}, z_{3}) = R_{O_{t}}^{R m} [R_{O_{L}}^{O_{t}} (P_{L_3} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{R m}] \\ P_{R m_4} (x_{4}, y_{4}, z_{4}) = R_{O_{t}}^{R m} [R_{O_{L}}^{O_{t}} (P_{L_4} + T_{O_{L}}^{O_{t}}) + T_{O_{t}}^{R m}] \end{matrix}

(6)

Subsequently, the normal vector

n_{wall}^{Rm}

of the cutting face formed by the four points P₁, P₂, P₃, and P₄ in the right camera coordinate system and the distance

d_{wall}^{Rm}

from the cutting face to the origin of the right camera coordinate system can be obtained.

n_{wall}^{Rm} (x, y, z) = \{\begin{matrix} x (x_{1} - x_{2}) + y (y_{1} - y_{2}) + z (z_{1} - z_{2}) = 0 \\ x (x_{1} - x_{3}) + y (y_{1} - y_{3}) + z (z_{1} - z_{3}) = 0 \\ x (x_{1} - x_{4}) + y (y_{1} - y_{4}) + z (z_{1} - z_{4}) = 0 \end{matrix}

(7)

d_{wall}^{Rm} = \frac{|(O_{Rm} - P_{rm_1}) \cdot n_{wall}^{Rm}|}{|n_{wall}^{Rm}|}

(8)

By substituting the results from Equations (5)–(8) into Equation (4), the dynamic homography matrix

H_{Lm_wall}^{Rm_wall}

of the left camera relative to the right camera can be calculated. Similarly, the homography matrix

H_{Cm_wall}^{Rm_wall}

of the cutting face portion within the middle camera image relative to the right camera can also be determined.

The aforementioned process demonstrates that, given the camera parameters, the proposed method in this paper can quickly solve for the transformation matrix of the cutting face by combining the coordinate information of the cutting face provided by the navigation and positioning system and the distance from the camera to the cutting face. This approach avoids the complex process in traditional algorithms of first selecting feature points, performing registration, and then calculating the homography matrix through SVD decomposition, thereby significantly improving real-time performance.

2.2.3. Stitching of the Overall Image

After completing image segmentation and the calculation of transformation matrices, when employing the DHW method for final stitching, it is necessary to calculate the weight of each pixel based on its position in the image and perform overlay fusion. The process is illustrated in Figure 3.

Assuming there exists a pixel point p in the left image, the transformation matrix H_p used to convert it to the right camera’s perspective can be calculated according to the following formula.

H_{p} = (1 - ω_{p}) \cdot H_{Lm_wall}^{Rm_wall} + ω_{p} \cdot H_{Lm_ground}^{Rm_ground}

(9)

where

ω_{p}

is the weight corresponding to the position of point p in the image, and the calculation method is as follows.

ω_{p} = \frac{L_{p_wall}}{L_{p_wall} + L_{p_ground}}

(10)

L_{p_wall}

represents the distance from pixel point p to the nearest feature point on the cutting face, and

L_{p_ground}

represents the distance from pixel point p to the nearest feature point on the ground. Since the segmentation line between the cutting face and the ground has been clearly marked in this paper, when point p is on the cutting face,

L_{p_wall} = 0

, and at this time,

ω_{p} = 0

; similarly, when point p is on the ground,

L_{p_ground} = 0

, and at this time,

ω_{p} = 1

.

After performing perspective transformation on the left camera image using Equation (9), it is stitched and fused with the right camera image. The position of the image stitching seam (as indicated by the annotated stitching transition region in the figure) can achieve a smooth transition through linear weight blending [15].

After completing the stitching of the left and right camera images, this stitched image is then fused with the middle camera image. Since, in practical scenarios, the middle camera is usually installed at a higher position above the ground compared to the left and right cameras, the cutting face portion occupies a larger proportion in its image. Therefore, only the cutting face portion (above the dynamic segmentation line) in the middle camera image is selected, subjected to perspective transformation using the dynamic homography matrix

H_{Cm_wall}^{Rm_wall}

and then fused and stitched a second time with the previously stitched image of the left and right cameras to ultimately obtain a complete stitched image of the three cameras, as shown in Figure 4 below.

3. Experimental Verification

3.1. Introduction to the Simulation Experimental System

To verify the stitching effectiveness and efficiency of the method proposed in this paper, a simulation experimental environment for roadways was established indoors, as shown in Figure 5 below, by referencing actual underground tunneling face scenarios.

As shown in Figure 5a, the corridor section is assumed to be a tunneling roadway, with the wall directly ahead (enclosed by the red line segment) being the cutting face. A trolley in the middle of the roadway simulates the tunneling machine, equipped with a navigation and positioning system for the tunneling machine that provides real-time location and attitude information of the tunneling machine within the roadway. With reference to actual working conditions, the laser guidance device (i.e., the origin of the roadway coordinate system O_L) in the tunneling machine navigation and positioning system is located approximately 18 m behind the machine in the middle of the roadway. The coordinates of endpoints P₁ and P₂, which are the intersection points between the corresponding simulated cutting face and the ground plane, are known in O_L.

In Figure 5b, the left and right cameras are mounted on the left and right sides of the simulated tunneling machine, respectively, with a horizontal downward viewing angle of approximately 25° towards the ground and a height of approximately 50 cm above the ground. The middle camera is located in the middle of the machine body, with a horizontal viewing angle towards the cutting face and a height of approximately 100 cm above the ground. The simulated roadway images captured by the three cameras are shown in Figure 6. We deployed pieces of A4 paper sheets on the low-textured floor surface as artificial reference markers to demonstrate the quality of the image stitching result.

3.2. Three-Camera Image Stitching Process

Following the method described in Section 2, the specific stitching process is as follows:

(1) Calculation of Image Segmentation Lines

First, the rotation matrices and translation vectors from the pose measurement device coordinate system O_t to each camera coordinate system are calibrated using the method in Section 2.1. Simultaneously, given the coordinates of the two endpoints P₁ and P₂ of the boundary line between the cutting face and the ground plane in O_L, the segmentation lines between the cutting face and the ground in each camera image are calculated and displayed in the corresponding images, as shown in Figure 6.

(2) Image Fusion and Stitching of Left and Right Cameras

As described in Section 2.2, the transformation matrix

H_{Lm_ground}^{Rm_ground}

from the left camera’s ground portion to the right camera’s ground portion is calculated using common feature points in the left and right camera images, such as the marked points shown in Figure 7.

Four feature points, P₁, P₂, P₃, and P₄, are selected on the cutting face. Among them, P₁ and P₂ are points on the segmentation line between the cutting face and the ground, while P₃ and P₄ are two feature points on the cutting face located 1 m directly above P₁ and P₂, respectively. The coordinates of these four points are substituted into Equations (6)–(9) to calculate the dynamic transformation matrix

H_{Lm_wall}^{Rm_wall}

of the left camera relative to the right camera and the dynamic transformation matrix

H_{Cm_wall}^{Rm_wall}

of the middle camera relative to the right camera. After applying a homography matrix weighted transformation to the left image using Equation (9), it is stitched with the right camera image. Figure 8 demonstrates the stitching effect of the left camera image transformed using either the cutting face homography matrix or the ground homography matrix individually and then stitched with the right camera image, along with a comparison to the stitching effect using the method proposed in this paper. As shown in Figure 8, if only the homography matrix corresponding to the cutting face is used, misalignment occurs in the ground part of the stitched image; if only the homography matrix corresponding to the ground is used, large-scale deformation occurs in the cutting face part of the stitched image; thus, using the segmentation and weighted transformation method proposed in this paper significantly improves the stitching effect.

(3) Three-camera image stitching

The dynamic homography matrix

H_{Cm_wall}^{Rm_wall}

from the middle camera to the right camera is used to perform perspective transformation on the cutting face section, which is then stitched with the fused result of the left and right camera images as mentioned above. Ultimately, a complete stitched image from the three-camera fusion is obtained, as shown in Figure 9. It can be observed that the stitched image eliminates the obstruction of the roadheader’s own components to the line of sight, presenting a complete view of the heading face scene.

3.3. Time Consumption Performance Analysis

To verify the effectiveness of the proposed algorithm in terms of time efficiency, three classic stitching algorithms—SURF [16], SIFT [17], and BRISK [18]—were employed to perform full-image stitching based on the left and right camera images in Figure 8. The computational time of each method was compared (using the average time taken over 10 executions of each method), as shown in Table 1 below.

The stitching quality of the three traditional methods (SURF, SIFT, and BRISK) is comparable to the method proposed in this paper, and all of them can meet the requirements for tunneling monitoring. However, the algorithm presented in this paper has a clear advantage in terms of time consumption. Compared with classic stitching algorithms, the computational time is reduced by over 60%. Under the same hardware acceleration conditions, the proposed algorithm is more capable of meeting real-time monitoring requirements.

4. Verification of Real Coal Mine Tunneling Scenario

The proposed method in this paper was practically validated in a real coal mine tunneling scenario, as shown in Figure 10 below. The left and right cameras were installed on the left and right sides of the front end of the roadheader body, approximately 1 m above the ground and tilted downward at an angle of approximately 25° to monitor the scraper board and the ground in front of it. The middle camera was installed near the driver’s position, approximately 2 m above the ground, to monitor the cutting face section.

After pre-calibrating the transformation relationship between the camera and the inertial navigation system on the roadheader body, the perspective of the middle camera is used as the stitching reference. The images captured by the three cameras, as well as the segmentation lines between the cutting cross-section and the ground in each camera image, are shown in Figure 11 below.

The stitching process of images from the three cameras is illustrated in Figure 12. It can be observed that in the original, unstitched images, the field of view of the middle camera only captures partial information of the cutting cross-section, while the ground information below the cutting head is completely obscured. However, after stitching, not only can the complete image information of the cross-section be observed, but it is also possible to “see through” the cutting head to view the full scoop plate and ground information beneath it. Meanwhile, we also conducted experiments on the continuous image stitching of the underground tunneling process. The experimental results show that both the stitching quality and speed meet the monitoring requirements for the underground tunneling process. Since navigation and position information are employed, the increase in interference such as dust and low lighting does not reduce the processing speed of the algorithm presented in this paper. These features significantly enhance the visibility and operational precision of remote operations, particularly in assisting with machine relocation and clearing loose coal on the floor, holding important application value.

5. Conclusions

In response to the special requirements of large parallax in underground coal mine tunneling faces, dynamic changes in the positions of surveillance cameras, and rapid stitching of large-field-of-view images, this paper proposes a rapid stitching method for tunneling head images based on navigation and positioning information. This method first utilizes tunnel coordinate information to divide the tunneling head scene into the cutting cross-section and the ground through pixel calculations, enhancing the reliability of image segmentation. Subsequently, it leverages tunneling machine navigation and positioning information to calculate the spatial distance between the camera and the cutting plane and employs a plane-induced homography model to efficiently solve for the dynamic homography matrices of the cutting cross-section and the ground. Finally, the DHW method is adopted to achieve rapid stitching of tunneling head images.

By comparing this method with three classical stitching methods (SURF, SIFT, and BRISK), the results demonstrate a 60% reduction in stitching time. Application validation has been conducted under actual working conditions in coal mines, yielding complete stitched images of tunneling heads. This not only allows workers to observe the entire working face area from a unified perspective but also enables them to “see through” the machine body and cutting head to observe the scoop plate and the ground nearby, greatly facilitating remote machine movement and float coal cleanup operations by workers. This method holds high engineering application value.

Author Contributions

Conceptualization, H.Z. and S.Z.; methodology, S.Z.; software, H.Z.; validation, H.Z. and S.Z.; formal analysis, H.Z. and S.Z.; investigation, H.Z. and S.Z.; data curation, H.Z.; writing—original draft preparation, H.Z.; writing—review and editing, S.Z.; visualization, H.Z.; supervision, S.Z.; funding acquisition, S.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China grant number 52474187.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data and code of this work will be available from the corresponding author upon reasonable request.

Conflicts of Interest

Author Hongda Zhu was employed by the company Huadian Coal Industry Group Digital Intelligence Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Wang, G.F.; Zhang, L.; Li, S.B.; Li, S.; Feng, Y.h.; Meng, L.Y.; Nan, B.F.; Du, M.; Fu, Z.; Li, R.; et al. Progress in theory and technology research of unmanned intelligent mining systems in coal mines. J. China Coal Soc. 2023, 48, 34–53. [Google Scholar]
Wang, H.; Wang, B.K.; Zhang, X.F.; Li, F.Q. Key technology and engineering practice of intelligent rapid heading in coal mine. J. China Coal Soc. 2021, 46, 2068–2083. [Google Scholar]
Zhang, X.H.; Yang, H.Q.; Bai, L.N.; Shi, S.; Du, Y.Y.; Zhang, C.; Wan, J.C.; Yang, W.J.; Mao, Q.H. Research on low illumination video enhancement technology in coal mine heading face. J. Coal Geol. Explor. 2023, 51, 309–316. [Google Scholar]
Cheng, J.; Li, H.; Ma, K.; Liu, B.; Sun, D.Z.; Ma, Y.Z.; Yin, G.; Wang, G.F.; Li, H.P. Architecture and key technologies of coalmine underground vision computing. Coal Sci. Technol. 2023, 51, 202–218. [Google Scholar]
Wang, J.C.; Pan, W.D.; Zhang, G.Y.; Yang, S.L.; Yang, K.H.; Li, L.H. Principles and applications of image-based recognition of withdrawn coal and intelligent control of draw opening in longwall top coal caving face. J. China Coal Soc. 2022, 47, 87–101. [Google Scholar]
Gao, X.B. Research on key technology of remote visual control in fully-mechanized heading face. Coal Sci. Technol. 2019, 47, 17–22. [Google Scholar]
Ren, W. Development and application of multi view panoramic camera in fully mechanized mining face. Coal Eng. 2022, 54, 102–108. [Google Scholar]
Xia, D.; Zhou, R. Survey of Parallax Image Registration Technology. Comput. Eng. Appl. 2021, 57, 18–27. [Google Scholar]
Gao, J.; Kim, S.J.; Brown, M.S. Constructing image panoramas using dual-homography warping. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Colorado Springs, CO, USA, 20–25 June 2011; pp. 49–56. [Google Scholar]
Zhang, K.L. Research and application of panoramic video remote control technology for intelligent fully mechanized mining face. China Coal 2023, 49, 70–81. [Google Scholar] [CrossRef]
Zhang, X.H.; Wang, Y.; Yang, W.J.; Chen, X.; Zhang, C.; Huang, M.; Liu, Y.H.; Yang, J.H. A mine image stitching method based on improved best seam-line. Ind. Min. Autom. 2024, 50, 9–17. [Google Scholar]
Wang, P.P.; Li, R.; Liu, X.; Li, X.; Fu, C.L. A positioning solution method for roadheader under optical target occlusion conditions. Ind. Min. Autom. 2024, 50, 118–124. [Google Scholar]
Sun, H.X.; Luo, J.X.; Pan, Z.S.; Zhang, Y.Y.; Zheng, Y.J. A Method for Solving Homography Matrix Based on Constrained Total Least Squares. Comput. Technol. Dev. 2022, 32, 50–56. [Google Scholar]
Deng, S.C.; Jiang, Y.L.; Gao, X.Y. Parameter Calibration of Line Structured Light Vision Sensor Based on Plane Normal Vectors. Mod. Mach. Tool Autom. Manuf. Technol. 2023, 7, 69–72. [Google Scholar]
He, J.H.; Wu, B.; Zhang, H.Y. Fast image stitching based on similarity of invariant moments. Microcomput. Appl. 2017, 36, 50–53. [Google Scholar]
Wang, Z.J.; Chao, Y.F. Image registration algorithm using SURF feature and local cross correlation information. Infrared Laser Eng. 2022, 51, 492–497. [Google Scholar]
Xia, X.H.; Zhao, Q.; Xiang, H.T.; Qin, X.F.; Yue, J.P. SIFT feature extraction method for the defocused blurred area of multi-focus images. Opt. Precis. Eng. 2023, 31, 3630–3639. [Google Scholar] [CrossRef]
Du, G.; Hou, L.Y.; Tong, Q.; Yang, D.L. Image mosaicing based on BRISK and improved RANSAC algorithm. Liq. Cryst. Disp. 2022, 37, 758–767. [Google Scholar] [CrossRef]

Figure 1. Schematic diagram of the roadheader working face and camera view.

Figure 2. Calibration method for camera coordinate system and tunnel coordinate system.

Figure 3. Schematic diagram of the stitching principle for left and right camera images.

Figure 4. Schematic diagram of stitching for the fusion image of the middle camera with the left and right cameras.

Figure 5. Construction of a simulation experiment system. (a) Schematic diagram of the simulated roadway. (b) Installation of simulated tunneling machine and sensors.

Figure 6. Simulated roadway images captured by the three cameras with segmentation lines marked. (a) The image captured by the left camera. (b) The image captured by the middle camera. (c) The image captured by the right camera. (d) Final stitched image.

Figure 7. Selection of matching feature points in Figure 7’s left and right camera images.

Figure 8. Stitching of left and right camera images.

Figure 9. Stitching of fused images from the middle camera and the left and right cameras.

Figure 10. Camera position layout at the heading face.

Figure 11. Preset feature points of the cutting cross-section and plane segmentation lines.

Figure 12. Overall stitching result in a mine.

Table 1. Time performance analysis of different algorithms.

Algorithm	Proposed Algorithm	SURF	SIFT	BRISK
Time (ms)	129.1	389.8	845.3	349.5

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhu, H.; Zhao, S. Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information. Sensors 2025, 25, 3023. https://doi.org/10.3390/s25103023

AMA Style

Zhu H, Zhao S. Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information. Sensors. 2025; 25(10):3023. https://doi.org/10.3390/s25103023

Chicago/Turabian Style

Zhu, Hongda, and Sihai Zhao. 2025. "Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information" Sensors 25, no. 10: 3023. https://doi.org/10.3390/s25103023

APA Style

Zhu, H., & Zhao, S. (2025). Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information. Sensors, 25(10), 3023. https://doi.org/10.3390/s25103023

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Research on a Rapid Image Stitching Method for Tunneling Front Based on Navigation and Positioning Information

Abstract

1. Introduction

2. Segmentation and Registration of Tunneling Face

2.1. Calculation of Image Dynamic Segmentation Line

2.2. Stitching of Segmented Images

2.2.1. Stitching of Ground Plane

2.2.2. Stitching of Cutting Face

2.2.3. Stitching of the Overall Image

3. Experimental Verification

3.1. Introduction to the Simulation Experimental System

3.2. Three-Camera Image Stitching Process

3.3. Time Consumption Performance Analysis

4. Verification of Real Coal Mine Tunneling Scenario

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI