Next Article in Journal
SRDFNet: Semantic Refinement and Differential Features for High-Resolution Change Detection
Previous Article in Journal
CoRe: Joint Optimization with Contrastive Learning for Medical Image Registration
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Visual Alignment Method for Hoisting Prefabricated Segmented Beams

1
CCCC Second Harbor Engineering Company Ltd., Wuhan 430040, China
2
School of Transportation and Logistics Engineering, Wuhan University of Technology, Wuhan 430063, China
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(11), 3426; https://doi.org/10.3390/s26113426
Submission received: 4 March 2026 / Revised: 2 May 2026 / Accepted: 12 May 2026 / Published: 28 May 2026
(This article belongs to the Section Sensing and Imaging)

Abstract

During the hoisting of prefabricated segmented beams, the alignment of rods and holes mainly relies on manual operation, which suffers from low safety and efficiency. To improve the safety and efficiency of rod–hole alignment, this paper proposes a vision-based alignment method for hoisting prefabricated segmented beams. The method uses binocular vision to measure the spatial coordinates of key points on rods and holes, establishes a mathematical model for alignment, and calculates the center distance and relative rotation angle between them. An experimental platform is built and tests are conducted. The results show that the proposed method can effectively measure the center distance and rotation angle, improve measurement efficiency and safety, achieve high accuracy, and possess high practical engineering value.

1. Introduction

In the field of modern bridge construction, segmented beam assemblies, as an efficient, high-quality and environmentally friendly construction method, are increasingly being widely applied [1]. This construction technique divides a bridge into several sections, which are prefabricated in factories or prefabrication sites and then transported to the construction site for assembly. This significantly shortens the on-site construction period, reduces the impact on the surrounding environment, and simultaneously enhances the construction quality and structural stability of the bridge. This method is widely used in bridge construction projects, such as urban bridges, highway bridges, and cross-sea bridges. For example, in urban elevated bridge construction, segmented beam assemblies enable rapid construction in confined urban spaces and minimize traffic disruption. In the construction of large-scale cross-river and cross-sea bridges, they help improve construction efficiency and reduce construction risks [2].
During the assembly process of segmented beams, the alignment of rods and holes is a crucial step, directly affecting the stability and safety of the entire bridge structure. If the rods are not accurately aligned with the holes, this may lead to uneven force distribution on the rod and local stress concentration, thereby affecting the service life of the rod and even causing serious problems, such as deformation and cracking of the bridge structure, threatening the operational safety of the bridge. The traditional methods for aligning rods and holes, such as manual measurement and adjustment, are not only inefficient but also have difficulties in guaranteeing accuracy. With the continuous expansion of bridge construction scales and the improvement of technical standards, in-depth research on the alignment methods of rods and holes has become an important issue that urgently needs to be addressed in the field of bridge construction. In the face of complex construction environments and high-precision bridge construction requirements, the exploration of more efficient, precise, and intelligent alignment technologies is needed. This has practical significance for promoting the advancement of bridge construction technology and ensuring the quality of bridge engineering [3].
A schematic diagram of the hoisting of segmented beams is shown in Figure 1. There are six square rods on the lifting gear, and corresponding square holes are provided on the segmented beams. After the precise alignment of hole positions with the rod is completed, the control system will insert the rod into the hole and fix them. The cross-sections of both the rod and the hole are square structures. To ensure that the square rod can be accurately embedded into the square hole, the side length of the hole is usually 10 mm longer than that of the rod.
In the early stage of the development of segmented beam assembly technology, the alignment of rods and holes mainly relied on manual operation. In this way, construction workers first used their experience and simple measuring tools to manually adjust the spacing between the rods to roughly match the spacing of the holes on the segmented beams. In actual operation, workers stood on the top surface of the segmented beams or used simple climbing equipment to manually push the rods closer to the holes. Through continuous observation and fine tuning, the rods were gradually aligned with the holes. When the rods approached the holes, workers needed to adjust the angle and position more carefully to ensure that the rods could be accurately inserted into the holes. After the alignment was completed, fastening anchors were manually installed to fix and connect the rods to the segmented beams.
This manual alignment method has many drawbacks. In terms of efficiency, the entire alignment process requires the collaborative operation of multiple workers, and each adjustment is time and labor consuming, resulting in slow assembly progress of the segmented beams. In some early bridge segmented beams assembly projects, it might take several hours or even longer to align just one set of rods and holes, seriously affecting the overall progress of the project.
In terms of safety, workers are required to operate at high altitudes and in narrow working spaces; during the operation process, they also need to frequently move and adjust the rods. Thus, there exist significant safety risks. For instance, workers may encounter accidents due to accidental falls or being hit by the rods.
As for alignment accuracy, manual operation is greatly influenced by factors such as workers’ experience, skill level, and fatigue, making it difficult to ensure that each alignment meets the high-precision requirements. This can easily lead to certain deviations between rods and holes, thereby affecting the installation quality of the segmented beams and the stability of the bridge structure. If there is a deviation in the alignment of rods and holes, during the subsequent use of the bridge, there will be uneven local force distribution on the rods, resulting in varying degrees of wear and deformation of the rods, which increases the maintenance cost and safety hazards of the bridge [4,5].
With the continuous development of bridge construction technologies and the increasing demand for construction efficiency, semi-automatic auxiliary alignment technology has gradually emerged on the basis of manual alignment. This technology introduces some simple mechanical equipment to assist construction workers in aligning rods and holes. Among them, the simple sliding device is one of the more common auxiliary devices. It is usually composed of a track, a slider and a drive device. The track is installed on the segmented beams or lifting gear, the slider can slide freely on the track, and the rods are installed on the slider. Through the drive device, such as a small motor or hydraulic device, the slider can move on the track, thereby driving the hanger to adjust its position in the horizontal direction. The construction workers only need to control the drive device to conveniently adjust the position of the rods to approach the holes and then manual fine tune to complete the final alignment [6,7].
Compared with the early pure manual alignment method, semi-automatic auxiliary alignment technology has made remarkable progress. In terms of efficiency, thanks to mechanical equipment, the position adjustment of rods is more rapid and convenient, significantly reducing the time required for alignment. Taking a real project as an example, after adopting semi-automatic auxiliary alignment technology, the alignment time for each group of rods and holes is shortened from several hours to tens of minutes, which improves construction efficiency several times. In terms of labor intensity, the workload of workers manually pushing rods has been reduced, alleviating their labor intensity and enabling them to operate in a relatively relaxed environment. However, semi-automatic auxiliary alignment technology also has certain limitations. In terms of alignment accuracy, although it has been improved compared with pure manual methods, it is still difficult to meet the requirements of some high-precision bridge construction projects. In particular, when dealing with complex segmented beam structures and strict design standards, certain deviations may still occur. Semi-automatic auxiliary alignment technology relies to a certain extent on the normal operation of mechanical equipment. If the equipment malfunctions, it may lead to construction interruption and affect the project progress [8]. In recent years, with the rapid development of advanced sensor technology and automatic control technology, intelligent and automatic alignment methods have gradually emerged in the assembly of segmented beams and have been widely applied [9,10,11,12,13]. These methods utilize advanced sensors, such as laser sensors and vision sensors, to obtain the position information of rods and holes in real time. For instance, by leveraging machine vision recognition technology, the position, shape and posture of the holes can be identified and analyzed through cameras installed on the lifting gear or segmented beams, and the obtained image information is transmitted to the control system. The control system processes and analyzes the data collected by the sensors based on advanced algorithms, such as pattern recognition algorithms and image processing algorithms [14,15,16]. The position deviation and angle deviation between rods and holes are calculated using these algorithms. Based on the calculation results, the automatic control actuators, such as electric push rods and hydraulic cylinders, precisely adjust the position and angle of the rods to achieve automatic alignment between rods and holes. In some large-scale bridge construction projects, intelligent and automated alignment systems can complete high-precision alignment of rods and holes in a short time, greatly improving construction efficiency and quality.
Intelligent and automated alignment methods have many significant advantages. In terms of accuracy, they can achieve millimeter-level or even higher precision alignment, effectively ensuring the installation quality of the segmented beams, reducing structural hazards caused by alignment deviations, and enhancing the overall stability and safety of the bridge. In terms of efficiency, the automated operation process significantly reduces the alignment time. Compared with traditional manual or semi-automatic alignment methods, the efficiency is increased several times or even dozens of times, thus meeting the demands of large-scale and high-efficiency bridge construction. From the perspective of safety guarantee, these methods reduce the working time and workload of workers in high-altitude and dangerous environments, lower construction safety risks, and ensure the personal safety of construction personnel. Intelligent and automated alignment methods can also achieve real-time monitoring and recording of data, facilitating quality traceability and management of the construction process, and providing strong support for the intelligent and digital development of bridge construction.
Traditional rod–hole alignment relies on manual naked-eye observation or monocular vision, which has obvious accuracy defects. Manual alignment is easily affected by occlusion and experience differences. Monocular vision can only obtain two-dimensional plane information and cannot determine the depth deviation between rods and holes. The three-dimensional positioning capability of binocular vision can better solve this problem [17]. In rod–hole alignment, the binocular vision method can output three-dimensional deviation values in real time, guiding actuators to make precise adjustments. This avoids problems such as the shear deformation of rods and stress concentration at the edge of holes caused by alignment deviation, and it significantly reduces long-term maintenance costs.
To meet the demand for the intelligent and digital development of precast segmented beam assembly, upgrade the unmanned and intelligent level of beam hoisting and lowering, and enhance operational safety, this paper proposes a binocular vision-based hoisting alignment method for precast segmented beams. In response to the problems of high dependence on manual operation, low automation, insufficient alignment accuracy, and significant safety risks in traditional segmented beam hoisting operations, the proposed method uses binocular vision to realize the spatial perception of key components. By acquiring the spatial coordinates of the matching holes and rods in real time, it establishes a mathematical model for attitude analysis and calculates the center distance and relative rotation angle between rods and holes. This method effectively improves the precision, efficiency, and safety of the alignment process, reduces human errors and construction risks, and provides technical support for the intelligent and unmanned construction of segmented beam assembly.
This paper has the following innovative points: (1) A stereo camera calibration method based on constraint conditions is proposed. (2) It presents the calculation methods for the center distance and rotation angle in rod–hole alignment. (3) A stereo camera layout method based on the vision-based process of aligning rods and holes is proposed. The organization of this paper is as follows: Section 1 introduces the current research status of rod–hole alignment methods. Section 2 introduces the visual measurement methods for holes and rods. Section 3 establishes an engineering experimental platform and presents the experiments. Section 4 summarizes the full paper.

2. Method

2.1. Calibration Method for Stereo Cameras Based on Constraint Conditions

To obtain the three-dimensional data of the object to be measured, a binocular vision measurement model [18,19] is required. As shown in Figure 2, the principle of binocular vision measurement is illustrated. Point P is a spatial point, and the projection points of P on the two images are p r and p l , respectively. Theoretically, points P , p l , and O l are collinear, and points P , p r , and O r are also collinear.
The projections of point P on left image and right image are p l and p r , respectively. Points P , p l and p r satisfy the following relationship:
P l = R l P w + t l P r = R r P w + t r
Among them, R l is the rotation matrix of the left camera, t l is the translation matrix of the left camera, R r is the rotation matrix of the right camera, and t r is the translation matrix of the right camera.
The point P in the left and right camera coordinate systems can be associated by the rotation matrix and the translation matrix, and the calculation formulas are as follows:
P l = R d ( P r t d )
Among them, R d and t d are the rotation matrix and translation matrix between the left and right cameras, respectively. Therefore, the following relationship can be derived:
R d = R r ( R l ) t d t d = t t R d t l
The binocular vision measurement system uses two cameras to simulate the positioning function of the human eye, and it determines the three-dimensional coordinates of spatial points by the spatial posture of the cameras and image points. Generally speaking, for the convenience of calculation, the world coordinate system is superimposed with one of the camera coordinate systems. The projection matrices of the left and right cameras are as follows, respectively:
M L = K L I 0 M R = K R R t
Among them, K L and K R are the left and right camera intrinsic parameter matrices, respectively, and I is a 3 × 3 identity matrix. After obtaining the intrinsic and extrinsic parameters of the two cameras through the calibration method, the three-dimensional coordinates of the corresponding spatial points can be calculated based on the corresponding image points on the left and right images.
With the above equations, the relationship between the left and right cameras can be obtained as follows.
s r u r v r 1 = f r r 1 f r r 2 f r r 3 f r t x f r r 4 f r r 5 f r r 6 f r t y r 7 r 8 r 9 t z z u l / f l z v l / f l z 1
Through Equation (5) and Figure 2, the space coordinate of point P can be expressed by the following equation.
x = z u l / f l
y = z v l / f l
z = f l f r t x u r t z u r r 7 u l + r 8 v l + f l r 9 f r r 1 u l + r 2 v l + r 3 f l = f l f r t y v r t z v r r 7 u l + r 8 v l + f l r 9 f r r 4 u l + r 5 v l + r 6 f l
Among them, f l and f r are the scale factors of the left and right cameras; r 1 r 9 are the coefficients. Since t x is not equal to 0, T = ( 1 , t y , t z ) T , so Equations (6)–(8) can be expressed as f ( x ) = 0 . The specific equation is as follows.
f r u r t l z r 4 u l + r 5 v l + f l r 6 f r t y v r t l z r 1 u l + r 2 v l + f l r 3 v r u r t l y r 7 u l + r 8 v l + f l r 9 = 0
x = t y , t z , r 1 , r 2 , r 3 , r 4 , r 5 , r 6 , r 7 , r 8 , r 9 T
The length of the reference line can be calculated from the following equation.
f l 2 L 2 = z 1 u l 1 z 49 u l 49 2 + z 1 v l 1 z 49 v l 49 2 + f l 2 z 1 z 49 2
If the calibration board has multiple reference lines, there is the following equation, whose lengths are L i . To minimize the overall error of the measurement, the constraints of reference lines need to be comprehensively considered, so the translation vector T is as follows.
T = T * / L * ( 1 n L i ) / n
Among them, n represents the number of reference lines.

2.2. Epipolar Geometric Search

The core idea of standard binocular system measurement is to rectify the left and right images using the calibration parameters of the two cameras and then calculate the depth information of the object to be measured via geometric relationships [20,21,22]. Figure 3 shows the correction schematic of the binocular system. Points O L and O R are the projection centers of the cameras, and the line connecting them O L O R is the baseline. P represents the spatial point to be measured, and its projections on the images are P l and P r , respectively. The planes of the left and right images are denoted as plane ß L and plane ß R , respectively. The intersection lines h l e l and h r e r between the epipolar plane PO L O R and the planes ß L and ß R are called epipolar lines. Point Q lies on the line O L P , and its projection on plane ß L is also P l . The projection of point P onto plane ß R must lie on the line h r e r . This is the epipolar constraint, which forms the basis of stereo matching. The epipolar constraint restricts the search and matching of pixels to a certain line segment in the image, thus reducing the computational load of pixel matching. By using the epipolar constraint, the corresponding points can be quickly located in the other image.
According to the geometric relationship, there are the following coplanar equations on the epipolar plane:
B x B y B z x l y l f x e 1 y e 1 f = 0
Among them, B x , B y and B z are the baseline components; x l and y l are the image plane coordinates of point P l which are known; and f is the focal length of the camera. The relationship between x l and y l is as follows:
y l = ( A / B ) · x l + ( C / B ) · f
Among them,
A = f · B l + y l · B z B = f · B l + x l · B z C = y l · B x x l · B y
For the epipolar line of the right image, the coplanar conditional equation can also be used to solve it:
B x B y B z x r y r f x e 2 y e 2 f = 0
Among them,
[   B x     B y     B z   ] = [   B x     B y     B z   ] M r [   x r     y r     f   ] = [   x e 2     y e 2     f   ] M r
M r is the rotation matrix between the left image space auxiliary coordinate system and the right image space auxiliary coordinate system. x and y are the image plane coordinates of point P r , which are known. The relationship between x and y is as follows:
y r = ( A / B ) · x r + ( C / B ) · f
Among them,
A = f · B x + y r · B z B = f · B x + x r · B z C = y r · B x x r · B y
Therefore, the relationship between x and y can be obtained, and the epipolar equation on the right image is acquired.
As shown in Figure 4, after the stereo camera is calibrated and its parameters are known, stereo correction can be performed using the epipolar constraint so that the optical axes of the left and right cameras are parallel, which means the image planes are parallel. The epipolar lines on the two images then lie on the corresponding horizontal lines. Suppose the world coordinates of the spatial point P are P w ; its projections onto the two camera coordinate systems are P cl and P cr , respectively. The extrinsic parameters of the left and right cameras are ( R l , t l ) and ( R r , t r ) , respectively. The relationship between the camera coordinate system and the world coordinate system is as follows:
P c l = R l P W + t l P c r = R r P W + t r
Disparity d is the horizontal difference of corresponding points on the left and right imaging planes. Its calculation formula is as follows:
d = x l x r
According to the geometric relationship, the depth z (Z-direction coordinate) of the object can be calculated by the following formula:
z = b f d
Once the disparity map is calculated, the depth information can be calculated through the conversion relationship between disparity and depth.
The initial three-dimensional coordinates of spatial points are obtained using the aforementioned method. Due to factors such as noise in image feature extraction, matching errors, and limited camera calibration accuracy, these initial positions typically contain certain deviations and cannot directly meet the requirements of high-precision localization. To further improve the accuracy of the 3D point coordinates, the system introduces the reprojection error as an optimization criterion. The currently estimated 3D spatial point is back-projected onto each original image plane according to the camera projection model, yielding the corresponding reprojected pixel coordinates. These reprojected coordinates are then compared with the observed feature point coordinates detected in the images, and the geometric distance between them in pixel space is defined as the reprojection error. This error intuitively reflects the deviation between the current estimated 3D point and the true values.
On this basis, with the objective of minimizing the global reprojection error, the three-dimensional coordinates of spatial points are iteratively adjusted via a nonlinear optimization algorithm (e.g., the Levenberg–Marquardt algorithm). In each iteration, the algorithm corrects the position of the 3D point according to the error magnitude and gradient direction, driving the reprojected points to gradually approach the ground truth points. After multiple iterations until convergence, highly accurate and consistent 3D coordinates are finally obtained, which significantly improves the reliability of 3D reconstruction or localization results.

2.3. Analysis of the Measurement Method for Alignment of the Rod and Hole

The previous section introduced the conversion principle between image coordinates and spatial coordinates. After image processing to eliminate interference factors such as lighting, followed by edge extraction and corner fitting, the image coordinates of key points on rods and holes can be obtained. The spatial coordinates of key points of holes can be obtained using the corresponding image coordinates [23,24,25,26].
Since the relative position of the rods and the camera is fixed, rod–hole alignment mainly involves calculating the relative posture between rods and holes. Therefore, it is necessary to construct an alignment analysis model for rod–hole pairing. As shown in Figure 5, this figure shows the spatial attitude diagram of rods and holes during alignment. The green dots represent the four key points to be measured on the hole. The plane fitted by these four key points is P 1 , and the normal vector of P 1 is n 1 . The plane fitted by three points on the rod is P 2 , and the normal vector of P 2 is n 2 . The blue dotted lines represent the central axes of the two planes. The intersection points of the axes of the hole and the rod on plane P 1 are O 1 and O 2 , respectively. The distance between O 1 and O 2 is the center distance. The projections of points B and C onto plane P 1 are B and C , respectively. Through the control system, plane P 2 can be adjusted to be parallel to plane P 1 . The angle between line FG and line B C on plane P 1 is α , which is the rotation angle to be calculated.
The specific 4 steps for aligning rod and hole are as follows:
  • Given the position of the camera, the coordinates of the key points A, B, and C on the rods are obtained by binocular vision. Then, the equation of plane P 2 and the equation of the central axis are fitted according to the key points.
    Given A = ( x A , y A , z A ) , B = ( x B , y B , z B ) , C = ( x C , y C , z C ) , where A, B, and C lie on a square cross-section of a rod, with B being the right-angled vertex, the following formula should theoretically hold:
    B A · B C = 0 B A = B C
    The normal vector calculation formula for plane P 2 is as follows:
    n 2 = B A × B C = ( A B ) × ( C B )
    Normalize the above vector:
    n 2 = n 2 n 2 = ( a 2 , b 2 , c 2 )
    The axis of the rod passes through O s and is perpendicular to the cross-section P 2 , with a direction of n 2 . The equation for the axis is as follows:
    r ( t ) = O s + t n 2 , t R
    The above equation can be rewritten as a symmetric equation:
    x x O s a 2 = y y O s b 2 = z z O s c 2
  • The least square method is adopted to perform planar fitting (plane P 1 ) on the four key points (D, E, F, and G) of the hole, and the coordinates of projection points of the key points on the plane P 1 are calculated. Connect four projection points to obtain the spatial contour of the hole.
    Given that points D, E, F and G are theoretically coplanar and have square vertices, D = ( x D , y D , z D ) , E = ( x E , y E , z E ) , F = ( x F , y F , z F ) , G = ( x G , y G , z G ) . Using the least squares method to fit plane P 1 , the coordinates are centered as shown in the following equation:
    P ¯ 1 = ( x ¯ 1 , y ¯ 1 , z ¯ 1 ) = D + E + F + G 4 Q i = P i P ¯ 1 , i { D , E , F , G }
    Construct the covariance matrix as follows:
    M 1 = i Q i Q i T = S x x S x y S x z S x y S y y S y z S x z S y z S z z
    where S x x = ( x i x ¯ 1 ) 2 , S x y = ( x i x ¯ 1 ) ( y i y ¯ 1 ) .
    Let λ m i n be the minimum eigenvalue of M 1 , and the corresponding unit eigenvector n 1 = ( a 1 , b 1 , c 1 ) is the unit normal vector of plane P 1 . The equation for plane P 1 is as follows:
    a 1 x + b 1 y + c 1 z + d 1 = 0 d 1 = ( a 1 x ˜ 1 + b 1 y ˜ 1 + c 1 z ˜ 1 )
    For any point P = ( x 0 , y 0 , z 0 ) , the projected P coordinates on plane P 1 are as follows:
    t = a 1 x 0 + b 1 y 0 + c 1 z 0 + d 1 P = ( x 0 a 1 t , y 0 b 1 t , z 0 c 1 t )
    By calculating D, E, F, and G separately, D , E , F and G can be obtained. The sequential connection of points on the hole ( D E F G D ) results in the spatial quadrilateral contour located on P 1 .
  • Calculate the coordinates of the vertical foot O 2 , and take the distance between O 2 and the O 1 as the center distance. At the same time, calculate the angle α between the normal vector of the rod and hole plane.
    The formula for solving the intersection point (perpendicular foot O 2 ) between the central axis r ( t ) = O s + t n 2 and plane P 1 is as follows:
    a 1 ( x O s + t a 2 ) + b 1 ( y O s + t b 2 ) + c 1 ( z O s + t c 2 ) + d 1 = 0 t 0 = a 1 x O s + b 1 y O s + c 1 z O s + d 1 a 1 a 2 + b 1 b 2 + c 1 c 2 O 2 = O s + t 0 n 2
    The formula for solving the center O 1 of the hole is as follows:
    O 1 = D + E + F + G 4
    The formula for calculating the center distance is as follows:
    Δ = O 1 O 2
    Projection B and C onto plane P 1 yields B and C , which are calculated using the following formula:
    t B = a 1 x B + b 1 y B + c 1 z B + d 1 B = ( x B a 1 t B , b 1 t B , z B c 1 t B )
    Similarly, C can be obtained.
    The orientation quantities u = G F , v = C B , both vectors, are on plane P 1 . Normalize the vector coordinates using the following formula:
    u ^ = u u v ^ = v v
    Calculate the angle α according to the dot product formula:
    cos α = u ^ · v ^ α = arccos ( cos α ) [ 0 , π ]
  • Adjust the spatial posture of the lifting gear based on the center distance O 1 O 2 and rotation angle α .

3. Experiment

3.1. Experiment Design

The alignment experiment can be divided into three main stages: camera selection, image acquisition, and algorithm accuracy evaluation. The camera should have a suitable frame rate to meet the requirements of hoisting operations and be capable of monitoring the operating status in real time. The deployment position of the camera should be consistent with the actual scene, ensuring imaging quality without interfering with the hoisting process. To ensure the reliability of the experimental procedure, it is necessary to adjust the relative positions of holes and rods during testing. The experimental flow is shown in Figure 6.
The image processing workflow is illustrated in the following figures: Figure 7a presents the original image segmented by the YOLO model. Based on the RGB (Red–Green–Blue) features of the segmented image, binarization is applied, and the corresponding result is shown in Figure 7b. After extracting the inner edges from Figure 7b, the Gaussian mixture clustering algorithm is employed to automatically cluster the edges [27,28,29], thereby obtaining discrete point sets corresponding to the four edges. Subsequently, four straight lines are fitted using these discrete points, and their intersection points are identified as the four corner points of the hole; the result is depicted in Figure 7c, where the four edges are distinguished by four different colors. Once the corner points are determined, corresponding conjugate points are searched for in the other image to obtain stable point pairs.

3.2. Construction of the Experimental Platform

As shown in Figure 8a, in order to more realistically reproduce the scene of the construction site, following the design requirements of the lifting gear and the dimensions of the segmented beams, a three-dimensional model of the segmented beams hoisting operation is established with a total of six rods and six holes. To verify the proposed method, an experimental platform required for visual hoisting is built according to the actual design scale to simulate the working states of individual rods and holes. A stereo camera is installed on the stand. The position and angle of the camera can be adjusted to complete the shooting at the best angle and distance. To ensure clear imaging, a light source is installed on the bracket.
To address the core technical challenge of optimal deployment of visual monitoring cameras in large-scale construction areas, as well as the significant challenges posed by the complex construction site environment and variable natural lighting conditions on camera calibration and image acquisition, this paper, in combination with practical engineering needs, carried out camera installation position optimization and lighting environment adaptation design, and it rigorously verified the feasibility and rationality of the solution through experiments. Considering the actual structural form, spatial dimensions, and operational conditions of the engineering lifting equipment, a comprehensive analysis of key factors such as the camera’s field of view coverage, target image clarity, installation stability, and ease of subsequent maintenance was conducted to determine the optimal installation positions of the cameras. In the experimental platform, the arrangement positions and installation attitudes of the cameras are completely consistent with the installation state in actual engineering projects, ensuring a high degree of match between the experimental environment and real construction scenarios, and guaranteeing that the experimental results have direct engineering reference value and transferability.
To effectively mitigate the adverse effects of uneven lighting, strong light interference, and shadow changes at the construction site on image quality, and to improve visual recognition and positioning accuracy, a dedicated auxiliary lighting source is added to the experimental system. The type, power, arrangement, and angle of the light are consistent with the lighting configuration on the actual hoist, maximally restoring the real working environment, effectively reducing the interference of lighting changes on visual measurement accuracy, and enhancing the system’s robustness under complex site conditions. In terms of camera spatial position parameter design, a refined quantitative optimization was carried out: the distance between the camera and the hoist in the Y direction is set to 0.5 m, which ensures the complete imaging of the target while taking into account structural interference risks and compact installation; considering the constraints on the relative distance of the hoist holes from the preliminary positioning of the previous stage of laser ranging, that is, the hole distance needs to be maintained within the range of 0.2 m to 0.3 m, and by balancing positioning accuracy with spatial layout, the total distance in the Z direction is set to 1.85 m in the experiment; fully considering multiple constraints such as the camera’s field of view, imaging resolution, depth of field requirements, hoist structure strength, and motion stability, after multiple sets of simulations and preliminary experiments for comparative analysis, the final relative distance between the camera and the hoist in the X direction is set to 2.2 m, enabling the visual system to stably acquire high-quality images throughout the entire lifting process, and meeting the accuracy requirements of visual positioning and intelligent hoisting.
The above camera positioning optimization scheme and lighting compensation strategy effectively address practical engineering problems such as the difficulty of camera deployment in large-scale construction areas, the difficulty of on-site calibration, and poor lighting adaptability, demonstrating not only the theoretical rationality of the proposed method but also fully validating its practical application value and promotion prospects in real engineering projects.
As shown in Figure 8b,c, the three red marked points on the rods are the points to be measured, and the four green marked points on the hole are the points to be measured. The relative pose of the rod and hole is measured by these seven points.
As shown in Table 1, these are the intrinsic parameters of the stereo camera. The pixel size is 3.45 μm, with u x and u y being the coordinates of the main image points and k 1 and k 2 being the radial distortion parameters of the camera, and the image resolution is 4096 × 3000.
The extrinsic parameters of the camera are shown in Table 2. X, Y and Z represent the coordinates of the camera’s projection center, and φ , ω , and κ are the rotation angles of the image in the three directions.
Figure 9a,b presents an image captured by a stereo camera. Both the left and right images contain a rod and a hole. The rod and hole are separated by using an image segmentation algorithm, and image processing is carried out. The processed local image is shown in Figure 9c–f. The main features of the rod and hole are retained on the local image, and the parts unrelated to the rod and hole are removed. Image processing is carried out on the local image to obtain the image coordinates of key points in the left image, and the epipolar constraint is used to match the image coordinates of key points in the right image. According to the principle of binocular vision, the image coordinates are converted into spatial coordinates, which facilitates the calculation of the relative positions of the hole and rod.
To fully verify the measurement accuracy and robustness of the proposed method, 30 independent experiments were conducted by adjusting the spatial position of the hoisting hole to vary the relative pose and distance between the suspender and the hoisting hole while keeping the positions of the suspender and the camera unchanged. The measurement results from a total station were adopted as the ground truth of spatial coordinates against which the measurement results of the proposed method were compared. Meanwhile, the standard deviation and root mean square error (RMSE) of the rotation angle and center distance were calculated, and the confidence intervals were derived at the 95% confidence level. The detailed experimental data and error analysis are presented in Table 3 and Table 4.
Based on the experimental data and error statistics from 30 independent tests, the proposed measurement method achieves high accuracy and favorable stability.
For rotation angle measurement, the absolute error ranges from 0.06° to 0.45°, and the relative error ranges from 0.73% to 6.21% with most values below 3%. The standard deviation is 0.123°, the RMSE is 0.215°, and the 95% confidence interval is [0.172, 0.258], indicating small error dispersion and high reliability.
For center distance measurement, the absolute error ranges from 0.14 mm to 1.73 mm, and the relative error ranges from 0.26% to 1.57%. The standard deviation is 0.452 mm, the RMSE is 0.683 mm, and the 95% confidence interval is [0.561, 0.805], confirming stable measurements that meet practical engineering accuracy requirements.
Overall, the proposed method achieves high-precision measurement under varying hole positions and relative poses, validating its robustness and engineering applicability.
This experimental system establishes a complete technical process integrating image acquisition, visual processing, mathematical modeling, and data analysis. The system’s stability is verified through multiple sets of repeatability experiments. The experimental results indicate that the deviations in the rotation angles of the hoist hole and hoist rod, as well as the deviation in the center distance, are all within the allowable engineering error range. The overall measurement accuracy is high, and it is capable of meeting the technical specifications and usage requirements of practical engineering applications.

4. Conclusions

To improve the safety and efficiency of rod–hole alignment during precast beam hoisting, this paper proposes a binocular vision-based visual alignment method for precast beam hoisting. This method adopts a stereo camera to acquire images of rods and holes, extracts image key points, and matches these points via epipolar geometry to calculate the spatial coordinates of key points on rods and holes. According to the spatial positional relationship of these key points, a mathematical model is established to compute the center distance and rotation angle between rods and holes.
To validate the practical performance of the proposed method, an experimental platform for rod–hole alignment is constructed based on real engineering scenarios. The positions of rods and the camera remain fixed, while the relative pose between rods and holes is adjusted by moving holes. Experimental results show that the maximum absolute error of center distance is less than 1.73 mm, and the maximum absolute error of rotation angle is merely 0.47°. Both indicators fully meet practical engineering requirements.
The established experimental platform only completes alignment tests for a single rod and hole pair. Practical construction requires the simultaneous alignment of six pairs of rods and holes. Moreover, the current test platform still differs from actual working conditions. Accordingly, future research will focus on conducting alignment experiments for six pairs of rods and holes under real engineering environments [30,31,32,33].

Author Contributions

L.X.: Writing—original draft (equal). C.Z.: Supervision (equal). All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

Author Lin Xiao was employed by the company CCCC Second Harbor Engineering Company Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Fu, Q. Research on Prefabrication Measurement and Control Technology of Short-Line Matching Segment-Based Prefabricated Assembly Bridges Based on Photogrammetry. Ph.D. Thesis, Tongji University, Shanghai, China, 2022. [Google Scholar]
  2. Rettinger, M.; Hueckler, A.; Schlaich, M. Technologies and developments in precast segmental bridge construction. Beton-Und Stahlbetonbau 2021, 116, 12–23. [Google Scholar] [CrossRef]
  3. Liu, W.; Han, B.; Yan, W.; Xie, H. Research Progress on Structural Mechanical Properties of Prefabricated and Assembled Concrete Segmental Beams. China J. Highw. Transp. 2023, 36, 81–99. [Google Scholar] [CrossRef]
  4. Zhong, Y.; Zhu, Q. Analysis of Shrinkage Creep and Prestress Loss during Prefabrication and Assembly of Segmental Beams. J. China Foreign Highw. 2021, 41, 126–132. [Google Scholar]
  5. Xue, S.; Hao, H.; Hao, Y.; Zou, D.; Liu, Y. Experimental and numerical investigations on the impact response of precast segmental bridge piers subjected to soft and hard missile impacts. Eng. Struct. 2025, 339, 120602. [Google Scholar] [CrossRef]
  6. Deng, W.; Song, Q.; Liu, D.; Peng, Z.; Zhang, J. Optimal Design of Segment Storage and Hoisting Scheme for Prefabricated Corrugated Steel Web Composite Box Girder Bridge. J. Jiangsu Univ. (Nat. Sci. Ed.) 2025, 46, 113–119. [Google Scholar]
  7. Ji, X.; Fang, Z.; Xiao, H.; Dong, Q. Research on the Auxiliary Positioning Method of Mechanical Arm for Concrete Segmental Beam Lifting Gear. Lift. Transp. Mach. 2022, 18, 36–43. [Google Scholar]
  8. Zhao, C.; Fan, C.; Zhao, Z. Optimization Method of Square Hole Measurement Based on Generalized Point Photogrammetry. Appl. Sci. 2023, 13, 6320. [Google Scholar] [CrossRef]
  9. Xing, B.; Xu, J.; Guan, L.; Liu, Y. Machine Vision and Sensor Technology; Huazhong University of Science and Technology Press: Wuhan, China, 2023. [Google Scholar]
  10. Fraser, C.S. Close-range photogrammetry applications in bridge construction: A review. Photogramm. Eng. Remote Sens. 2002, 68, 51–58. [Google Scholar]
  11. Cheng, Y.; Lin, F.; Wang, W.; Zhang, J. Vision-based trajectory monitoring for assembly alignment of precast concrete bridge components. Autom. Constr. 2022, 140, 104350. [Google Scholar] [CrossRef]
  12. Kim, M.; Son, B.; Kim, S. Automated defect detection and dimensional measurement in precast concrete using photogrammetry. Autom. Constr. 2021, 125, 103668. [Google Scholar] [CrossRef]
  13. García de Soto, B.; Ajay, K.; Brilakis, I. Sensor integration and precision control for robotic assembly of bridge segments. Autom. Constr. 2019, 104, 102828. [Google Scholar] [CrossRef]
  14. Wu, H.; Zhang, W.; Lu, W.; Chen, J.; Bao, J.; Liu, Y. Automated part placement for precast concrete component manufacturing: An intelligent robotic system using target detection and path planning. J. Comput. Civ. Eng. 2025, 39, 04024044. [Google Scholar] [CrossRef]
  15. Li, X.; Wang, X.; Luo, X. Precision assembly control of precast segments using multi-sensor fusion. J. Constr. Eng. Manag. 2021, 147, 04021066. [Google Scholar] [CrossRef]
  16. Pan, Y.; Lu, W.; Chen, G. Machine vision for intelligent manufacturing in prefabricated construction. J. Manag. Eng. 2022, 38, 04021097. [Google Scholar] [CrossRef]
  17. Zhang, Z.; Liu, W.; Liu, G.; Song, L.; Qu, Y.; Li, X.; Wei, Z. Progress of 3D Vision Measurement Technology and Its Application. J. Image Graph. 2021, 26, 1483–1502. [Google Scholar]
  18. Zhang, Z.; Zheng, S.; Wang, X. Development and Application of Industrial Photogrammetry Technology. Acta Geod. Cartogr. Sin. 2022, 51, 843–854. [Google Scholar] [CrossRef]
  19. Reagan, D.; Sabato, A.; Niezrecki, C. Feasibility of using digital image correlation for unmanned aerial vehicle structural health monitoring of bridges. Struct. Health Monit. 2018, 17, 1056–1072. [Google Scholar] [CrossRef]
  20. Zhao, C.; Fan, B.; Tian, L.; Hu, J.; Pan, Q. Statistical Optimization Feature Matching Algorithm Based on Polar Geometry. Chin. J. Aeronaut. 2018, 39, 321727. [Google Scholar] [CrossRef]
  21. Alsadik, B.; Abdulateef, N.A. Epipolar Geometry between Photogrammetry and Computer Vision–A Computational Guide. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, 5, 25–32. [Google Scholar] [CrossRef]
  22. Han, L.; Shi, Z.; Wang, X.; Gao, Y. Depth estimation from light fields via epipolar geometry and an axial attention mechanism. Opt. Express 2025, 33, 24321–24340. [Google Scholar] [CrossRef]
  23. An, C.; Song, K.; Bao, L.; Zhao, D.; Zhou, Z.; Yan, Y. A Novel Edge Detection Method of Blade With Multisupervision for Foreground–Background Confusion Caused by Extreme Illumination. IEEE Sens. J. 2024, 24, 29429–29440. [Google Scholar] [CrossRef]
  24. Pu, M.; Huang, Y.; Liu, Y.; Guan, Q.; Ling, H. EDTER: Edge Detection With Transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2022; pp. 1402–1412. [Google Scholar]
  25. Abosinnee, A.S.; Bencsik, G.; Abedi, F. Edges in image with illumination variations scenarios: A review. Vis. Comput. 2025, 41, 12277–12305. [Google Scholar] [CrossRef]
  26. Agrawal, H.; Desai, K. Canny edge detection: A comprehensive review. Int. J. Tech. Res. Sci. 2024, 9, 27–35. [Google Scholar] [CrossRef] [PubMed]
  27. Tian, J.; Liu, X.; Chen, C.; Xiao, G.; Wang, Y.; Kang, Y.; Wang, P. Feature fusion-based inconsistency evaluation for battery pack: Improved Gaussian mixture model. IEEE Trans. Intell. Transp. Syst. 2022, 24, 446–458. [Google Scholar] [CrossRef]
  28. Punzo, A.; Tortora, C. A model-based clustering approach for bounded data using transformation-based Gaussian mixture models. Comput. Stat. Data Anal. 2024, 198, 107921. [Google Scholar]
  29. Wang, H. Improved Gaussian mixture model for short-term wind power data clustering. Energy Convers. Manag. 2024, 312, 118676. [Google Scholar]
  30. Yuan, J.; Meng, Y.; Zeng, J.; Zhu, X. Determination of Key Chain Project Buffer for Segment Beam Production Based on Comprehensive Utility. J. Railw. Sci. Eng. 2024, 21, 4779–4788. [Google Scholar] [CrossRef]
  31. Ma, Z.L.; Liu, Y.; Liu, S.L.; Shi, Q.B.; Wang, Z.B. Rapid Measurement Method for Formwork Pose of Segmental Beams in Prefabricated Beam Yard. Acta Opt. Sin. 2022, 42, 1512001. [Google Scholar] [CrossRef]
  32. Dong, Y.; Yang, H.; Yin, M.; Li, M.; Qu, Y.; Jia, X. Research on Lightweight Method of Segment Beam Point Cloud Based on Edge Detection Optimization. Buildings 2024, 14, 1221. [Google Scholar] [CrossRef]
  33. Xu, C.; Xiong, W.; Tang, P.; Cai, C.S. Automated flatness assessment for large quantities of full-scale precast beams using laser scanning. Comput. Civ. Infrastruct. Eng. 2024, 39, 1868–1885. [Google Scholar] [CrossRef]
Figure 1. The hoisting of segmented beams.
Figure 1. The hoisting of segmented beams.
Sensors 26 03426 g001
Figure 2. Binocular vision principle.
Figure 2. Binocular vision principle.
Sensors 26 03426 g002
Figure 3. Epipolar search principle.
Figure 3. Epipolar search principle.
Sensors 26 03426 g003
Figure 4. Disparity calculation principle.
Figure 4. Disparity calculation principle.
Sensors 26 03426 g004
Figure 5. The alignment principle of rod and hole.
Figure 5. The alignment principle of rod and hole.
Sensors 26 03426 g005
Figure 6. Experiment flow.
Figure 6. Experiment flow.
Sensors 26 03426 g006
Figure 7. Image process.
Figure 7. Image process.
Sensors 26 03426 g007
Figure 8. Experiment scenario.
Figure 8. Experiment scenario.
Sensors 26 03426 g008
Figure 9. Real photos.
Figure 9. Real photos.
Sensors 26 03426 g009
Table 1. Intrinsic parameters of camera.
Table 1. Intrinsic parameters of camera.
ParametersCam1Cam2
f (μm)3.4553.449
u x (pixel)2070.562070.79
u y (pixel)1514.271501.06
k 1 0.0121−0.0508
k 2 −1.05020.9882
width (pixel)40964096
height (pixel)30003000
Table 2. Extrinsic parameters of camera.
Table 2. Extrinsic parameters of camera.
 XYZ φ (rad) ω (rad) κ (rad)
Cam1468.9527.972530.646.133.301.91
Cam2424.31327.522437.246.163.171.83
Table 3. Alignment results of rod and hole.
Table 3. Alignment results of rod and hole.
NumberIndexProposedTrueAbsolute ErrorRelative Error
1Rotation (°)6.926.850.071.02%
Distance (mm)54.2953.870.420.78%
2Rotation (°)6.296.390.101.57%
Distance (mm)54.3154.450.140.26%
3Rotation (°)7.677.360.314.21%
Distance (mm)103.47104.901.431.36%
4Rotation (°)15.5315.980.452.82%
Distance (mm)108.45110.181.731.57%
5Rotation (°)4.684.990.316.21%
Distance (mm)19.5419.380.160.83%
6Rotation (°)8.288.220.060.73%
Distance (mm)72.4072.150.250.35%
7Rotation (°)10.1210.350.232.22%
Distance (mm)88.6289.050.430.48%
8Rotation (°)5.785.920.142.37%
Distance (mm)36.8936.720.170.46%
9Rotation (°)12.3512.180.171.39%
Distance (mm)95.7696.921.161.20%
10Rotation (°)7.897.760.131.67%
Distance (mm)65.3265.580.260.39%
11Rotation (°)14.2114.530.322.20%
Distance (mm)115.47116.891.421.21%
12Rotation (°)5.235.380.152.79%
Distance (mm)28.6528.490.160.56%
13Rotation (°)9.459.280.171.83%
Distance (mm)82.1382.560.430.52%
14Rotation (°)11.7611.920.161.34%
Distance (mm)99.87100.320.450.45%
15Rotation (°)6.546.670.131.95%
Distance (mm)49.2849.050.230.47%
16Rotation (°)13.5813.890.312.23%
Distance (mm)107.54108.961.421.30%
17Rotation (°)5.896.050.162.64%
Distance (mm)39.7639.580.180.45%
18Rotation (°)8.768.620.141.62%
Distance (mm)76.3276.680.360.47%
19Rotation (°)10.8911.120.232.07%
Distance (mm)89.4589.920.470.52%
20Rotation (°)7.237.150.081.12%
Distance (mm)58.7658.590.170.29%
21Rotation (°)12.9813.250.272.04%
Distance (mm)102.34103.761.421.37%
22Rotation (°)5.455.620.173.02%
Distance (mm)32.1832.030.150.47%
23Rotation (°)9.879.730.141.44%
Distance (mm)85.6786.020.350.41%
24Rotation (°)14.7615.120.362.38%
Distance (mm)112.89114.251.361.19%
25Rotation (°)6.897.020.131.85%
Distance (mm)52.4552.280.170.33%
26Rotation (°)8.348.210.131.58%
Distance (mm)70.1270.450.330.47%
27Rotation (°)11.2311.450.221.92%
Distance (mm)92.7893.210.430.46%
28Rotation (°)5.125.270.152.85%
Distance (mm)26.8926.740.150.56%
29Rotation (°)13.1213.450.332.45%
Distance (mm)105.67107.021.351.26%
30Rotation (°)9.159.080.070.77%
Distance (mm)78.6278.950.330.42%
Table 4. Error statistics and confidence interval analysis (95% confidence level).
Table 4. Error statistics and confidence interval analysis (95% confidence level).
Evaluation IndexStandard DeviationRMSEConfidence Interval
Rotation (°)0.1230.215[0.172, 0.258]
Distance (mm)0.4520.683[0.561, 0.805]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, L.; Zhao, C. Visual Alignment Method for Hoisting Prefabricated Segmented Beams. Sensors 2026, 26, 3426. https://doi.org/10.3390/s26113426

AMA Style

Xiao L, Zhao C. Visual Alignment Method for Hoisting Prefabricated Segmented Beams. Sensors. 2026; 26(11):3426. https://doi.org/10.3390/s26113426

Chicago/Turabian Style

Xiao, Lin, and Chengli Zhao. 2026. "Visual Alignment Method for Hoisting Prefabricated Segmented Beams" Sensors 26, no. 11: 3426. https://doi.org/10.3390/s26113426

APA Style

Xiao, L., & Zhao, C. (2026). Visual Alignment Method for Hoisting Prefabricated Segmented Beams. Sensors, 26(11), 3426. https://doi.org/10.3390/s26113426

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop