1. Introduction
Power transmission line icing (PTLI) is an important issue that affects the reliability and safety of power grid systems, especially in cold regions. PTLI occurs through the ice phenomenon, in which cold water droplets increase on the transmission line and then gradually freeze to become a layer of ice on top of the line cables, conductors, insulators, and supporting structures. The formation of this ice sheet can result in a series of significant problems, such as line breaks, tower collapses and other power transmission damage, which is a very dangerous threat to the safety of electric power operating systems. Worse, seriously threatening reliability, PTLI also brings great economic losses to the company and security risks to the country and society. In addition to the technical problems it causes, the impact of the ice sheet on transmission lines is very wide. Ice loads on various components of the power transmission system cause high losses for utility companies, and repairing damaged systems is also very expensive. In addition, PTLI disasters have an impact on the instability of the power supply system, which often leads to regional power outages and, thus, affects human activities and poses a significant risk to infrastructure [
1,
2,
3]. Between 1980 and 2000, major snowstorms caused damage to transmission lines in several cities in the United States, the United Kingdom, Canada, Russia, and Norway, creating global problems. In China, large-scale power outages occurred in 2005 and 2008 due to PTLI disasters that caused a major impact on the power grid and beneficiaries suffered huge economic losses [
1].
Similarly, an ice storm hit the eastern province of Canada in 2013, damaging
$200 million worth of property. The PTLI disaster left more than a million of its consumers in the dark for several days. For example, three 500 kV transmission lines covering the 500 kV Guanli line, the 500 kV Dushan line, and the 500 kV Guanqiao line were tripped and disconnected in January 2018 due to melting ice and tremors in the Anhui province, which posed a severe danger to the safe operation of the power grid [
4,
5,
6]. The PTLI disaster also occurred in February 2021 and affected the power grid system in Texas, U.S., leaving millions of people without electricity during winter. Since February 2024, the city of Wuhan in Hubei province has been hit by a low-temperature freezing rain disaster, which refers to freezing rain that causes difficulties in transportation and electricity in the region. Based on the problems mentioned above, it is necessary to monitor the level of ice accumulation in power transmission lines to prevent damage to infrastructure and also provide support for a consistent and stable power supply in areas experiencing extreme and bad weather conditions [
7].
Processing data from PTLI images is not easy; PTLI environments usually show climate change and irregular light conditions, leading to noise and artifacts in the images taken [
8,
9]. In the context of improving the accuracy of 3D measurements and monitoring of power transmission lines for ice thickness, such conditions can lead to poor image quality, which makes it difficult to identify and analyze some image features, such as low contrast as a result of inadequate lighting or visibility barriers as exemplified by snow or ice sheets, which can make it difficult to use signal processing approaches for image enhancement. The difficulties mentioned above can be critical in applications where accuracy is most important, for example, in 3D measurements, so it can lead to incorrect judgment and inaccurate image processing. According to the above description, one of them is to create an effective and powerful image processing algorithm to support a significant range of monitoring and measurement work under suboptimal environments [
10,
11].
Generally, 3D measurements on PTLI consist of several key stages [
2,
12]. The first stage is PTLI identification, where edge detection and line detection are performed to identify the contours and main geometric structures of objects in 3D space. Once the ice is identified, the next stage is key point matching, in which key point matching matches unique points, such as angles or texture features, which can be recognized on two images from different angles of view (left and right). The matching points on both images are used to calculate depth or 3D information through triangulation. With key point matching, the system can determine the 3D coordinates of each of those unique points in 3D space, which is essential for reconstruction accuracy. Once the 3D model is formed, ice thickness measurements can be made based on the results of the 3D reconstruction. However, this paper only focuses on the improvement of the key point-matching stage. Key point matching is an important stage in 3D reconstruction, as accuracy depends on the success of key point detection and key point matching. In some cases, this key point matching process has environmental disturbances that cause image degradation, such as noise, background, and low lighting problems. It is generally noted that failed key point matching methods in PTLI images resulted in the desired decrease in accuracy and durability. The above limitations are the reason why better methods are needed to find and match important points in difficult environmental conditions. Improving the accuracy and reliability of key point matching is important to improve the effectiveness of image processing in the application of ice thickness and the effectiveness of the methods used in the monitoring process. In this study, key point matching is used in PTLI monitoring applications where information is considered based on ice thickness measurements, so an accurate key point matching method is needed.
The primary objective of this research is to develop a robust image processing pipeline capable of accurately detecting and matching key points between images in challenging environments. The pipeline is designed for PTLI 3D monitoring based on binocular vision. The contributions of the research are as follows:
The new integration for 3D measurement: the research combines established techniques like multiscale retinex (MSR) and contrast limited adaptive histogram equalization (CLAHE); oriented FAST and rotated BRIEF (ORB); and scale-invariant feature transform (SIFT) in a new method. This integration into a cohesive pipeline specifically targets robust key point matching. The application of these new methods to the PTLI image allow for 3D measurements for ice thickness estimation;
Advanced random sample consensus (RANSAC)-based refinement with m-estimator sample consensus (MAGSAC) Integration: MAGSAC-driven RANSAC optimization with iterative parameter tuning and least median of squares (LMEDS) integration for superior outlier exclusion and transformation accuracy;
Automatic cropping and practical application: The automatic cropping mechanism is based on the improvement of a connected component analysis, coupled with practical application and validation in PTLI images.
3. Methodology
3.1. Pipeline Overview
This research proposes to build an effective image processing system that would help identify the correspondence between images and their key points within PTLI images. The image processing pipeline comprises several steps, and each step is designed to solve particular problems that are usually encountered in image processing. The process starts with the image being loaded and preprocessed. At this stage, the image is read and set for analysis to change the format to standard and initial scaling. The following step is an image enhancement, which is useful for improving image resolution and erasing out noise, so that features can be easily detected. At this stage, methods for enhancing the visibility and contrast of the image are incorporated.
Subsequently, there will be an automatic cropping of the imagery stage that will focus on important parts of the images while eliminating unnecessary areas. The feature point detection is carried out after the image cropping; in this process, significant points called feature points and crucial descriptors for robust image matching are obtained. For feature detection, the ORB and SIFT algorithms detect the key points, after which feature matching matches the corresponding images. The next step is feature matching, with the intention of identifying the corresponding point between two images based on the key points. Following this stage, the RANSAC-based refinement is performed to select key point matching, eliminate outliers, and determine robust transformations between images. Finally, the points of interest are presented to assess the pipeline implementation and the correctness of the outputs.
Figure 1 shows the block diagram of the proposed image processing pipeline.
3.2. Image Loading and Preprocessing
At this stage, efficient image acquisition and preprocessing are presented, consisting of several stages: color space standardization and computationally optimized rescaling. The loading and preprocessing stages of the image are very important so that the image is in a very good condition for processing. The loading step is to choose the file format, which is important because it affects the quality and ease of manipulating the image. Preprocessing also involves operations such as converting the images from BGR format to RGB format since they are compatible with most libraries and image processing procedures that involve matrices, and simpler analysis involves the transformation of the images from RGB into matrices of lower dimensions. The second one is a resizing process, and the image is resampled on the predefined scale factor, so it is not very computationally intensive. This step is beneficial to handle large datasets or real-time systems where concern for speed matters most. This step ensures that the necessary qualities of the image, such as the resolution and format of the image, are similar to the other images in the set of data collection as a solid basis for the next task in processing the image.
Given an input image,
, with dimensions
, the image is initially loaded and converted from
to
color space:
The image is then resized by a scaling factor, s, adjusting its dimensions to optimize computational efficiency:
3.3. Image Enhancement
At the image enhancement stage, multifaceted adaptive image enhancement is presented: integrating multiscale retinex, CLAHE, unsharp masking, wavelet denoising, bilateral filtering, and guided filtering for optimal detail and noise Reduction. At this stage, it aims to improve the quality of PTLI’s image by enhancing details and reducing noise, facilitating better feature detection. Image enhancement is one of the important methods in this image processing, especially in PTLI images such as low lighting or complex backgrounds. The proposed image processing pipeline integrates several enhancement methods to maximize the visibility of key features. The first stage in this image enhancement is the integration of multiscale retinex (MSR) and contrast limited adaptive histogram equalization (CLAHE). MSR improves dynamic range and contrast by combining retinex operations on multiple scales, improving the visibility of details in light and dark regions. Meanwhile, CLAHE increases local contrast and makes details of different image regions more distinguishable. Unsharp masking is implemented to improve the edges by adding back a lower frequency of the original image and then averaging. The next step is the application of a sequence of filters, including a wavelet denoising filter, a bilateral filter, and a guided filter. Wavelet denoising allows for the filtering out of noise so that small details and important information are not lost, while bilateral filtering smooths out the image while preserving edges. The final stage, with guided filtering, fine-tunes the refinement step and helps highlight the important features without producing any artifacts. The equation of the image enhancement process for multiscale retinex (MSR) for each pixel intensity
at position
is as follows:
where
is the Gaussian filter with scale
, and
is the number of scales. The
operation is applied to the luminance channel
of the image in the LAB color space:
The sharpened image
is computed by:
where
is a Gaussian blurred version of
, and λ is a sharpening factor. Wavelet denoising is applied by decomposing the image into wavelet coefficients
and thresholding:
where T is a thresholding operator and
is the inverse wavelet transform. The bilateral filter smooths the image while preserving edges:
where
is the neighborhood of pixel
,
and
are spatial and range parameters, and
is a normalization factor. Guided filtering further refines the image by:
3.4. Automatic Cropping
Automatic cropping provides automatic methods for extraction of regions of interest (ROI) through connected components analysis with precision cropping with bounding boxes expanded adaptively. The goal of this stage is to eliminate areas of the image, which are not required to be further analyzed since this helps reduce the efficiency of the image analysis in this part. Thresholding is the first stage of automatic image cropping, which changes the image to a simpler format that highlights the ROI. Thus, the enhanced image stage is converted into grayscale for reduction of complexity as it eliminates the color channels and works only on the intensity values. The next step is binary and connected component analysis (CCA) where small blobs of pixels are detected and labeled based on their adjacency (i.e., connected region). Every connected area is considered an object. Following the execution of CCA, a process known as the bounding box stage is performed to identify the ROI’s boundaries. Based on these calculations around this region, the most significant connected component, which is most probably the ROI, is chosen. Bounding refers to a rectangular frame that surrounds the entire component and maximizes the area of the image that sets the key regions of the image. The last stage of this stage is image cropping, where a number of regions within the image are cropped, with a view of eradicating unessential sections while retaining relevant information. The image is cropped based on the bounding box dimensions. This process eliminates backgrounds and focuses on the most important regions of the image. A small border is added to the bounding box to ensure that no important details near the boundaries are lost during cropping. The equation for automatic cropping is as follows, where
is the thresholded image:
Connected component analysis identifies regions, and the bounding box
is determined as follows:
The image is cropped to this bounding box with a margin,
, to retain key details:
3.5. Feature Detection
In this session, a hybrid feature extraction mechanism with the integration of ORB and SIFT was presented for enhanced scale-invariant, rotation-resilient key point detection and descriptor robustness. Feature detection is an important stage in the schematic flow proposed in this paper, as it involves identifying key points in the images that can be used to match two images. This image processing pipeline uses ORB (Oriented FAST and Rotated BRIEF) and SIFT (Scale-Invariant Feature Transform) algorithms for feature detection. ORB was chosen because of its efficient algorithm for detecting critical points and calculating binary descriptors, which can be optimized for real-time applications. This method is powerful for rotation and provides a fast binary descriptor. SIFT offers high accuracy in detecting and describing key points that are unchanged to scale, rotation, and partial lighting changes. The last stage is the merger of ORB and SIFT. This stage takes advantage of the speed of ORB and the robustness of SIFT, so it is expected to improve the detection toughness, increasing the possibility of accurate matching. Feature detection involves detecting key points
and their corresponding descriptors
, key points
and descriptors
are computed using ORB:
Key points
and descriptors
are computed using SIFT:
The combined key points and descriptors are:
3.6. Feature Matching
This feature matching session presents optimized key point correspondence through ORB and SIFT descriptor integration: dual-metric matching with Hamming distance, L2 Norm, and Lowe’s Ratio test for enhanced accuracy. The feature-matching process applies to establishing a correspondence between the key points detected in two binocular images. The key point matching process used brute force matching. This technique compares the point descriptor from the first image with the point descriptor from the second image, and the best point match is identified based on the L2 norm distance. Lowe’s ratio test is applicable to filter out inconsistent matches because it only considers the ratio of distance to the model and the next nearest model. The best point-matching parameters are those with point matches better than the second-best match by more than 0.75 times the distance value in the matching process. This method reduces the number of false point matches and increases the reliability of key point matching. The key point matchings are then ranked according to distance, where nearer distances represent closer feature point matching. This sorting process allows for prioritization of the most accurate key point matches before the refinement stage. Feature matching is performed by finding correspondences between descriptors
from image
and
from the image
; matches are found by minimizing the distance
between descriptors:
Matches are filtered using Lowe’s ratio test:
where
is a predefined threshold, and
is the second-closest match.
3.7. RANSAC-Based Outlier Removal
Random sample consensus (RANSAC)-based refinement presents an enhanced match refinement pipeline: m-estimator sample consensus (MAGSAC)-driven RANSAC optimization with iterative parameter tuning and LMEDS integration for superior outlier exclusion and transformation accuracy. To increase the accuracy of the key point matching, this image processing pipeline integrates a RANSAC-based refinement method. RANSAC is used to remove outliers from key point matching, where it randomly selects a subset of match points and estimates the model using only these points in each iteration. This image processing pipeline also involves key point matching using an algorithm known as MAGSAC, which is an enhancement of RANSAC. MAGSAC increases the precision of model estimation and accuracy of image alignment by eliminating outliers by dynamically estimating the threshold for inliers. At this point, it adjusts the RANSAC reprojection error threshold. RANSAC is run several times with different parameters, and the best match is selected to improve the durability of the matching process. In addition, the use of least median of squares (LMEDS) offers a good alternative for estimation especially when RANSAC is subjected to a certain noise or outlier. These refinement measures are critical to ensuring that the final set of key points matches is free of fake correspondence, leading to more accurate and reliable results at the next stage of analysis. Let
be the homography matrix that transforms points
from image
to points
in image
. The MAGSAC approach minimizes the error between the transformed points while simultaneously estimating the optimal inlier threshold
. The homography matrix
is determined by solving:
where
is the M-estimator cost function dependent on the dynamically estimated inlier threshold
. MAGSAC modifies the RANSAC algorithm by estimating
during the model fitting, which ensures robustness against outliers.
3.8. Visualization
This session presents an improved correspondence visualization. In this stage, exclusive inlier key point mapping and Matplotlib representation are performed to ensure accurate match validation. Following the application of MAGSAC, the subsequent steps involve plotting the inlier’s key points and showing the accurate key points in the scene. The Matplotlib is used to explain the key points matching where there is a representation of how the matching was conducted. This stage is crucial to ensure the accuracy of the match between the key points obtained and to get the relationship between the binocular images. The application of the color codes or connecting lines between the key point matching enhances the understandability of visualization in case the efficiency of key point matching can be effectively assessed and explained. Finally, the inlier matches
are visualized by displaying the correspondences between images
and
:
4. Experimental Setup
4.1. Dataset
The dataset used for this research was taken from our lab data, where the images consisted of a sequence of independently constructed scenes to simulate an ice-covered transmission line to facilitate the collection of PTLI imagery data. This research simulates the transmission line using long-cylinder pearl cotton (expandable polyethylene, EPE), EPE utilized in this study was sourced from Guangdong Chuanling New Material Technology Co., Ltd., located in Dongguan, China. Expanded polystyrene foam (EPS) is attached to the surface to simulate an ice load layer, EPS used in this study was sourced from Hebei Sashang Technology Co., Ltd., located in Shijiazhuang, China. The dataset in this research consists of 20 PTLI images taken randomly. PTLI images were collected using Daheng binocular cameras in a controlled indoor environment. The Daheng binocular cameras are manufactured by Daheng Imaging, based in Beijing, China. The image simulates a landscape covered in ice or a white sky.
Figure 2 shows the PTLI image used for the experiment.
4.2. Evaluation Metrics
Evaluation metrics to assess the performance of image processing pipelines that focus on accuracy and practicality in PTLI imagery applications. The key point matching scheme proposed in this paper will be evaluated using a combination of quantitative, qualitative, and computational metrics, which are benchmarked against the traditional method. Key point matching accuracy is one of the critical metrics measured using the correct match ratio to the total number of key points detected. In addition, the accuracy of the 3D measurement of the ice thickness is evaluated by calculating the measurement error between the estimated 3D coordinates and the actual measurement. This evaluation is necessary in the case of ice thickness estimation, for which inaccuracies in measurements can result in deviations in the PTLI monitoring. Another criterion of the assessment of image processing pipelines is time complexity, which is applied to measure the time costs for detection, matching, and refinement of the key points necessary for the identification of relations to ensure that operations in these pipelines can be performed in real time. Finally, resistance to noise and varying lighting conditions were evaluated through comparative tests with traditional methods, determining how well the image processing pipeline in this study performed in a challenging environment. The traditional method will use the standard key point detection and matching techniques without the enhancements introduced in this paper.
4.2.1. Quantitative Metrics
Precision, recall, and F1 scores will be calculated to assess the accuracy and effectiveness of key point detection and key point matching in these evaluation metrics. These metrics can be computed using the matches and their validity as determined by the RANSAC refinement. Precision is employed to determine the ratio of the truth of all the matching key point coordinates detected to the total key point matches. This metric is to show how effective the correct match is from the wrong match compared to the baseline. Precision is the ratio of correctly matched key points (true positives) to the total number of points detected as a match (the sum of true positives and false positives). This metric provides insight into the accuracy of the proposed pipeline by measuring how many matching key points are correct.
High precision means that most of the key points that are matched are correct, but it does not mean that all key points have been matched (ignoring missed matches).
Recall measures the proportion of true matches identified out of all possible true matches. A high recall means the method successfully captures most of the correct matches. It indicates how well the algorithm detects all the correct key points.
A high recall means that the algorithm finds most of the correct key points but may include some incorrect matches.
The F1 score is a measure derived from precision and recall that is the average of the two and that takes into consideration false positives and false negatives.
The F1 Score is a representative of the quality, where the maximum value is 1, while the minimum is 0, where 1 corresponds to the best result without error in both precision and recall.
4.2.2. Qualitative Evaluation
Quantitative analysis by visual assessment, which includes an analysis of the accuracy and efficiency of the detection and matching of the critical points on PTLI image. Key points are mapped onto the PTLI images to assess their accuracy, distribution, and relevance in terms of issues such as low light. Features points between the pairs of PTLI images are then compared to determine the accuracy of alignment, consistency, and reduction of false matches through line plots. This method helps determine the robustness of an image processing system compared to a baseline, especially in difficult environments and low light.
Key point detection on visualization is the result of the critical point detection that will be visualized on the image for a comprehensive examination. The objective is to evaluate the accuracy, distribution, and relevance of these key points matching between pairs of PTLI images. It is required to evaluate key factors such as:
Density: How many critical points are detected throughout the image.
Dispersion: The spread and coverage of points across different regions of the image.
Effectiveness in challenging environments: Detection algorithm performance in low-light conditions.
These factors will be compared between the proposed image processing pipeline and the basic method for determining performance improvements, especially in detecting critical points in difficult lighting situations. Visualization on key point matching stage is the quality of the matching point between two pairs of PTLI images is visually assessed. Line plots are used to connect the corresponding key points in each PTLI image, allowing for visual comparison. Evaluation metrics include:
Alignment Accuracy: The correlation between a given contribution of one image to the overall images and how well this related to the other in the other pairing relating to the image.
False match reduction: the capability of the proposed pipeline to reduce the possibility of false match between the points.
Consistency: The stability of feature matching of the key points in the images in different stages and scenes.
This comparison will be created with a baseline method, aiming to demonstrate improvements in alignment and match quality.
The qualitative evaluation will be conducted using images of the artificial PTLI under low-light conditions, emphasizing the robustness of the proposed pipeline in maintaining accurate key point detection and matching in such challenging environments.
4.2.3. Computational Efficiency
Based on time, computational efficiency estimates how a pipeline of image processing for detecting and matching crucial points takes. This evaluation will be documented and then compared with the conventional methods that have been used. The part and parcel of this process is key point detection and matching. The time needed for the identification of the critical points through ORB or SIFT, then checking for the correspondence through the Lowe’s ratio test and then applying the RANSAC to refine the result. An evaluation will be made to analyze the trade-off between achieving higher accuracy and overall computation time concerning PTLI’s image dataset.
4.2.4. Application to Ice Thickness Measurement
The pipeline application proposed in this paper is for ice thickness measurement on PTLI, which will be evaluated through several main metrics. The accuracy of the pipeline in 3-D measurements and ice thickness measurements compared to traditional methods. The results will be validated against ice thickness measurements using manual measurements of micrometers (ground truth) to assess the improvement in precision and accuracy. This comprehensive evaluation aims to explain the advantages of this pipeline in terms of precision, recall, and computational efficiency, especially in PTLI images where accurate ice thickness measurement is necessary for monitoring on PTLI.
4.3. Experimental Procedure
The experimental procedure involves a detailed description of the settings, software environment, and hardware specifications used during the evaluation.
Setup The experimental setup includes a controlled environment for processing PTLI images. The PTLI image dataset includes images taken under various lighting conditions and ice accumulation scenarios on power transmission lines. The images will be pre-processed and further improved through the application of the proposed pipeline for key point detection and 3D reconstruction.
Software environment: The experiment software includes OpenCV 4.0 image processing libraries, and it uses the Python 3.11 platform to develop key point detection and matching algorithms including ORB, SIFT, and Lowe’s ratio test. Metrics for evaluation and analysis will be based on Python, which includes tools for visual inspection, 3D reconstruction, and computation performance measurement.
Hardware specs: The experiment will be performed on a high-performance workstation with a multi-core processor (Intel i9) having at least 32 GB of RAM to deal with PTLI image dataset. The multi-core processor used in this study is the Intel i9, manufactured by Intel Corporation, located in Santa Clara, California, USA for key point detection, matching and 3D reconstruction. NVIDIA Corporation, located in Santa Clara, California, USA are employed to accelerate the process. As for storage, SSDs are used in order to have fast and efficient data storage and retrieval.
This experimental procedure ensures that the experiment is conducted in an optimal environment to accurately measure performance, computing efficiency, and accuracy.
5. Results and Discussion
5.1. Performance of the Image Enhancement Techniques and Automatic Cropping
This session aims to demonstrate the success of improving the quality of PTLI images to improve the visibility of the ice load and facilitate 3D reconstruction. In addition, the result of automatic cropping was also shown in this session. Automatic cropping searches for regions of interest and removes unnecessary areas.
Figure 3 shows the results of image enhancement.
Figure 3 demonstrates the performance of the image enhancement techniques and the automatic cropping process. In
Figure 3a, the original PTLI image is shown as a baseline.
Figure 3b displays the enhanced image, where visibility and contrast are improved through the applied image enhancement techniques. Finally,
Figure 3c illustrates the results after image denoising on the PTLI image, where noise has been reduced while preserving important details for further analysis.
Figure 4 illustrates the automatic cropping process applied to a PTLI image. In
Figure 4a, the framing of the region of interest (ROI) shows how the relevant portion of the image is identified before automatic cropping is performed.
Figure 4b presents the results after the automatic cropping has been completed to the PTLI image, where unnecessary background elements have been removed, focusing on the key area for further image analysis.
5.2. Key Point Detection and Matching Accuracy
The accuracy of key point detection and matching is analyzed by comparing the ORB, SIFT, ORB2, and the proposed methods in this paper. The accuracy is calculated using precision, recall, and F1 score in Equations (19)–(21). In this experiment, 20 PTLI images were taken at random.
Figure 5 shows the precision comparison graph of several methods.
The performance results of each method of accuracy calculation using precision in Equation (12) have been shown in
Figure 5. Based on
Figure 5, the proposed method consistently demonstrates superior precision in most image pairs. The average precision of the proposed method is 90%, the highest average precision among other methods. SIFT and ORB2 have an average precision value of around 82% and 85%, respectively. The precision values of the two methods are still below the proposed method. The lowest precision value is the ORB method, which only has an average precision of 80%.
Figure 6 shows the recall comparison graph of several methods.
The recall values of all methods are presented in
Figure 6 above. As can be seen in
Figure 6, the proposed method gives the highest average recall value, which is 82 percent. This average recall shows that it has been more successful in detecting and matching relevant key points than other methods. The average recall values for SIFT, ORB, and ORB2 only ranged from 62–63%, indicating that several key points were not successfully detected or matched on several pairs of PTLI images.
Figure 7 shows the F1 score graph of several methods.
A comparison of the F1 scores from several methods for the detection of key points and pairs has been shown in
Figure 7. The F1 score is a precision and recall balancer, based on
Figure 7, which is shown to be the highest achieved by the proposed method, with an average F1 score of 86%. ORB2 ranks second with an average F1 score of 73%. SIFT and ORB have the lowest F1 score of the proposed method and ORB at only around 71–70%.
Based on
Figure 5, particularly in image pair number 8 (Image 8L and Image 8R) and number 16 (Image 16L and Image 16R), as well as in
Figure 6 and
Figure 7, a striking difference in results among the other image pairs can be observed. The causes of degradation in these PTLI images are mainly related to environmental factors that affect key point detection and matching accuracy. In particular, image pairs numbers 8 and 16 show problems with low contrast due to low illumination conditions, resulting in fewer key points being detected, which interferes with accurate key point matching and reduces overall matching performance. For image pair number 8 and 16 of PTLI images, the lowest precision and recall values are held by the SIFT method, with a precision value of 36% and recall with a value of 20%. The best performance is shown by the ORB2 method with a precision value of 79%, but it is still categorized as medium precision. The proposed method also experienced difficulties in identifying the key points in the number 8 image pair, where the precision value was only 40%, even though the average precision was 90%. The image pair that shows the best performance on all key point detection and matching methods is number 6 (Image 6L vs. Image 6R). The proposed method has a precision of 90%, which leads to an F1 score of 93.4% in image pair number 6. The best performance on the proposed method was also shown in the number pairs of number 17 and 19, in which the number pairs reached 90% on all metrics.
Based on
Figure 5,
Figure 6 and
Figure 7, a conclusion can be drawn from comparing all methods for key point detection and matching. The advantage of the proposed method is that it is more consistent in almost all PTLI image pairs. The advantages of this method are shown in all metric values, namely precision, recall, and F1 scores, for all methods. The proposed method has a better balance between precision and recall among all other methods, which indicates that this method has the highest F1 score. Meanwhile, ORB2 is generally better than ORB in some pairs of PTLI images, especially in the average precision value. The average recall value almost shows comparable values between the two, but ORB2 is better overall, especially in the analysis using F1. SIFT is a traditional method for detecting and matching key points, so based on the analysis using metrics in Equations (19)–(21), the value is the lowest among the ORB2 method and the proposed method in most cases in PTLI image pairs. Therefore, SIFT has made many mistakes in the key point matching process. Summary statistics for the methods table are shown in
Table 1.
Based on
Table 1, the highest average values across all metrics (precision, recall, and F1 score) were achieved by the proposed method. This proposed method has a relatively low variability (standard deviation), so it shows reliable and consistent performance. ORB2 occupies the precision value and F1 score number two of all methods. Then, lower average performance and higher variability, especially in recall, are shown by the SIFT and ORB methods.
5.3. Analysis of Computational Efficiency for Key Point Detection and Matching
The overall efficiency of the method for key point detection and key point matching is evaluated by measuring the processing time for key point detection and key point matching.
Figure 8 shows the processing times for key point detection and key point matching of all methods.
The processing time of four methods (ORB, ORB2, SIFT, and proposed method) for key point detection and matching in 20 pairs of images are visually compared in
Figure 8. Based on
Figure 8, the longest processing time shown is by the SIFT method, and the average processing time is 1 s more on most PTLI image pairs. The SIFT line graph shows the longest processing time of about 1.4 s in several pairs of PTLI images. The ORB method shows the second-longest processing time after SIFT. Some image pairs reach 1 s, with small fluctuations across image pairs, so, ORB is more efficient than the SIFT method. Based on the analysis of
Figure 8, it is shown that ORB2 is better than SIFT and ORB, with an average processing time of 0.1 s. The fastest processing time, among other methods, is the proposed method. It consistently takes an average of 0.01 s, making it the most efficient method with a significant margin among other methods. In addition, this method is ideal if applied in real time.
5.4. Application to 3D Measurement and Ice Thickness Estimation
The proposed key point detection and matching methods are tested for their performance in applying 3D measurements for ice thickness for PTLI imagery.
Table 4 shows the results of applying the proposed method to 3D measurements for ice thickness on PTLI. This paper focuses only on the application of key point matching in 3D measurement. The 3D measurement is used here only to verify the performance of key point matching for ice thickness measurements in transmission lines. A micrometer is used for manual measurements that will be used for ground truth. In this measurement, SIFT [
31] and ORB2 [
2] are used for comparison. Finally, the proposed method is compared with other methods for applying 3D measurements for ice thickness for PTLI images.
Table 4 shows the results of manual measurements used for ground truth on the thickness of the ice on the power transmission line. Measurement results using the SIFT method result in higher measurements than manual measurements; in some cases, the PTLI image has a large deviation (e.g., 100.2 mm vs. 88.5 mm for Location 1). The measurements using the ORB2 method present results closer to manual measurements when compared to the SIFT method, but some cases still show significant differences. The proposed method successfully provides more accurate measurements, which are very close to the results of manual measurements at most PTLI locations. As in the absolute error column, where the error ranges from 0.55–1.8 mm. Based on
Table 4 regarding the absolute error between the proposed method and the manual measurement, it is mostly low. This method indicates that the proposed method has a good performance in the ice thickness measurement on the PTLI. The most significant error was at location 11, which was 1.8 mm. However, the overall average error is below 2 mm, which indicates that the proposed method is more reliable and accurate than other methods.
The findings of this study have significant implications for PTLI. The capabilities of the proposed method are to improve image quality and the accuracy of direct key point detection on better monitoring of power lines, especially in challenging environments with dim or complex light backgrounds. The efficiency of this method in terms of processing time is excellent for real-time monitoring and maintenance, which can potentially improve the reliability and safety of the power transmission system. In addition, precise 3D measurement capabilities can be instrumental in measuring ice loads on transmission lines, which is crucial for preventing line damage or service interruptions.
Despite the success of the proposed method, some limitations must be acknowledged. First, the PTLI image resolution may limit the accuracy of key point detection in some cases, especially in areas with low contrast. Additionally, while processing time is significantly reduced, further improvements are required to ensure scalability across large datasets or higher-resolution imagery without sacrificing accuracy. The method’s performance can also be affected by drastic changes in lighting or texture, where further adjustments may be required. The accuracy also needs to be improved.
Future research may focus on several avenues to improve the proposed image processing methods. One promising area is optimizing methods for faster processing times, ensuring scalability for large-scale industrial applications. Another direction could involve exploring these methods outside of power transmission lines, such as aerial inspections of other critical infrastructure, such as bridges or pipelines. In addition, integrating real-time monitoring capabilities, potentially through drones or automated systems, can significantly improve the application of pipelines in real-world scenarios.
6. Conclusions
The proposed method significantly improves the image enhancement key point detection and matching processes, especially for PTLI images. Image enhancement techniques result in better visibility in low-light conditions and successful noise reduction while retaining important details. In addition, the automatic cropping method effectively isolates the region of interest, removing unnecessary areas and speeding up the overall image processing. Key point detection and matching are also enhanced with the proposed method, which consistently outperforms traditional methods such as SIFT and ORB regarding precision, recall, and F1 score, with an average precision of 90% and an F1 score of 86%. Pipe computing efficiency reduces processing time to as low as 0.01 s, making it feasible for real-time applications. Finally, the proposed method can provide accurate 3D measurements of the thickness of the ice on the power transmission line, with an absolute error of less than 2 mm compared to manual measurements.