To address the evaluation problem of 3D reconstruction in inner-cavity visual SLAM, using real data of human inner cavities has limitations and is difficult to fully test during the research stage. Therefore, based on the actual needs of inner-cavity scenes and limitations such as the equipment and data collection, this paper constructed a simulated inner-cavity experimental platform and used a simulation model to test the 3D reconstruction of inner cavities in a real indoor scene using an inner-cavity monocular SLAM system. In order to recover the scale factor of the inner-cavity monocular SLAM system and conduct accuracy evaluation, this paper combined two feasible methods: intra-cavity calibration object co-reconstruction and external pre-calibration, and mainly calculated the measurement error of the inner-cavity SLAM from the perspective of the absolute size error, and used the root-mean-square error (RMSE) to statistically calculate this error value. During surgery, inspection through laparoscopy focuses more on local measurement information of target areas or specific lesions. After obtaining the reconstructed point cloud model, typical areas of the target organ or soft tissue surface that are significant in the inner-cavity are selected as the target lesion area. Local conformity evaluation is adopted, and the quantitative evaluation of the system’s 3D reconstruction accuracy is carried out from the direction of size conformity of the feature structure.
2.1. Intracavitary Calibrator Reconstruction Method
Before calculating the error between the reconstructed measurements and the real size, the idea of the GCP (ground control point) based on the significant ground control points is adopted, and the coordinate transformation relationship is corrected by establishing the coordinate transformation relationship in the base image data with precise information, so that the a priori knowledge obtained by other methods (such as conventional measurements) can be used to establish the estimation relationship, and the point cloud model obtained from the reconstruction coordinates to the world coordinates with real scale. In the 3D reconstruction of the simulated internal cavity, the conversion relationship between the original coordinate system of the reconstructed model and the real-scale world coordinate system is obtained by the known conversion relationship between the actual dimensions of the calibrated object and its dimensions in the 3D point cloud model, so that the 3D reconstruction results can be corrected for the relative scale. Therefore, for the scale uncertainty problem between the reconstructed estimated size of the target object size in the environment and the actual true size, the scale scaling factor
s is defined as follows:
where
and
are the unitless quantized values of the specific object dimensions in the reconstructed model and the unitary quantized values of their corresponding object dimensions in the actual scene, respectively.
The intracavity calibrator reconstruction method uses the calibration template in the scene to align the 3D reconstruction results to the reconstructed coordinates with standard dimensions, i.e., the 3D coordinates of specific feature points in the reconstruction data under the real space are obtained, and then the dimensions of the target lesion in the internal cavity can be calculated, and the real structure dimensions measured in its reality are calculated and counted, and the dimensional fit of the reconstruction results of the internal cavity target can be obtained.
In the 3D point cloud data-measurement analysis, PCL is used to extract the feature corner points of the target organ tissue surface in the map model using a manual point selection method for measurement. For example, the longest diameter of an irregular lesion can be obtained by using the lesion contour endpoints through the Euclidean distance calculation.
The evaluation index RMSE is calculated as shown in the following Equation (2), where
is the measurement value of the corresponding target in the 3D model obtained by fitting the reconstructed point cloud,
is the real size of the actual model, and
N is the number of measurements.
A calibration template is placed in the scene and used as part of the environment for 3D reconstruction, as shown in
Figure 2. The actual dimensions of the calibration template can be used as a priori knowledge as a reconstruction constraint; thus, the scale factor can be recovered. Because in intracavitary surgery, a priori knowledge can be provided by introducing known-size micro instruments or other surgical tools as calibrators, with the actual size and the estimated size of the reconstruction, the scale factor can be calculated; therefore, the size of the reconstructed model at the actual scale can be recovered.
2.2. Results and Discussions
To evaluate the 3D reconstruction of endoscopic-vision SLAM, using real data from human body cavities in the research stage has certain limitations and makes it difficult to conduct sufficient testing. Therefore, based on the actual needs and constraints of endoscopic scenarios, equipment, and data acquisition, this paper buildt a simulated endoscopic experimental platform, using a synthetic model to test in a realistic indoor environment, and performed 3D reconstruction of the simulated cavity using an endoscopic monocular SLAM system. In order to recover the scale factor of the endoscopic monocular SLAM system and evaluate its accuracy, this paper combined two feasible methods: co-reconstruction with an intracavitary calibration object and pre-calibration outside the body, based on the minimally invasive surgery scenario. Then, mainly from the perspective of absolute size error, we calculated the measurement error of endoscopic SLAM and use the root-mean-square error (RMSE) to quantify this error value. Moreover, considering that during surgery inspection using laparoscopy, more attention was paid to local measurement information of the target areas or specific lesions; we selected typical parts with prominent features on target organs or soft tissue surfaces in the cavity as lesion areas after obtaining the reconstructed point cloud model. We used a local alignment evaluation method to quantitatively evaluate the 3D reconstruction accuracy of the system from the direction of feature structure size alignment.
This section verifies the effectiveness and reliability of the endoscopic monocular SLAM system in a simulated abdominal puncture exploration by building a simulated human body cavity experimental platform. It mainly focuses on conducting multiple endoscopic monocular SLAM reconstruction experiments around the designed intracavitary accuracy evaluation method and quantitatively evaluating the accuracy of the reconstruction data as well as analyzing and discussing the corresponding results. Thus, it studied the validity of the proposed evaluation method for this scenario and determined the precision of the system. In this section, the relevant experimental hardware configuration of the endoscopic monocular vision SLAM-based 3D reconstruction platform is shown in
Table 1.
In the endoscopic 3D reconstruction experiment based on monocular SLAM, this paper simulated a realistic minimally invasive surgery scenario by using a synthetic abdominal cavity box and a straight rod laparoscope on a mechanical experimental platform, as shown in
Figure 3a. The abdomen of the simulation box was made of silicone material, and a stomach organ model was placed in the box as the target subject of the intracavitary experiment. A puncture tool could be used to insert the laparoscope from the silicone abdomen for exploration. On the endoscope, we used a 30-degree straight rod laparoscope with a light source, which had a diameter of 10 mm, a front-end monocular camera resolution of 640 × 480, a frame rate of 30fps, and eight LED ring light sources around the lens. Moreover, its 30-degree oblique angle was consistent with the clinical surgery perspective, as shown in
Figure 3b. At the same time, in order to restore the dark environment inside the cavity under real laparoscopic surgery as much as possible, we always ensured that there were no other light sources in the experimental environment. We only used the built-in light source of the laparoscope as an illumination condition to simulate shooting inside the abdominal cavity in an almost completely dark laboratory.
This paper uses Zhang’s calibration method to calibrate the intrinsic parameters of the laparoscope used in the experiment. The calibration work uses a 9 × 7 chessboard, with a single black and white square size of 24 mm. As shown in
Figure 4a, 25 images were taken from multiple rotation angles through the laparoscope as the calibration image source.
Figure 4b shows the detection result of the calibration image, where the green circles are the chessboard corner points extracted in the calibration, and the red crosses are re-projection points, representing the spatial three-dimensional points of the corner points re-projected to the calibration image according to the calibrated intrinsic parameters. They are used to evaluate the accuracy of the calibration results. The smaller the re-projection error, the better the calibration accuracy.
To enhance the calibration accuracy, we used the average pixel error of the images to filter and eliminate individual chessboard pictures, and finally obtain the remaining 15 pictures to calibrate again. After, the calibration was completed; the overall error of its results is shown in
Figure 5. The maximum calibration error was 0.26 pixels, and the average calibration error was 0.20 pixel, which met the calibration error requirements. This indicates that the laparoscope has a high calibration precision, and its calibration results have a high credibility.
The corresponding internal reference data of the laparoscope can be obtained through laparoscopic calibration work, and the specific values are shown in
Table 2.
Among them, , and are radial distortion parameters, and and are tangential distortion parameters, which are used to correct different degrees of distortion caused by material and production process factors during imaging. In visual SLAM, in order to obtain accurate results, the camera intrinsic parameters are usually treated as known parameters for camera file configuration, and the corresponding parameter files need to be imported before the system runs.
Firstly, for the method of converting coordinate relations with the help of a known calibration template given in
Section 2.1, we used a chessboard for auxiliary measurements and placed a chessboard plane target near the target organ tissue in the reconstructed cavity. Since the length of each small square on the chessboard surface was precise, and the laparoscope intrinsic parameters were known, we could obtain the coordinate information of each chessboard corner point according to the projection matrix and transformation relationship between coordinates. Therefore, we could recover the scale factor of the translation vector in the monocular SLAM imaging process and obtain a reconstructed point cloud model with real scale.
In the intracavitary reconstruction error evaluation experiment, our overall shooting condition was in a realistic indoor environment. We turned off all external light sources, and put the “stomach” organ model shown in
Figure 6a as the target object into the human abdominal cavity simulation box. By holding the laparoscope, we simulated exploration with its own light source and took image sequence data from multiple angles and positions around the stomach, which served as the original input data for the monocular intracavitary SLAM system. As shown in
Figure 6b, a checkerboard grid planar target was placed in the simulation box as a calibrator to participate in the accuracy test of monocular internal cavity SLAM. And in order to further evaluate the accuracy of 3D reconstruction quantitatively, a number of linear segments with more obvious and easy-to-measure characteristic corner points were manually selected for measurement in the internal scene of the simulated box taken by laparoscope, as shown in
Figure 6b (red lines numbered as 1, 2, 3 and blue lines numbered as 4, 5, 6). The measurement results of the system reconstruction model were compared with the actual distances obtained from real measurements and the errors were calculated. The selected sample line segments are the numbered sections in the figure.
Figure 7 shows the reconstructed point cloud after obtaining the real scale. Because the surface of the organ model used does not have too many blood vessels or undulating structures compared to the soft tissue of the real internal organs, the blank area of the model surface is too smooth and lacks texture. So, fewer feature points can be extracted, which makes the reconstructed model have corresponding deficiencies and holes. On the other hand, because the laparoscope focuses on the location of the selected target measurement area during the reconstruction process, the acquisition for the right side of the model is not sufficient. However, it can be seen that the point cloud model obtained from the 3D reconstruction is basically able to provide a more correct and detailed representation of the model of the target region and the tessellation target, and the parameters of the selected sample line segments are calculated and measured by this reconstructed model.
The accuracy is calculated from the model by selecting the average measured distance from the source to the target points, and the experimental measurement comparison results are shown in
Figure 8. The percentages on the bar graph represent the relative error percentage between the reconstructed data and the real measurement data. The RMSE of the selected local structure dimensions is 2.12 mm, and the root-mean-square value of the relative error is 8.04%.