A Machine Learning Method Based on 3D Local Surface Representation for Recognizing the Inscriptions on Ancient Stele
Abstract
:1. Introduction
2. Related Work
2.1. Basic Geometrical Features
2.2. Relief Extraction Using Machine Learning
2.3. Spin Images
3. Proposed Method
3.1. Preprocessing
3.2. CSI Feature Extraction
- Relative vertices localization: As mentioned above, to enhance the classification performance at each vertex, the representation of a single SI is augmented with the SIs of its neighboring vertices. To estimate the position of the vertices along the principal curvatures of , the curvature tensors and the curvature derivatives are initially determined, and then the principal curvatures whose directions are denoted by and are obtained [33]. 3D points at a distance of from along , , , and are identified. Since the points are probably not on the mesh surface, we project the 3D points on the mesh and find the vertices closest to the projections. For clarity, we illustrate in Figure 4 how and are searched along the maximal principal direction; the same procedures are repeated in the minimal principal direction to search for and . Once , , , and are determined, the SI information of these surface points and are retrieved from the SI database.
- Sum-pooling: To reduce the dimension of , each SI is subsampled by using a sum-pooling operation with filters and stride s. That is, adjacent bins of the SI are summed up to yield a single bin of the subsampled SI. Therefore, the dimensions of each SI reduces from to . The purpose of this operation is to reduce the size of the SIs while preserving important information before they are vectorized. The sum-pooling operation is inspired by the multi-resolution SI, which applies the concept of pyramid structure of an image to compress a high resolution SI [27]. It is notable that the sum-pooling operation is actually identical to generating SI by partitioning the same 3D space into smaller number of larger subvolumes. In our experiments, s was set to 2.
- Vectorization: This operation converts the original SI information to vector of length for easy merging with the four subsampled SIs. The operation is also repeated for the four subsampled SIs before they are merged. For each of the subsampled SI, the length of the vector is .
- Concatenation: All five vectors obtained from the output of the vectorization stages are concatenated to a single feature vector of length . The concatenated array forms the CSI and it is denoted as .
3.3. Classification
4. Experimental Results and Discussion
4.1. Hyperparameters Search
4.1.1. SI Parameters Search
4.1.2. RF Hyperparameters Search
4.1.3. Effect of the Smoothing Filter
4.1.4. Search
4.1.5. Size of Neighboring Vertices in k-NN
4.2. Effectiveness of the Proposed Method
4.2.1. SI and CSI
4.2.2. Two-Class and Three-Class RF(CSI)
4.3. Evaluation of the Proposed Method with Other Existing Methods
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
CSI | Cross spin image |
k-NN | k-nearest neighbor |
RF | Random forest |
SI | Spin image |
SIRI | Segmented inscription recognition index |
SVM | Support vector machine |
References
- Johnson, N. Cast in Stone: Monuments, Geography, and Nationalism. Environ. Plan. Soc. Space 1995, 13, 51–65. [Google Scholar] [CrossRef]
- Fitzner, B.; Heinrichs, K. Damage diagnosis at stone monuments-weathering forms, damage categories and damage indices. Acta Univ. Carol. Geol. 2001, 1, 12–13. [Google Scholar]
- Camuffo, D. Physical weathering of stones. Sci. Total. Environ. 1995, 167, 1–14. [Google Scholar] [CrossRef]
- Wikipedia Contributors. Wikipedia. Available online: Https://en.wikipedia.org/wiki/Stone-rubbing (accessed on 18 March 2021).
- Defrasne, C. Digital image enhancement for recording rupestrian engravings: Applications to an alpine rockshelter. J. Archaeol. Sci. 2014, 50, 31–38. [Google Scholar] [CrossRef]
- Kolomenkin, M.; Shimshoni, I.; Tal, A. Prominent field for shape processing and analysis of archaeological artifacts. Int. J. Comp. Vis. 2011, 94, 89–100. [Google Scholar] [CrossRef] [Green Version]
- Lawonn, K.; Trostmann, E.; Preim, B.; Hildebrandt, K. Visualization and extraction of carvings for heritage conservation. IEEE Trans. Vis. Comput. Graph. 2016, 23, 801–810. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Zatzarinni, R.; Tal, A.; Shamir, A. Relief analysis and extraction. ACM Trans. Graph. (TOG) 2009, 28, 1–9. [Google Scholar] [CrossRef]
- Torrente, M.L.; Biasotti, S.; Falcidieno, B. Recognition of feature curves on 3D shapes using an algebraic approach to Hough transforms. Pattern Recognit. 2018, 73, 111–130. [Google Scholar] [CrossRef] [Green Version]
- Romanengo, C.; Biasotti, S.; Falcidieno, B. Recognising decorations in archaeological finds through the analysis of characteristic curves on 3D models. Pattern Recognit. Lett. 2020, 131, 405–412. [Google Scholar] [CrossRef]
- Harman, J. Using decorrelation stretch to enhance rock art images. In Proceedings of the American Rock Art Research Association Annual Meeting, Sparks, NV, USA, 28–30 May 2005; Volume 28. [Google Scholar]
- Gomes, L.; Regina Pereira Bellon, O.; Silva, L. 3D reconstruction methods for digital preservation of cultural heritage: A survey. Pattern Recognit. Lett. 2014, 50, 3–14. [Google Scholar] [CrossRef]
- Liu, S.; Martin, R.R.; Langbein, F.C.; Rosin, P.L. Segmenting periodic reliefs on triangle meshes. In IMA International Conference on Mathematics of Surfaces; Springer: Berlin/Heidelberg, Germany, 2007; pp. 290–306. [Google Scholar]
- Anderson, S.; Levoy, M. Unwrapping and visualizing cuneiform tablets. IEEE Comput. Graph. Appl. 2002, 22, 82–88. [Google Scholar] [CrossRef]
- Pan, R.; Tang, Z.; Da, W. Digital stone rubbing from 3D models. J. Cult. Herit. 2019, 37, 192–198. [Google Scholar] [CrossRef]
- Weyrich, T.; Pauly, M.; Keiser, R.; Heinzle, S.; Scandella, S.; Gross, M.H. Post-processing of scanned 3D surface data. In Proceedings of the First Eurographics Conference on Point-Based Graphics, Zurich, Switzerland, 2–4 June 2004; pp. 85–94. [Google Scholar]
- Frangi, A.F.; Niessen, W.J.; Vincken, K.L.; Viergever, M.A. Multiscale vessel enhancement filtering. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer: Berlin/Heidelberg, Germany, 1998; pp. 130–137. [Google Scholar]
- Heo, E.J.; Sa, J.M.; Choi, K.S. Relief extraction from stone monuments. IEIE Trans. Smart Process. Comput. 2018, 7, 321–324. [Google Scholar] [CrossRef]
- Fiorucci, M.; Khoroshiltseva, M.; Pontil, M.; Traviglia, A.; Del Bue, A.; James, S. Machine learning for cultural heritage: A survey. Pattern Recognit. Lett. 2020, 133, 102–108. [Google Scholar] [CrossRef]
- Grilli, E.; Remondino, F. Classification of 3D Digital Heritage. Remote Sens. 2019, 11, 847. [Google Scholar] [CrossRef] [Green Version]
- Belhi, A.; Bouras, A.; Al-Ali, A.K.; Foufou, S. A machine learning framework for enhancing digital experiences in cultural heritage. J. Enterp. Inf. Manag. 2020. [Google Scholar] [CrossRef]
- Liu, S.; Martin, R.R.; Langbein, F.C.; Rosin, P.L. Background surface estimation for reverse engineering of reliefs. Int. J. CAD/CAM. 2007, 7, 31–40. [Google Scholar]
- Zeppelzauer, M.; Poier, G.; Seidl, M.; Reinbacher, C.; Schulter, S.; Breiteneder, C.; Bischof, H. Interactive 3D segmentation of rock-art by enhanced depth maps and gradient preserving regularization. J. Comput. Cult. Herit. (JOCCH) 2016, 9, 1–30. [Google Scholar] [CrossRef]
- Choi, Y.C.; Murtala, S.; Jeong, B.C.; Choi, K.S. Relief Extraction from a Rough Stele Surface using SVM-based Relief Segment Selection. IEEE Access. 2021, 9, 4973–4982. [Google Scholar] [CrossRef]
- Johnson, A.E. Spin-Images: A Representation for 3-D Surface Matching. Ph.D. Dissertation, The Robotics Institute, Carnegie Mellon University, Pittsburgh, PA, USA, 1997. [Google Scholar]
- Park, D.R.; Choi, K.S. A technique for extracting 3D tombstone carvings using spin images. J. Inst. Electron. Eng. 2019, 56, 52–59. [Google Scholar]
- Dinh, H.Q.; Kropac, S. Multi-Resolution Spin-Images. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), New York, NY, USA, 17 June 2006; Volume 1, pp. 863–870. [Google Scholar]
- Belongie, S.; Malik, J.; Puzicha, J. Shape context: A new descriptor for shape matching and object recognition. In Advances in Neural Information Processing Systems; Leen, T.K., Dietterich, T.G., Tresp, V., Eds.; MIT Press: Cambridge, MA, USA, 2001; pp. 831–837. [Google Scholar]
- Tombari, F.; Salti, S.; Di Stefano, L. Unique shape context for 3D data description. In Proceedings of the ACM Workshop on 3D Object Retrieval (3DOR ’10), Firenze, Italy, 25 October 2010; Association for Computing Machinery: New York, NY, USA, 2010; pp. 57–62. [Google Scholar]
- Belongie, S.; Malik, J.; Puzicha, J. Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 509–522. [Google Scholar] [CrossRef] [Green Version]
- Mikolajczyk, K.; Schmid, C. A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1615–1630. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Choi, K.-S.; Kim, D.-H. Angular-partitioned spin image descriptor for robust 3D facial landmark detection. Electron. Lett. 2013, 49, 1454–1455. [Google Scholar] [CrossRef]
- Rusinkiewicz, S. Estimating curvatures and their derivatives on triangle meshes. In Proceedings of the 2nd International Symposium on 3D Data Processing, Visualization and Transmission 3DPVT 2004, Thessaloniki, Greece, 9 September 2004; pp. 486–493. [Google Scholar]
- Cignoni, P.; Callieri, M.; Corsini, M.; Dellepiane, M.; Ganovelli, F.; Ranzuglia, G. MeshLab: An Open-Source Mesh Processing Tool. In Proceedings of the Eurographics Italian Chapter Conference, Salerno, Italy, 2–4 July 2008; pp. 129–136. [Google Scholar]
- Jeong, B.-C.; Choi, Y.-C.; Murtala, S.; Choi, K.-S. Segmented inscription recognition index for measuring the inscription quality extracted from 3-D data. IEIE Trans. Smart Process. Comput. 2020, 9, 445–451. [Google Scholar] [CrossRef]
Trees () | Max. Depth () | Accuracy | Precision | Recall | F1 Score |
---|---|---|---|---|---|
200 | 16 | 0.9548 | 0.8755 | 0.8628 | 0.8691 |
200 | 32 | 0.9560 | 0.8808 | 0.8636 | 0.8721 |
200 | 64 | 0.9559 | 0.8811 | 0.8631 | 0.8720 |
500 | 16 | 0.9549 | 0.8755 | 0.8632 | 0.8693 |
500 | 32 | 0.9562 | 0.8810 | 0.8651 | 0.8730 |
500 | 64 | 0.9562 | 0.8814 | 0.8646 | 0.8729 |
1000 | 16 | 0.9549 | 0.8756 | 0.8635 | 0.8695 |
1000 | 32 | 0.9562 | 0.8808 | 0.8655 | 0.8731 |
1000 | 64 | 0.9562 | 0.8813 | 0.8642 | 0.8728 |
1500 | 16 | 0.9550 | 0.8756 | 0.8637 | 0.8696 |
1500 | 32 | 0.9562 | 0.8809 | 0.8649 | 0.8728 |
1500 | 64 | 0.9561 | 0.8812 | 0.8642 | 0.8726 |
k | Accuracy | Precision | Recall | F1 Score |
---|---|---|---|---|
10 | 0.9610 | 0.8907 | 0.8836 | 0.8851 |
20 | 0.9612 | 0.8921 | 0.8839 | 0.8860 |
30 | 0.9615 | 0.8929 | 0.8843 | 0.8866 |
40 | 0.9619 | 0.8944 | 0.8846 | 0.8875 |
50 | 0.9622 | 0.8959 | 0.8848 | 0.8883 |
60 | 0.9625 | 0.8985 | 0.8832 | 0.8889 |
70 | 0.9623 | 0.9006 | 0.8793 | 0.8880 |
80 | 0.9622 | 0.9026 | 0.8757 | 0.8873 |
Metrics | Using SI | Using CSI |
---|---|---|
Accuracy | 0.9594 | 0.9625 |
Precision | 0.8914 | 0.8985 |
Recall | 0.8716 | 0.8832 |
F1 score | 0.8796 | 0.8889 |
Metrics | Two-Class | Three-Class |
---|---|---|
Accuracy | 0.9606 | 0.9625 |
Precision | 0.9120 | 0.8985 |
Recall | 0.8557 | 0.8832 |
F1 score | 0.8810 | 0.8889 |
Method | Accuracy | Precision | Recall | F1 Score |
---|---|---|---|---|
SVM | 0.9557 | 0.8632 | 0.8814 | 0.8664 |
RF(CSI) | 0.9596 | 0.8914 | 0.8744 | 0.8828 |
RF(CSI), k-NN | 0.9625 | 0.8985 | 0.8832 | 0.8889 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Murtala, S.; Choi, Y.-C.; Choi, K.-S. A Machine Learning Method Based on 3D Local Surface Representation for Recognizing the Inscriptions on Ancient Stele. Appl. Sci. 2021, 11, 5758. https://doi.org/10.3390/app11125758
Murtala S, Choi Y-C, Choi K-S. A Machine Learning Method Based on 3D Local Surface Representation for Recognizing the Inscriptions on Ancient Stele. Applied Sciences. 2021; 11(12):5758. https://doi.org/10.3390/app11125758
Chicago/Turabian StyleMurtala, Sheriff, Ye-Chan Choi, and Kang-Sun Choi. 2021. "A Machine Learning Method Based on 3D Local Surface Representation for Recognizing the Inscriptions on Ancient Stele" Applied Sciences 11, no. 12: 5758. https://doi.org/10.3390/app11125758
APA StyleMurtala, S., Choi, Y. -C., & Choi, K. -S. (2021). A Machine Learning Method Based on 3D Local Surface Representation for Recognizing the Inscriptions on Ancient Stele. Applied Sciences, 11(12), 5758. https://doi.org/10.3390/app11125758