Article

A Two-Step Phenotypic Parameter Measurement Strategy for Overlapped Grapes under Different Light Conditions

School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China
* Author to whom correspondence should be addressed.
Sensors 2021, 21(13), 4532; https://doi.org/10.3390/s21134532
Submission received: 29 May 2021 / Revised: 24 June 2021 / Accepted: 27 June 2021 / Published: 1 July 2021
(This article belongs to the Section Intelligent Sensors)

Abstract

Phenotypic characteristics of fruit particles, such as projection area, can reflect the growth status and physiological changes of grapes. However, complex backgrounds and overlaps constrain accurate recognition of grape borders and detection of fruit particles. Therefore, this paper proposes a two-step phenotypic parameter measurement strategy to calculate the areas of overlapped grape particles. The two steps are particle edge detection and contour fitting. For particle edge detection, an improved HED network is introduced. It makes full use of the outputs of each convolutional layer, adds a Dice coefficient term to the original weighted cross-entropy loss function, and applies image pyramids to achieve multi-scale edge detection. For contour fitting, an iterative least squares ellipse fitting and region-growth algorithm is proposed to calculate the area of grapes. Experiments showed that in the edge-detection step, compared with current prevalent methods including Canny, HED, and DeepEdge, the improved HED extracted the edges of detected fruit particles more clearly, accurately, and efficiently, and detected overlapping grape contours more completely. In the shape-fitting step, our method achieved an average error of 1.5% in grape area estimation. This study therefore provides convenient means and measures for the extraction of grape phenotypic characteristics and the study of grape growth laws.

1. Introduction

Phenotypic characteristics of grape particles are important indicators in grape water diagnosis, berry growth monitoring, and grape growth modeling. They are also an important basis for monitoring the healthy growth of grapes and ensuring high-quality size control during the growing process [1,2,3]. Machine vision and related advanced technologies have great application prospects in modern fruit planting and phenotype detection [4]; for example, machine vision has been applied to the segmentation of grape clusters in vineyards [5]. However, because grapes commonly overlap in the natural environment, accurate detection of edge contours remains an urgent, unsolved problem in grape phenotypic feature detection.
Many researchers have proposed methods for contour detection and segmentation using traditional machine vision techniques. Jin Yan et al. [6] used an improved Hough transform to segment grapes, but it had low detection accuracy and easily missed fruit when grapes overlapped. Arman Arefi et al. [7] used a watershed algorithm to segment overlapping tomatoes; the recognition accuracy reached 96.36%, but the method was prone to over-segmentation. Xiang Rong et al. [8] proposed an overlapping-tomato recognition method based on edge curvature analysis. The accuracy for slightly occluded overlapping tomatoes was 90.9%, but recognition accuracy dropped when the fruit occlusion rate was high. Wang Qiaohua et al. [9] used curvature angles and mutation points to extract the sizes of grape particles. Lei Yan et al. [10] proposed a particle-segmentation method based on contour segments and ellipse fitting. This method had high detection accuracy for elliptical fruit, but it easily caused over-segmentation when the fruit contour was too short. Chenglin Wang et al. [11] used wavelet transform and a Retinex-based image-enhancement algorithm to segment oranges with an accuracy of 97.2%, but it was difficult to segment overlapping fruits. The gPb-Owt-Ucm image segmentation method proposed by Arbeláez Pablo et al. [12] achieved a score of 74% on the Berkeley segmentation data set, but the algorithm ran slowly. Dollár Piotr et al. [13] proposed an edge-detection algorithm based on structured forests that worked well, but the detected contours were thick. Although these methods achieved good results on grape segmentation to some extent, some shortcomings and limitations remain: (1) The traditional segmentation process is always multi-stage, and the complicated pre-processing and post-processing steps are relatively inefficient. In addition, since most pre-processing and post-processing steps are not universal, when environmental parameters and grape colors change, the parameters in the processing pipeline also need to be fine-tuned to achieve the best results. (2) General detection algorithms have weak resistance to false edges and noise in images under complex conditions, which affects the accurate detection of grapes. (3) The detected edges of overlapped grape particles might be inconspicuous, discontinuous, or missing.
With the rapid development of deep learning in recent years, neural networks have made it possible to extract high-dimensional contour features of overlapped grapes under complex conditions. Current methods can be divided into two branches. The first branch detects contours based on local areas, as in N4-Fields [14], DeepEdge [15], and DeepContour [16]. Their core pipeline extracts a pixel block centered on each pixel, extracts features with a CNN, compares the features with real contours, and predicts a pixel-level contour probability. These algorithms identify object contours well, but local-area sampling makes them time-consuming and memory-intensive. The other branch is end-to-end contour detection, represented by HED [17], RCF [18], and CEDN [19], whose main idea is to directly predict pixel-level contour probabilities using a CNN. Such algorithms are simple and efficient, and their detection accuracy is high.

2. Methodology

To better extract features from overlapped grapes, this paper proposes a method with two steps: initial contour detection using a CNN-based network, and contour refinement using prior boundary shape knowledge. These will be further introduced in Section 2.1 and Section 2.2.

2.1. Step One: Contour Detection

The HED network [17] was proposed to solve the ambiguity problem of edge detection by automatically learning rich hierarchical image features. It performs end-to-end pixel-level prediction using a VGG16 backbone [20] that contains 13 convolutional layers. These layers are divided into five stages, each ending with a pooling layer. The last convolutional layer of each stage is connected to a side output layer that produces an edge-prediction map, and the final edge-prediction map is the fusion of all stages.
The original HED network described above can detect most edges in an image, but these edges are sometimes thick and indistinct, and some low-contrast edges might be missed, which could affect the accuracy of the subsequent grape phenotypic parameter measurement. Therefore, this paper proposes three improvements to address these problems, as shown in Figure 1. First, more side output network structures are introduced to merge more edge-prediction maps at different levels, which is more conducive to extracting multi-scale features. Then, the Dice loss function [21] is added to the original weighted cross-entropy loss function. Finally, an image pyramid for multi-scale feature fusion is introduced. Details of these adjustments are given below.
First, conventional HED leads out a side output layer only at the last convolutional layer of each stage, which does not make full use of the edge information contained in the other convolutional layers. Inspired by RCF [18], we fused the output layers of the VGG16 network in selected stages. Given that our target scene is contour detection of grape particles, we paid more attention to high-dimensional feature maps, which characterize the outline of the main part of the image, whereas low-dimensional feature maps are more suitable for characterizing image texture, noise, and other details. Therefore, unlike RCF, which accumulates the output layers in every stage using an eltwise layer to obtain hybrid features, the improved HED network connects only the convolutional layers in the third to fifth stages to the side output layers.
Taking the third stage as an example, each convolutional layer leads to a side output layer through a 1 × 1 convolution with a channel depth of 25. The feature maps generated in this stage are superimposed and connected to a 1 × 1 convolutional layer with a single output channel. The edge-prediction map of this stage is then restored to the original image size by bilinear interpolation upsampling. The feature maps of all five stages are merged to predict the final edge map.
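For illustration, the following is a minimal PyTorch sketch of such a stage-wise side output (a 1 × 1 convolution per layer, feature superposition, a single-channel score map, and bilinear upsampling); the module names, tensor shapes, and in_channels values are assumptions for illustration rather than the exact implementation.

```python
import torch.nn as nn
import torch.nn.functional as F


class StageSideOutput(nn.Module):
    """Side output for one backbone stage: a 1x1 conv per convolutional layer,
    element-wise accumulation, a 1x1 conv down to one channel, and bilinear upsampling."""

    def __init__(self, in_channels, num_convs, mid_channels=25):
        super().__init__()
        # one 1x1 convolution (channel depth 25, as stated in the text) per conv layer of the stage
        self.reducers = nn.ModuleList(
            [nn.Conv2d(in_channels, mid_channels, kernel_size=1) for _ in range(num_convs)]
        )
        # 1x1 convolution producing the single-channel edge map of this stage
        self.score = nn.Conv2d(mid_channels, 1, kernel_size=1)

    def forward(self, stage_features, out_size):
        # stage_features: list of feature maps, one per conv layer in this stage
        fused = sum(reduce(feat) for reduce, feat in zip(self.reducers, stage_features))
        edge = self.score(fused)
        # restore the stage prediction to the input image resolution with bilinear interpolation
        return F.interpolate(edge, size=out_size, mode="bilinear", align_corners=False)


# For example, VGG16 stage 3 has three 256-channel convolutional layers:
# side3 = StageSideOutput(in_channels=256, num_convs=3)
```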
Second, most pixels in an image are non-edge pixels, so the distribution of edge and non-edge pixels is highly unbalanced, and the general cross-entropy loss function used in classic binary classification problems has difficulty adapting to such imbalance. To solve this problem, following [22], our loss function includes a weighted cross-entropy loss:
$$L(W, w) = -\beta \sum_{j \in Y_{+}} \log \Pr\left(y_j = 1 \mid X; W, w\right) - (1 - \beta) \sum_{j \in Y_{-}} \log \Pr\left(y_j = 0 \mid X; W, w\right) \quad (1)$$
where $Y_{+}$ and $Y_{-}$ denote the set of edge pixels and the set of non-edge pixels in the image, respectively; $\beta = |Y_{-}|/|Y|$, $1-\beta = |Y_{+}|/|Y|$, and $|Y| = |Y_{+}| + |Y_{-}|$ is the total number of pixels. $X$ denotes the input image, and $\Pr(y_j \mid X; W, w)$ is computed for pixel $j$ through the sigmoid function.
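A minimal PyTorch sketch of this class-balanced cross-entropy is given below; the sum reduction mirrors Equation (1), while the function name and the assumption that predictions are raw logits are illustrative.

```python
import torch
import torch.nn.functional as F


def weighted_bce_loss(pred_logits, target):
    """Class-balanced cross-entropy of Equation (1), with beta = |Y-| / |Y|.
    pred_logits: raw network outputs; target: binary edge map (1 = edge pixel)."""
    target = target.float()
    num_pos = target.sum()
    num_total = target.numel()
    beta = (num_total - num_pos) / num_total              # fraction of non-edge pixels
    # edge pixels weighted by beta, non-edge pixels by (1 - beta)
    weights = torch.where(target > 0.5, beta, 1.0 - beta)
    return F.binary_cross_entropy_with_logits(
        pred_logits, target, weight=weights, reduction="sum"
    )
```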
The weight coefficient $\beta$ in Equation (1) is used to solve non-convergence in training caused by the unbalanced pixel distribution, but it might also cause unclear edges in the prediction results. To resolve this contradiction introduced by the weight coefficient, the Dice coefficient is introduced as a loss function [22]:
$$L(P, G) = \mathrm{Dist}(P, G) = \frac{\sum_{i}^{N} p_i^{2} + \sum_{i}^{N} g_i^{2}}{2 \sum_{i}^{N} p_i g_i} \quad (2)$$
where $G$ is the actual edge map of the image, $P$ is the predicted image, and $p_i$ and $g_i$ denote the pixel values of the $i$-th point in the predicted image and the actual edge image, respectively.
The Dice coefficient addresses the unbalanced image pixel distribution because it defines a pixel-level similarity measure between the edge-prediction image and the ground truth; in this way, it guides the model toward high-quality results with thinner and more accurate boundaries. Considering the characteristics of the research object in this paper, a large number of single grape particles are used as data samples during training, and the effective fruit-edge pixels occupy only a small proportion of the whole picture, so the pixel-distribution imbalance is more serious than for other objects, and Dice loss is particularly suitable. Therefore, the final loss of the model in this paper is the sum of the weighted cross-entropy loss and the Dice loss:
$$L(P, G) = \alpha L_{D}(P, G) + (1 - \alpha) L_{w}(P, G) \quad (3)$$
where $L_{D}(P, G)$ denotes the Dice loss function and $L_{w}(P, G)$ denotes the weighted cross-entropy loss function.
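The sketch below, reusing weighted_bce_loss from the previous snippet, shows one way Equations (2) and (3) could be combined in PyTorch; the small eps term and the default alpha = 0.6 (the value later reported as optimal in Section 4.1.1) are assumptions for numerical stability and illustration.

```python
import torch


def dice_loss(pred_logits, target, eps=1e-6):
    """Dice-based loss of Equation (2): (sum p^2 + sum g^2) / (2 * sum p*g).
    eps avoids division by zero when prediction and ground truth do not overlap."""
    p = torch.sigmoid(pred_logits).flatten()
    g = target.float().flatten()
    return (p.pow(2).sum() + g.pow(2).sum() + eps) / (2.0 * (p * g).sum() + eps)


def combined_loss(pred_logits, target, alpha=0.6):
    """Total loss of Equation (3): alpha * Dice loss + (1 - alpha) * weighted cross-entropy."""
    return alpha * dice_loss(pred_logits, target) + \
           (1.0 - alpha) * weighted_bce_loss(pred_logits, target)
```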
Third, grape particles usually appear at small scales in monitoring images, so image pyramids are used to enhance the model's ability to represent grape contours at different scales. Specifically, the resolution of the input image is adjusted to several levels using bilinear interpolation to form an image pyramid as the network input. Edge-probability maps of different sizes are then predicted by the network and resized back to the input size by bilinear interpolation upsampling. Finally, these maps are fused to obtain the final prediction image. The whole process is illustrated in Figure 2.
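A minimal sketch of this pyramid inference is shown below; the scale factors and the simple averaging fusion are assumptions, since the exact pyramid levels are not listed in the text.

```python
import torch
import torch.nn.functional as F


@torch.no_grad()
def multiscale_edge_map(model, image, scales=(0.5, 1.0, 1.5)):
    """Build an image pyramid with bilinear interpolation, predict an edge map at each
    scale, resize every prediction back to the input size, and average the results.
    `model` is assumed to map a (1, 3, H, W) image to a (1, 1, h, w) edge probability map."""
    _, _, H, W = image.shape
    fused = torch.zeros(1, 1, H, W, device=image.device)
    for s in scales:
        resized = F.interpolate(image, scale_factor=s, mode="bilinear", align_corners=False)
        pred = model(resized)
        # restore each prediction to the original resolution before fusion
        fused += F.interpolate(pred, size=(H, W), mode="bilinear", align_corners=False)
    return fused / len(scales)
```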

2.2. Step Two: Contour Fitting

Under some poor environmental conditions, such as large overlapped areas or low light intensity, the detected contours might have large gaps, as shown in Figure 3. This complicates further feature extraction. Therefore, this paper proposes a method based on grape contour fitting in candidate regions. As shown in Figure 4, contour extraction is performed on the original image of the grape cluster, and fruit detection is performed to obtain candidate areas. After the contour map is combined with the candidate areas, each rectangular candidate area contains the contour of one grape. Grape contour fitting is then performed on each candidate area in turn to obtain the contour results of all grape particles. This method, combined with the improved HED network, forms our proposed two-step measuring strategy, which can detect and fit as many grapes as possible and ensure high-precision prediction of the long axis and the projected area of the fruit.

2.2.1. Candidate Region Generation

Learning-based object-detection methods can be divided into two-step or single-step methods. The representative method of two-step detection is Faster R-CNN, proposed by Ren et al. [23], while the representative method of single-step detection is YOLOv3 [24], which directly predicts targets from inputs and performs at a higher speed. Therefore, in this paper, YOLOv3 was used for candidate region generation.
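Once the detector returns bounding boxes, combining them with the contour map reduces to cropping per-grape regions, as in the following sketch; the (x1, y1, x2, y2) pixel box format is an assumption for illustration.

```python
import numpy as np


def crop_candidate_regions(edge_map, boxes):
    """Crop the predicted edge map into per-grape candidate regions using the
    bounding boxes returned by the detector (assumed format: (x1, y1, x2, y2) in pixels)."""
    h, w = edge_map.shape[:2]
    regions = []
    for (x1, y1, x2, y2) in boxes:
        # clamp the box to the image bounds before slicing
        x1, y1 = max(0, int(x1)), max(0, int(y1))
        x2, y2 = min(w, int(x2)), min(h, int(y2))
        regions.append(edge_map[y1:y2, x1:x2].copy())
    return regions
```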

2.2.2. Iterative Least Squares Ellipse Fitting

Noise, non-grape contours, or contours of other grapes sometimes exist in a candidate area, and they might bias the result if all sample data points are fed directly into the fitting calculation. To reduce the influence of erroneous pixels on ellipse fitting, this paper proposes an iterative least squares ellipse fitting method based on the RANSAC algorithm. A set of raw data points can be divided into two categories: inner points and outer points. Inner points are target data points whose distribution can be described by a given mathematical model; in this paper, they are the outline pixels of the target grape. Outer points, such as noise, non-grape contour pixels, and contour pixels of other grapes, do not fit this model. The RANSAC algorithm searches for the best-fitting model that correctly separates inner and outer points according to iterative mathematical criteria. The proposed iterative least squares ellipse fitting can be summarized as follows:
(1) For the input edge contour map, record the edge contour pixels as E;
(2) Determine the number of iterations K as:
$$K = \frac{\log(1 - p)}{\log\left(1 - (1 - err)^{s}\right)} \quad (4)$$
where $p$ is the probability that all sampling points are inner points in at least one of the $K$ iterations, $err$ is the proportion of outer points among all data points, and $s$ is the number of sample points selected for each sampling.
(3) Determine the evaluation criterion of model fitness:
$$\mathit{fitness} = \frac{N_{Dis<d}}{C} \quad (5)$$
where $C$ is the perimeter of the fitted ellipse, and $N_{Dis<d}$ is the number of pixels in the image whose closest orthogonal distance to the ellipse is less than $d$.
(4) Randomly select 5 pixels to perform least squares ellipse fitting, and calculate the fitness of the resulting ellipse, denoted as $f$. If the fitness of the current model is higher than that of all models in previous iterations, record $f$ as $f_{max}$ and keep the corresponding ellipse;
(5) Repeat step 4 for $K$ iterations to obtain the final ellipse equation.
Figure 5 illustrates such calculation steps using a flow chart.
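As a rough illustration of steps (1)–(5), the sketch below runs OpenCV's least squares ellipse fit inside a RANSAC loop; the parameter values (p, err, s, d), the radial approximation of the orthogonal distance, and the Ramanujan perimeter approximation are assumptions, not values taken from the paper.

```python
import numpy as np
import cv2


def ransac_ellipse_fit(edge_map, p=0.99, err=0.3, s=5, d=2.0):
    """RANSAC-style iterative least squares ellipse fitting on a binary edge map.
    p, err, s follow Equation (4); d is the inlier distance threshold of Equation (5)."""
    # (1) collect the edge contour pixels E as (x, y) points
    ys, xs = np.nonzero(edge_map)
    pts = np.stack([xs, ys], axis=1).astype(np.float32)
    if len(pts) < s:
        return None

    # (2) number of iterations K from Equation (4)
    K = int(np.ceil(np.log(1.0 - p) / np.log(1.0 - (1.0 - err) ** s)))

    def fitness(ellipse):
        (cx, cy), (width, height), angle = ellipse
        a, b = width / 2.0, height / 2.0
        if a < 1e-3 or b < 1e-3:
            return 0.0
        # radial distance to the ellipse in its own frame (approximates the orthogonal distance)
        t = np.deg2rad(angle)
        dx, dy = pts[:, 0] - cx, pts[:, 1] - cy
        u = dx * np.cos(t) + dy * np.sin(t)
        v = -dx * np.sin(t) + dy * np.cos(t)
        r = np.hypot(u, v)
        phi = np.arctan2(v, u)
        r_ell = (a * b) / np.sqrt((b * np.cos(phi)) ** 2 + (a * np.sin(phi)) ** 2)
        n_inliers = np.count_nonzero(np.abs(r - r_ell) < d)
        # (3) fitness = inlier count / ellipse perimeter (Ramanujan approximation)
        C = np.pi * (3 * (a + b) - np.sqrt((3 * a + b) * (a + 3 * b)))
        return n_inliers / C

    best_ellipse, f_max = None, -1.0
    rng = np.random.default_rng()
    for _ in range(K):
        # (4) sample s pixels and fit an ellipse by least squares
        sample = pts[rng.choice(len(pts), size=s, replace=False)]
        try:
            ellipse = cv2.fitEllipse(sample)
        except cv2.error:
            continue  # degenerate sample, try again
        if not np.all(np.isfinite(np.r_[ellipse[0], ellipse[1], ellipse[2]])):
            continue
        f = fitness(ellipse)
        if f > f_max:
            best_ellipse, f_max = ellipse, f
    # (5) best ellipse after K iterations
    return best_ellipse
```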

3. Experiments

The experiments were carried out in the greenhouse (31°11′ N, 121°29′ E) of the Modern Agricultural Engineering Training Center of Shanghai Jiao Tong University. In order to take pictures regularly, we developed an automatic image-acquisition system that included a digital camera with a maximum resolution of 3072 × 2304, a light source, and a timing controller.
Figure 6 shows images of Sunshine Muscat grapes under different lighting conditions: side light, front light, and back light. The pictures were collected continuously, every hour, during the berry expansion period and during the color-change period of the grapes over 10 days.

3.1. Grape Collection and Labeling

A total of 200 photos (4032 × 3024 pixels) of single Sunshine Muscat grapes under different backgrounds and environments were captured and uniformly reduced to 384 × 544 pixels, and the contours of the grapes were manually marked as ground truth, as shown in Figure 7. The images were then augmented to 19,200 pictures by sequentially flipping them, rotating them every 22.5°, and scaling them to 1.5 times and 0.5 times their size.
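The following sketch reproduces an augmentation pipeline consistent with those counts (200 images × 2 flips × 16 rotations × 3 scales = 19,200); the OpenCV calls and interpolation settings are assumptions, and the same transforms would presumably be applied to the labeled contour maps.

```python
import cv2
import numpy as np


def augment(image):
    """Generate the 2 x 16 x 3 = 96 augmented variants of one image:
    original + horizontal flip, rotations every 22.5 degrees, and scales 1.0, 1.5, 0.5."""
    h, w = image.shape[:2]
    out = []
    for flipped in (image, cv2.flip(image, 1)):
        for angle in np.arange(0.0, 360.0, 22.5):
            M = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
            rotated = cv2.warpAffine(flipped, M, (w, h))
            for scale in (1.0, 1.5, 0.5):
                out.append(cv2.resize(rotated, None, fx=scale, fy=scale,
                                      interpolation=cv2.INTER_LINEAR))
    return out
```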

3.2. Data Preparation for Object Detection

The data set used for object detection contained 300 pictures of grape clusters. To facilitate training, the pictures were uniformly resized to 500 × 375 or 375 × 500, depending on their length and width. LabelMe [25] was used for annotation. Grapes that were not occluded, or whose occluded area did not exceed about 80%, were labeled, as shown in Figure 8.

4. Results and Discussion

4.1. Contour Detection Results

4.1.1. Model Performance Comparison

Three criteria, optimal dataset scale (ODS), optimal image scale (OIS), and frames per second (FPS), were used as the model's detection indicators. ODS refers to the detection score when the same fixed threshold is used for all test set images, and OIS refers to the detection score when the best threshold is used for each image in the test set. Before evaluation, non-maximum suppression (NMS) was performed on the edge-prediction image, and the Edge Boxes toolbox [26,27,28] was then used for testing.
To control the variables, the network adopted the optimized loss function, with α in Equation (3) set to the optimal value of 0.6 determined by comparative experiments. Comparison results of our algorithm with DeepEdge, the multi-scale version of HED (HED-MS), and the multi-scale version of RCF (RCF-MS) are shown in Table 1 and Figure 9. The improved HED was higher than the other models in terms of OIS and FPS. RCF-MS outperformed our method in terms of ODS mainly because of bias in the test data set, which contained many more individual grape particles than grape clusters. However, Figure 10 shows that the improved HED predicted more complete contours of grape clusters.

4.1.2. Comparison Test of Different Illumination Angles

In this section, the performance of our model is compared with Canny, HED-MS, DeepEdge, RCF-MS, and an ablated version without Dice loss. The following indicators were introduced to measure the effect of grape contour detection: the Dice coefficient (D), the similarity between the predicted contour map and the ground truth; the recall rate (R), the ratio of correctly predicted fruit-edge pixels to all fruit-edge pixels in the ground truth; and the contour redundancy rate (A), the proportion of pixels in the predicted contour map that do not correspond to fruit-edge pixels, which reflects the model's anti-interference ability against the contours or noise of objects other than grapes.
$$D = \frac{2\,|X \cap Y|}{|X| + |Y|}$$
$$R = \frac{TP}{TP + FN}$$
$$A = \left(1 - \frac{TP}{\mathrm{Count}(X)}\right) \times 100\%$$
where X is the predicted map; Y is ground truth; TP is the number of edge pixels of the fruit that are correctly predicted; FN is the number of edge pixels of the fruit that are predicted as background pixels; and Count(X) is the total number of edge pixels in the output contour prediction image. In this section, a manual labeling method was used to evaluate the outline of grapes.
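The three indicators can be computed directly from binary edge maps, as in the following NumPy sketch; the function name and the assumption that both maps are binary arrays of the same shape are illustrative.

```python
import numpy as np


def contour_metrics(pred, gt):
    """Compute the Dice coefficient D, recall R, and contour redundancy A defined above.
    pred and gt are binary edge maps of the same shape (1 = edge pixel)."""
    pred = pred.astype(bool)
    gt = gt.astype(bool)
    tp = np.count_nonzero(pred & gt)            # fruit-edge pixels predicted correctly
    fn = np.count_nonzero(~pred & gt)           # fruit-edge pixels predicted as background
    n_pred = max(np.count_nonzero(pred), 1)     # guard against an empty prediction
    n_gt = np.count_nonzero(gt)
    d = 2.0 * tp / (n_pred + n_gt)
    r = tp / max(tp + fn, 1)
    a = (1.0 - tp / n_pred) * 100.0             # percentage of redundant predicted pixels
    return d, r, a
```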
In natural environments, changes in lighting conditions will affect grape contour detection. In order to prove the adaptability of our method in various light conditions, we tested and compared the edge-detection abilities of different algorithms on grapes under front light, side light, and back light.
As can be seen in Figure 10, the edges of the grapes were clear and easy to separate under side light, and the grape surfaces were uniformly illuminated, so edge detection was easier. Under back light, the light intensity on the grape surfaces was weak and the contours were blurry. Under front light, some areas of the grape surfaces formed bright white spots due to reflection, and the grape contours were easily confused with the background. The contours of overlapped grapes detected by our improved HED network were more significant, the completeness of the contours was higher under side light, and the unobstructed contours of grapes under back light and front light were successfully recognized, although parts of the contours of grapes occluded inside the cluster might have been missed.
Compared with HED, the contours of overlapped grapes detected by our algorithm were more complete, whereas HED erroneously recognized reflections as contours. Although RCF-MS sometimes detected smoother grape contours, our method provided more complete contours for overlapped grapes under all three conditions, especially side light and back light. In addition, the integrity of contour detection was higher with the introduction of Dice loss, although the results did not degrade much in the ablated version without Dice loss. In general, our algorithm detected the contours of overlapped grapes more comprehensively.
Table 2 shows that the Dice coefficient of the improved HED was more than 3% higher than those of the other algorithms, and its recall rate was more than 2% higher. The contour redundancy rate under the different lighting conditions was higher only than that of RCF-MS and lower than those of the other algorithms.

4.2. Candidate Region Generation Results

Figure 11a,b show the detection results of grape clusters without overlaps, while Figure 11c,d show those with overlaps. Most occluded and unoccluded grapes were detected successfully. In addition, in Figure 11d, not only the grapes in the target area but also some small-scale grapes in the background were successfully detected.
Tests were performed on a new set of 100 grape cluster images. As shown in Table 3, YOLOv3 reached an F1 score of 95.44%, a precision of 94.65%, and a recall of 96.24% for grape detection.

4.3. Contour-Fitting Results

4.3.1. Recall Rate of Grape Contour Fitting

Recall rate defines the ratio of the number of successfully fitted grape contours to the actual number of grape contours, which is calculated as follows:
$$R = \frac{N_{D}}{N_{A}} \times 100\%$$
where $N_{D}$ is the number of correct grape contours obtained by fitting, and $N_{A}$ is the actual number of grape contours in the image.
A total of 60 grape cluster images with different cluster shapes and environmental conditions were tested. Owing to the growth characteristics of grapes, fruit particles were mainly occluded by adjacent fruit particles, and occlusion by branches and leaves rarely occurred. As shown in Figure 12, heavily occluded grapes, for which most of the outline was not visible, were ignored.
Three algorithms were used to detect 60 grape cluster images and calculate the average recall rate. As shown in Table 4, the recall rate of our method was higher than that of the other two algorithms.

4.3.2. Contour-Fitting Accuracy Test

In this section, grape images whose contours were artificially corrupted with missing segments or noise were used to verify the accuracy of our ellipse-fitting algorithm for grape particles.
The results of fitting tests on 60 artificially processed grapes are shown in Figure 13; our method accurately fitted grape contours with missing segments or noise. The detection results showed high accuracy for incomplete grape-particle contour fitting under complex backgrounds and strong anti-interference ability against non-grape contour edges and noise. Table 5 shows that the error of the long-axis measurement for the artificially processed grape particle contours was about 1%. Table 6 shows that the error of the projected-area measurement was within 1.1%, which met the practical application requirements. In these two tables, AARD denotes the average absolute relative deviation.

4.4. Continuous Monitoring of the Projected Area of Grapes

In this section, continuous monitoring of grape cultivation was used as a practical verification of the effectiveness of our method; hourly and overall changes of the projected areas of grapes were measured over six days during the expansion period. Measurements were averaged over instances to reduce random errors.
The measurement results are shown in Figure 14 and Figure 15. The projected area of the grapes fluctuated like a sinusoidal curve every day while showing an overall upward trend, which is consistent with the law that the projected area of grape fruit shrinks during the day and expands at night. The curve of the projected area measured using our method agrees with the theoretical law reported in [29], which verifies the effectiveness of our method. Furthermore, continuous monitoring of the projected area of grapes can be used for irrigation decision-making during grape fruit development [30].

5. Summary and Conclusions

This paper proposed a two-step measurement strategy for contour analysis of overlapped grapes, consisting of edge detection and contour fitting. For particle contour detection, an improved HED network with a weighted Dice loss was introduced. It makes full use of the internal features of each convolutional layer and applies image pyramids to achieve multi-scale edge detection. For contour fitting, a method combining iterative least squares ellipse fitting and a region-growth algorithm was proposed to calculate the projected area of grapes. Experiments showed that in the edge-detection step, compared with current prevalent methods, our improved HED detected and extracted complete fruit-particle edges more clearly, accurately, and efficiently. In the shape-fitting step, our method achieved an average error of 1.5% in grape area estimation. We hope this study provides convenient means and measures for grape phenotypic characteristic extraction and the study of grape growth laws.

Author Contributions

Conceptualization, Y.M.; methodology, S.Z.; software, S.Z.; validation, S.Z., Y.M. and L.H.; data curation, S.Z., Y.M. and L.H.; writing—original draft preparation, S.Z. and Y.M.; supervision, Y.M.; project administration, Y.M.; funding acquisition, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 51975361) and the Shanghai Agriculture Applied Technology Development Program (2019-02-08-00-08-F01118).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Cifre, J.; Bota, J.; Escalona, J.M.; Medrano, H.; Flexas, J. Physiological tools for irrigation scheduling in grapevine (Vitis vinifera L.): An open gate to improve water-use efficiency? Agric. Ecosyst. Environ. 2005, 106, 159–170.
  2. Gurovich, L.A.; Ton, Y.; Vergara, L.M. Irrigation scheduling of avocado using phytomonitoring techniques. Cienc. Investig. Agrar. 2006, 33, 117–124.
  3. Jones, H.G. Irrigation scheduling: Advantages and pitfalls of plant-based methods. J. Exp. Bot. 2004, 55, 2427–2436.
  4. Luo, L.; Tang, Y.; Lu, Q.; Chen, X.; Zhang, P.; Zou, X. A vision methodology for harvesting robot to detect cutting points. Comput. Ind. 2018, 99, 130–139.
  5. Tang, Y.C.; Wang, C.; Luo, L.; Zou, X. Recognition and localization methods for vision-based fruit picking robots: A review. Front. Plant Sci. 2020, 11, 510.
  6. Jin, Y. Study on the South Grape Expert System Using Artificial Neural Network and Machine Vision. Ph.D. Thesis, Hunan Agricultural University, Changsha, China, 2009. (In Chinese)
  7. Arefi, A.; Motlagh, A.M.; Mollazade, K.; Teimourlou, R.F. Recognition and localization of ripen tomato based on machine vision. Aust. J. Crop Sci. 2011, 5, 1144–1149.
  8. Xiang, R.; Ying, Y.B.; Jiang, H.Y.; Rao, X.; Peng, Y. Recognition of overlapping tomatoes based on edge curvature analysis. Trans. Chin. Soc. Agric. Mach. 2012, 43, 157–162. (In Chinese with English abstract)
  9. Wang, Q.; Ding, Y.; Luo, J.; Xu, K.; Li, M. Automatic Grading Device for Red Globe Grapes Based on Machine Vision and Method Thereof. Patent CN102680414A, 19 September 2012. (In Chinese)
  10. Yan, L.; Park, C.W.; Lee, S.R.; Lee, C.Y. New separation algorithm for touching grain kernels based on contour segments and ellipse fitting. J. Zhejiang Univ. Sci. C 2011, 12, 54–61.
  11. Wang, C.; Tang, Y.; Zou, X.; SiTu, W.; Feng, W. A robust fruit image segmentation algorithm against varying illumination for vision system of fruit harvesting robot. Optik 2017, 131, 626–631.
  12. Arbeláez, P.; Maire, M.; Fowlkes, C.; Malik, J. Contour detection and hierarchical image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 33, 898–916.
  13. Dollár, P.; Zitnick, C.L. Fast edge detection using structured forests. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 37, 1558–1570.
  14. Ganin, Y.; Lempitsky, V. N4-Fields: Neural network nearest neighbor fields for image transforms. In Proceedings of the Asian Conference on Computer Vision (ACCV), Singapore, 1–5 November 2014; Springer: Cham, Switzerland, 2014.
  15. Bertasius, G.; Shi, J.; Torresani, L. DeepEdge: A multi-scale bifurcated deep network for top-down contour detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015.
  16. Shen, W.; Wang, X.; Wang, Y.; Bai, X.; Zhang, Z. DeepContour: A deep convolutional feature learned by positive-sharing loss for contour detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015.
  17. Xie, S.; Tu, Z. Holistically-nested edge detection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015.
  18. Liu, Y.; Cheng, M.M.; Hu, X.; Wang, K.; Bai, X. Richer convolutional features for edge detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017.
  19. Yang, J.; Price, B.; Cohen, S.; Lee, H.; Yang, M.H. Object contour detection with a fully convolutional encoder-decoder network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016.
  20. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015.
  21. Dice, L.R. Measures of the amount of ecologic association between species. Ecology 1945, 26, 297–302.
  22. Deng, R.; Shen, C.; Liu, S.; Wang, H.; Liu, X. Learning to predict crisp boundaries. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018.
  23. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. arXiv 2015, arXiv:1506.01497.
  24. Redmon, J.; Farhadi, A. YOLOv3: An incremental improvement. arXiv 2018, arXiv:1804.02767.
  25. Russell, B.C.; Torralba, A.; Murphy, K.P.; Freeman, W.T. LabelMe: A database and web-based tool for image annotation. Int. J. Comput. Vis. 2008, 77, 157–173.
  26. Zitnick, C.L.; Dollár, P. Edge boxes: Locating object proposals from edges. In Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland, 6–12 September 2014.
  27. Meng, C.; Li, Z.; Bai, X.; Zhou, F. Arc adjacency matrix-based fast ellipse detection. IEEE Trans. Image Process. 2020, 29, 4406–4420.
  28. Pătrăucean, V.; Gurdjos, P.; Von Gioi, R.G. A parameterless line segment and elliptical arc detector with enhanced ellipse fitting. In Proceedings of the European Conference on Computer Vision (ECCV), Florence, Italy, 7–13 October 2012.
  29. Lou, Y.; Miao, Y.; Wang, Z.; Wang, L.; Li, J.; Zhang, C.; Xu, W.; Inoue, M.; Wang, S. Establishment of the soil water potential threshold to trigger irrigation of Kyoho grapevines based on berry expansion, photosynthetic rate and photosynthetic product allocation. Aust. J. Grape Wine Res. 2016, 22, 316–323.
  30. Chen, Y.J.; Chen, L.; Lou, Y.S.; Qin, Z.G.; Dong, X.; Ma, C.; Miao, Y.B.; Zhang, C.X.; Xu, W.P.; Wang, S.P. Determination of thresholds to trigger irrigation of 'Kyoho' grapevine during berry development periods based on variations of shoot diameter and berry projected area. J. Fruit Sci. 2019, 36, 612–620.
Figure 1. Structure of the improved HED network.
Figure 2. Schematic diagram of multi-scale contour prediction.
Figure 3. Grapes with missing contours.
Figure 4. Grape contour-fitting flowchart.
Figure 5. Flow chart of the iterative least squares ellipse fitting algorithm.
Figure 6. Images of Sunshine Muscat grapes under different lighting conditions.
Figure 7. Original images and artificially labeled contour maps as ground truth.
Figure 8. Grape clusters and their annotation maps.
Figure 9. The evaluation results for the grape data set.
Figure 10. Experimental results of six algorithms for images with different lighting conditions.
Figure 11. Detection of grape particles.
Figure 12. Test results of three fitting algorithms.
Figure 13. Grape contour-fitting results.
Figure 14. Changes of the projected area of the particles of Sunshine Muscat grapes over time.
Figure 15. Theoretical curve of the change of the projected area of the berries of Sunshine Muscat grapes [29].
Table 1. Comparison of detection results of different network structures.

Network Structure | ODS | OIS | FPS
DeepEdge | 0.527 | 0.571 | 8
HED-MS | 0.841 | 0.853 | 32
RCF-MS | 0.861 | 0.867 | 33
Ours | 0.857 | 0.868 | 38
Table 2. Experimental results of five algorithms on grapes under different light conditions.

Light Conditions | Algorithm | D | R | A
Side light | Ours | 0.33 | 0.69 | 35.7%
Side light | HED-MS | 0.24 | 0.62 | 48.3%
Side light | RCF-MS | 0.30 | 0.67 | 33.9%
Side light | Canny | 0.27 | 0.67 | 46.5%
Side light | DeepEdge | 0.20 | 0.42 | 41.6%
Back light | Ours | 0.27 | 0.59 | 33.1%
Back light | HED-MS | 0.19 | 0.57 | 47.5%
Back light | RCF-MS | 0.23 | 0.54 | 31.5%
Back light | Canny | 0.21 | 0.49 | 43.2%
Back light | DeepEdge | 0.20 | 0.38 | 35.1%
Front light | Ours | 0.31 | 0.63 | 32.3%
Front light | HED-MS | 0.22 | 0.59 | 43.3%
Front light | RCF-MS | 0.28 | 0.59 | 31.0%
Front light | Canny | 0.18 | 0.51 | 51.9%
Front light | DeepEdge | 0.24 | 0.47 | 34.2%
Table 3. Test results of grape fruit detection.

Index | F1/% | Precision/% | Recall/%
Score | 95.44 | 94.65 | 96.24
Table 4. Recall test results.

Fitting Method | Recall/%
Ours | 96.34
AAMED | 88.7
ELSD | 81.5
Table 5. Measurement results of the long axis of different grape particle contours by different algorithms.

Serial Number | Least Squares Method/Pixel | AAMED/Pixel | ELSD/Pixel | Ours/Pixel | Ground Truth/Pixel
1 | 530 | 537 | 560 | 542 | 545
2 | 545 | 540 | 540 | 548 | 550
3 | 589 | 595 | 595 | 599 | 603
4 | 600 | 587 | 590 | 591 | 593
5 | 570 | 570 | 571 | 576 | 577
6 | 564 | 561 | 561 | 563 | 569
7 | 553 | 573 | 569 | 575 | 573
8 | 575 | 587 | 584 | 578 | 581
9 | 573 | 579 | 570 | 573 | 573
10 | 555 | 562 | 560 | 570 | 569
AARD | 1.62% | 1.16% | 1.21% | 0.62% | —
Table 6. Measurement results of the projected area of different grape particle contours by different algorithms.

Serial Number | Least Squares Method/Pixel | AAMED/Pixel | ELSD/Pixel | Ours/Pixel | Ground Truth/Pixel
1 | 304,285 | 315,765 | 305,894 | 315,951 | 318,959
2 | 342,628 | 347,598 | 342,146 | 347,561 | 345,793
3 | 274,105 | 289,075 | 274,589 | 289,014 | 288,606
4 | 220,553 | 216,800 | 201,454 | 216,859 | 219,015
5 | 238,936 | 242,457 | 245,896 | 241,367 | 242,047
6 | 199,784 | 204,784 | 203,654 | 205,657 | 203,955
7 | 55,334 | 55,697 | 54,356 | 53,738 | 54,728
8 | 39,128 | 42,475 | 40,289 | 39,418 | 40,242
9 | 54,456 | 53,457 | 53,006 | 53,982 | 53,287
10 | 57,055 | 57,689 | 56,126 | 56,529 | 56,224
AARD | 2.21% | 1.35% | 2.12% | 0.88% | —
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
