Figure 1.
Effect of the pooling operation in DCNNs. (a) The pseudocolor Landsat image. (b) The semantic segmentation label. (c) The prediction result of the UNet.
Figure 1.
Effect of the pooling operation in DCNNs. (a) The pseudocolor Landsat image. (b) The semantic segmentation label. (c) The prediction result of the UNet.
Figure 2.
Schematic diagram of the pixel shuffle.
Figure 2.
Schematic diagram of the pixel shuffle.
Figure 3.
Overview of the block shuffle network (BSNet) architecture.
Figure 3.
Overview of the block shuffle network (BSNet) architecture.
Figure 4.
Small target objects in Landsat images. (a) The pseudocolor Landsat image. (b) The semantic segmentation label.
Figure 4.
Small target objects in Landsat images. (a) The pseudocolor Landsat image. (b) The semantic segmentation label.
Figure 5.
Details of the block shuffle encoder and decoder. Since the input data of the decoder consitute the high-level feature map, we use gray blocks to represent the feature map.
Figure 5.
Details of the block shuffle encoder and decoder. Since the input data of the decoder consitute the high-level feature map, we use gray blocks to represent the feature map.
Figure 6.
Details of the superpixel branch.
Figure 6.
Details of the superpixel branch.
Figure 7.
Schematic diagram of optimizing semantic segmentation results by superpixel segmentation result. (a) The result of the semantic segmentation. (b) The result of the superpixel segmentation. (c) The overlay result (yellow represents the esult of the semantic segmentation, red represents the result of the superpixel segmentation). (d) The optimized result of semantic segmentation.
Figure 7.
Schematic diagram of optimizing semantic segmentation results by superpixel segmentation result. (a) The result of the semantic segmentation. (b) The result of the superpixel segmentation. (c) The overlay result (yellow represents the esult of the semantic segmentation, red represents the result of the superpixel segmentation). (d) The optimized result of semantic segmentation.
Figure 8.
Geographical distribution of Landsat core dataset and Landsat extend dataset in Region N. LSC dataset IDs: 2-2, 2-3, 3-2, 3-3. LSE dataset IDs: 1-1, 1-2, 1-3, 1-4, 2-1, 2-4, 3-1, 3-4, 4-1, 4-2, 4-3, 4-4. (a) Landsat images. (b) Labels.
Figure 8.
Geographical distribution of Landsat core dataset and Landsat extend dataset in Region N. LSC dataset IDs: 2-2, 2-3, 3-2, 3-3. LSE dataset IDs: 1-1, 1-2, 1-3, 1-4, 2-1, 2-4, 3-1, 3-4, 4-1, 4-2, 4-3, 4-4. (a) Landsat images. (b) Labels.
Figure 9.
Geographical distribution of Landsat supplement dataset in Region SW. LSS dataset IDs: 5-1, 5-2, 5-3, 5-4. (a) Landsat images. (b) Labels.
Figure 9.
Geographical distribution of Landsat supplement dataset in Region SW. LSS dataset IDs: 5-1, 5-2, 5-3, 5-4. (a) Landsat images. (b) Labels.
Figure 10.
Ablation study with the block shuffle structure and superpixel branch on Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet, (d) the UNet with the block shuffle structure (BSNet), (e) the UNet with the superpixel branch (UNet-SP), and (f) the BSNet with the superpixel branch (BSNet-SP).
Figure 10.
Ablation study with the block shuffle structure and superpixel branch on Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet, (d) the UNet with the block shuffle structure (BSNet), (e) the UNet with the superpixel branch (UNet-SP), and (f) the BSNet with the superpixel branch (BSNet-SP).
Figure 11.
Ablation study with the self-boosting method on Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet with the superpixel branch (UNet-SP), (d) the UNet-SP with the self-boosting method (UNet-SP-SB), (e) the BSNet with the superpixel branch (BSNet-SP), and (f) the BSNet-SP with the self-boosting method (BSNet-SP-SB).
Figure 11.
Ablation study with the self-boosting method on Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet with the superpixel branch (UNet-SP), (d) the UNet-SP with the self-boosting method (UNet-SP-SB), (e) the BSNet with the superpixel branch (BSNet-SP), and (f) the BSNet-SP with the self-boosting method (BSNet-SP-SB).
Figure 12.
Some examples of the results on the Landsat core dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 12.
Some examples of the results on the Landsat core dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 13.
Ablation study with the block shuffle structure and superpixel branch on Landsat extend dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet, (d) the UNet with the block shuffle structure (BSNet), (e) the UNet with the superpixel branch (UNet-SP), and (f) the BSNet with the superpixel branch (BSNet-SP).
Figure 13.
Ablation study with the block shuffle structure and superpixel branch on Landsat extend dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet, (d) the UNet with the block shuffle structure (BSNet), (e) the UNet with the superpixel branch (UNet-SP), and (f) the BSNet with the superpixel branch (BSNet-SP).
Figure 14.
Ablation study with the self-boosting method on Landsat extend dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet with the superpixel branch (UNet-SP), (d) the UNet-SP with the self-boosting method (UNet-SP-SB), (e) the BSNet with the superpixel branch (BSNet-SP), and (f) the BSNet-SP with the self-boosting method (BSNet-SP-SB).
Figure 14.
Ablation study with the self-boosting method on Landsat extend dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet with the superpixel branch (UNet-SP), (d) the UNet-SP with the self-boosting method (UNet-SP-SB), (e) the BSNet with the superpixel branch (BSNet-SP), and (f) the BSNet-SP with the self-boosting method (BSNet-SP-SB).
Figure 15.
Some examples of the results on the Landsat extend dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 15.
Some examples of the results on the Landsat extend dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 16.
Some examples of the results on the Landsat supplement dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 16.
Some examples of the results on the Landsat supplement dataset. Comparison between our BSNet and other methods. (a) Landsat image. (b) Ground truth. Inference result of (c) the UNet++, (d) the LinkNet, (e) the PSPNet, (f) the DeepLabV3+, (g) the PAN, (h) the UNet, and (i) our proposed BSNet.
Figure 17.
Large-scale classification results in parts of North China.
Figure 17.
Large-scale classification results in parts of North China.
Figure 18.
Large-scale classification results in parts of Southwest China.
Figure 18.
Large-scale classification results in parts of Southwest China.
Figure 19.
The visualized comparison of different upsample scale values on the Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the BSNet with the upsample scale is 1, (d) the BSNet with the upsample scale is 2, (e) the BSNet with the upsample scale is 4, and (f) the BSNet with the upsample scale is 8.
Figure 19.
The visualized comparison of different upsample scale values on the Landsat core dataset. (a) Landsat image. (b) Ground truth. Inference result of (c) the BSNet with the upsample scale is 1, (d) the BSNet with the upsample scale is 2, (e) the BSNet with the upsample scale is 4, and (f) the BSNet with the upsample scale is 8.
Figure 20.
The distribution of the number of target objects in each tile of the sample.
Figure 20.
The distribution of the number of target objects in each tile of the sample.
Figure 21.
The distribution of the pixel number by each target object.
Figure 21.
The distribution of the pixel number by each target object.
Figure 22.
The effect of on Landsat images. (a) is set to False. (b) is set to True.
Figure 22.
The effect of on Landsat images. (a) is set to False. (b) is set to True.
Figure 23.
The effect of on Landsat images. (a) is set to 0.01. (b) is set to 0.05. (c) is set to 0.1. (d) is set to 0.5.
Figure 23.
The effect of on Landsat images. (a) is set to 0.01. (b) is set to 0.05. (c) is set to 0.1. (d) is set to 0.5.
Table 1.
Feature tensor shape in block shuffle encoder and block shuffle decoder.
Table 1.
Feature tensor shape in block shuffle encoder and block shuffle decoder.
| Operation | Number of Tensor | Batch | Channel | Height | Width |
---|
Block shuffle encoder | Input | 1 | | | | |
Upsample by | 1 | | | | |
Slice | | | | | |
Concatenate | 1 | | | | |
Output | 1 | | | | |
Block shuffle decoder | Input | 1 | | | | |
Disassemble | | | | | |
Stitch | 1 | | | | |
Output | 1 | | | | |
Table 2.
The effect of the block shuffle structure, superpixel branch and self-boosting on Landsat core dataset.
Table 2.
The effect of the block shuffle structure, superpixel branch and self-boosting on Landsat core dataset.
Method | WO | GL | WE | WB | CL | AS | BL | Mean F1 | OA |
---|
UNet | 78.7 | 61.5 | 66.5 | 63.9 | 84.5 | 79.6 | 30.0 | 66.3 | 81.3 |
UNet-SP | 80.2 | 63.1 | 67.9 | 66.5 | 85.7 | 82.3 | 31.6 | 68.1 | 83.9 |
UNet-SP-SB | 81.1 | 64.6 | 68.7 | 67.7 | 86.2 | 83.9 | 32.3 | 69.2 | 84.8 |
BSNet | 80.5 | 64.1 | 71.0 | 68.2 | 86.3 | 81.5 | 30.0 | 68.8 | 83.7 |
BSNet-SP | 82.3 | 66.2 | 72.6 | 69.4 | 87.5 | 82.8 | 31.8 | 70.3 | 85.6 |
BSNet-SP-SB | 82.8 | 66.9 | 73.0 | 71.1 | 87.7 | 83.9 | 32.6 | 71.1 | 86.3 |
Table 3.
Accuracy comparison between our BSNet and other methods on the Landsat core dataset.
Table 3.
Accuracy comparison between our BSNet and other methods on the Landsat core dataset.
Method | WO | GL | WE | WB | CL | AS | BL | Mean F1 | OA |
---|
UNet++ | 78.0 | 60.6 | 64.9 | 61.8 | 83.2 | 80.2 | 28.9 | 65.3 | 80.2 |
LinkNet | 78.1 | 60.3 | 63.7 | 60.9 | 83.4 | 78.7 | 27.6 | 64.6 | 79.9 |
PSPNet | 77.5 | 59.9 | 62.7 | 59.4 | 83.1 | 77.4 | 29.2 | 64.1 | 79.3 |
DeepLabV3+ | 78.2 | 60.1 | 65.3 | 61.8 | 83.5 | 79.8 | 29.4 | 65.4 | 80.8 |
PAN | 78.6 | 60.7 | 68.5 | 61.3 | 83.9 | 78.8 | 29.5 | 65.9 | 81.0 |
UNet | 78.7 | 61.5 | 66.5 | 63.9 | 84.5 | 79.6 | 30.0 | 66.3 | 81.3 |
BSNet (Ours) | 82.8 | 66.9 | 73.0 | 71.1 | 87.7 | 83.9 | 32.6 | 71.1 | 86.3 |
Table 4.
The effect of the block shuffle structure, superpixel branch and self-boosting on Landsat extend dataset.
Table 4.
The effect of the block shuffle structure, superpixel branch and self-boosting on Landsat extend dataset.
Method | WO | GL | WE | WB | CL | AS | BL | Mean F1 | OA |
---|
UNet | 70.1 | 83.6 | 41.3 | 62.2 | 81.6 | 73.8 | 25.9 | 62.6 | 79.4 |
UNet-SP | 71.3 | 84.7 | 43.9 | 64.0 | 82.2 | 75.9 | 26.3 | 64.0 | 81.4 |
UNet-SP-SB | 71.9 | 85.0 | 44.4 | 65.2 | 82.6 | 76.6 | 26.9 | 64.6 | 81.8 |
BSNet | 71.5 | 84.4 | 44.3 | 64.6 | 82.1 | 75.0 | 25.8 | 63.9 | 81.1 |
BSNet-SP | 72.9 | 85.8 | 45.1 | 65.9 | 82.9 | 76.2 | 26.5 | 65.0 | 82.6 |
BSNet-SP-SB | 73.3 | 86.1 | 45.2 | 66.7 | 83.4 | 77.1 | 27.0 | 65.5 | 83.2 |
Table 5.
Accuracy comparison between our BSNet and other methods on the Landsat extend dataset.
Table 5.
Accuracy comparison between our BSNet and other methods on the Landsat extend dataset.
Method | WO | GL | WE | WB | CL | AS | BL | Mean F1 | OA |
---|
UNet++ | 69.3 | 82.9 | 38.1 | 61.5 | 80.5 | 73.4 | 23.5 | 61.3 | 78.1 |
LinkNet | 69.6 | 82.8 | 37.9 | 61.1 | 80.2 | 73.1 | 24.7 | 61.3 | 78.2 |
PSPNet | 68.8 | 82.3 | 36.6 | 59.0 | 79.6 | 71.6 | 24.4 | 60.3 | 77.5 |
DeepLabV3+ | 69.3 | 82.9 | 39.7 | 61.6 | 80.9 | 73.2 | 25.6 | 61.8 | 78.8 |
PAN | 69.9 | 83.3 | 44.5 | 61.8 | 81.0 | 72.9 | 24.9 | 62.6 | 79.3 |
UNet | 70.1 | 83.6 | 41.3 | 62.2 | 81.6 | 73.8 | 25.9 | 62.6 | 79.4 |
BSNet (Ours) | 73.3 | 86.1 | 45.2 | 66.7 | 83.4 | 77.1 | 27.0 | 65.5 | 83.2 |
Table 6.
Accuracy comparison between our BSNet and other methods on the Landsat supplement dataset.
Table 6.
Accuracy comparison between our BSNet and other methods on the Landsat supplement dataset.
Method | WO | GL | WE | WB | CL | AS | BL | Mean F1 | OA |
---|
UNet++ | 73.7 | 66.7 | 31.1 | 55.7 | 73.2 | 71.1 | 19.7 | 55.8 | 67.8 |
LinkNet | 72.2 | 69.8 | 27.5 | 51.1 | 74.0 | 68.2 | 19.4 | 54.6 | 65.9 |
PSPNet | 73.2 | 60.8 | 30.8 | 51.9 | 70.2 | 73.8 | 22.2 | 54.7 | 66.3 |
DeepLabV3+ | 73.7 | 65.3 | 32.6 | 53.1 | 72.2 | 68.0 | 24.2 | 55.5 | 67.2 |
PAN | 75.1 | 63.3 | 31.6 | 51.2 | 72.5 | 72.5 | 22.0 | 55.4 | 67.0 |
UNet | 72.6 | 67.5 | 33.6 | 58.2 | 74.4 | 73.0 | 22.0 | 57.3 | 68.8 |
BSNet (Ours) | 78.0 | 72.6 | 39.4 | 61.9 | 76.3 | 77.5 | 26.9 | 61.8 | 73.4 |
Table 7.
The correspondence between upsample scale, GPU memory overhead, training/prediction duration, and accuracy on the LSC dataset.
Table 7.
The correspondence between upsample scale, GPU memory overhead, training/prediction duration, and accuracy on the LSC dataset.
Upsample Scale | GPU Memory Overhead | Training/Prediction Duration | Accuracy |
---|
1 | 1 | 1 | 81.33 |
2 | 5 | 4 | 82.70 |
4 | 17 | 16 | 83.71 |
8 | 65 | 64 | 83.86 |