Figure 1.
QTPR-Net framework.
Figure 1.
QTPR-Net framework.
Figure 2.
The structure of the preliminary edge feature extraction module. (a) Shows the backbone network of the model; (b) Shows the RPN structure of the model; (c) Shows the BoxHead structure of the model.
Figure 2.
The structure of the preliminary edge feature extraction module. (a) Shows the backbone network of the model; (b) Shows the RPN structure of the model; (c) Shows the BoxHead structure of the model.
Figure 3.
The structure of the edge point feature refinement module.
Figure 3.
The structure of the edge point feature refinement module.
Figure 4.
Visualization of SSDD dataset.
Figure 4.
Visualization of SSDD dataset.
Figure 5.
Comparison chart of various indicators of the model for the NWPU VHR-10 dataset (on the left is a line graph of loss indicators, and on the right is a line graph of recall and accuracy indicators).
Figure 5.
Comparison chart of various indicators of the model for the NWPU VHR-10 dataset (on the left is a line graph of loss indicators, and on the right is a line graph of recall and accuracy indicators).
Figure 6.
Comparison chart of various indicators of different structures for the SSDD dataset (on the left is a line graph of mask loss values, and on the right is a line graph of accuracy values).
Figure 6.
Comparison chart of various indicators of different structures for the SSDD dataset (on the left is a line graph of mask loss values, and on the right is a line graph of accuracy values).
Figure 7.
The structure of TransQTA.
Figure 7.
The structure of TransQTA.
Figure 8.
Comparison chart of the various indicators of different layers for the SSDD dataset (on the left is a line graph of the point loss values, and on the right is a line graph of the accuracy values).
Figure 8.
Comparison chart of the various indicators of different layers for the SSDD dataset (on the left is a line graph of the point loss values, and on the right is a line graph of the accuracy values).
Table 1.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
Table 1.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
Method | Memory (MB) | | | | | | | |
---|
MaskRCNN [4] | 8509 M | 70.6091 | 67.4047 | 93.2067 | 75.5896 | 57.7106 | 65.3746 | 75.0193 |
CondInst [40] | 8258 M | 68.8562 | 64.3168 | 91.1648 | 66.6747 | 53.7024 | 62.4261 | 71.5836 |
BlendMask [41] | 3509 M | 63.9361 | 60.8243 | 88.8132 | 63.5941 | 48.0288 | 59.5018 | 64.2823 |
PointRend [13] | 8922 M | 68.7919 | 67.7898 | 91.8704 | 72.6399 | 60.0279 | 65.9937 | 75.1733 |
BoxInst [42] | 7320 M | 66.4603 | 50.2457 | 81.0751 | 52.2650 | 38.6690 | 46.5827 | 57.2665 |
Mask Transfiner [36] | 15,868 M | 69.7329 | 67.3434 | 91.7960 | 75.1191 | 55.8363 | 65.8000 | 75.0204 |
QTPR-Net | 10,376 M | 70.5119 | 69.1944 | 93.0919 | 75.7410 | 53.9180 | 67.5924 | 76.4796 |
Table 2.
Comparison results of category instance segmentation in the NWPU VHR-10 dataset. The abbreviations for the classes are AI: airplane, SH: ship, ST: tank, BD: baseball field, TC: tennis court, BC: basketball court, GT: ground track and field, HA: port, BR: bridge, and VE: vehicle.
Table 2.
Comparison results of category instance segmentation in the NWPU VHR-10 dataset. The abbreviations for the classes are AI: airplane, SH: ship, ST: tank, BD: baseball field, TC: tennis court, BC: basketball court, GT: ground track and field, HA: port, BR: bridge, and VE: vehicle.
Method | AI | SH | ST | BD | TC | BC | GT | HA | BR | VE |
---|
Mask RCNN [4] | 51.523 | 59.181 | 84.496 | 82.353 | 72.799 | 76.122 | 95.502 | 54.244 | 41.62 | 56.207 |
CondInst [40] | 42.917 | 53.106 | 84.74 | 82.641 | 66.744 | 79.746 | 95.915 | 52.058 | 31.163 | 54.137 |
BlendMask [41] | 48.197 | 54.025 | 82.959 | 80.785 | 62.859 | 71.316 | 90.814 | 41.366 | 28.428 | 47.494 |
PointRend [13] | 51.898 | 60.735 | 88.436 | 84.808 | 71.152 | 35.623 | 97.386 | 55.06 | 35.623 | 57.206 |
BoxInst [42] | 17.029 | 48.981 | 81.02 | 78.874 | 66.322 | 65.318 | 92.951 | 5.591 | 6.694 | 39.677 |
Mask Transfiner [36] | 53.178 | 61.218 | 85.857 | 85.953 | 68.461 | 78.373 | 91.689 | 53.325 | 39.107 | 56.271 |
QTPR-Net | 52.389 | 61.561 | 80.879 | 83.850 | 72.714 | 82.950 | 97.904 | 56.331 | 40.960 | 57.604 |
Table 3.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
Table 3.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
Method | Memory (MB) | | | | | | | |
---|
MaskRCNN [4] | 6916 M | 72.1844 | 69.9374 | 95.5500 | 87.0571 | 67.9297 | 76.2323 | 46.1304 |
CondInst [40] | 7637 M | 72.3112 | 69.3922 | 95.7354 | 85.8785 | 67.4935 | 75.8190 | 53.3663 |
BlendMask [41] | 2138 M | 69.4160 | 67.4937 | 95.4400 | 84.7919 | 67.1617 | 70.0460 | 48.5545 |
PointRend [13] | 8340 M | 71.2352 | 70.4962 | 96.2620 | 87.6452 | 69.1656 | 75.6742 | 48.0363 |
Mask Transfiner [36] | 13,127 M | 72.3341 | 70.1995 | 95.5667 | 85.5286 | 68.9570 | 74.9495 | 41.0891 |
QTPR-Net | 8509 M | 72.5246 | 71.5251 | 96.5326 | 89.6856 | 69.6230 | 78.6076 | 55.5446 |
Table 4.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
Table 4.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
Method | Memory (MB) | | | | | | | |
---|
MaskRCNN [4] | 24,444 M | 41.3018 | 34.2557 | 58.5570 | 34.8988 | 18.7573 | 41.8256 | 52.8351 |
CondInst [40] | 27,298 M | 40.9067 | 32.9986 | 58.8544 | 32.6306 | 16.8397 | 41.6669 | 51.8629 |
BlendMask [41] | 12,528 M | 41.1042 | 33.7449 | 59.0631 | 33.9347 | 18.7465 | 41.7782 | 49.8384 |
PointRend [13] | 14,673 M | 38.9323 | 34.3458 | 57.9447 | 35.7689 | 18.8701 | 41.8448 | 49.2744 |
Mask Transfiner [36] | 16,318 M | 41.2341 | 34.9860 | 59.0648 | 36.0690 | 19.1850 | 41.6859 | 52.2770 |
QTPR-Net | 17,993 M | 42.4565 | 37.0704 | 60.9745 | 38.9419 | 22.4383 | 44.6925 | 54.7648 |
Table 5.
Comparison results of category instance segmentation using the iSAID dataset. The abbreviations for the classes are: SH: Ship, ST: Storage Tank, BD: Baseball Diamond, TC: Tennis Court, BC: Basketball Court, GT: Ground Track Field, BR: Bridge, LV: Large Vehicle, SV: Small Vehicle, HE: Helicopter, SP: Swimming Pool, RO: Roundabout, SB: Soccerball Field, PL: Plane, and HA: Harbor.
Table 5.
Comparison results of category instance segmentation using the iSAID dataset. The abbreviations for the classes are: SH: Ship, ST: Storage Tank, BD: Baseball Diamond, TC: Tennis Court, BC: Basketball Court, GT: Ground Track Field, BR: Bridge, LV: Large Vehicle, SV: Small Vehicle, HE: Helicopter, SP: Swimming Pool, RO: Roundabout, SB: Soccerball Field, PL: Plane, and HA: Harbor.
Method | SH | ST | BD | TC | BC | GT | BR | LV | SV | HE | SP | RO | SB | PL | HA |
---|
MaskRCNN [4] | 37.22 | 37.22 | 51.85 | 77.283 | 77.283 | 29.076 | 19.223 | 32.641 | 11.422 | 5.837 | 32.567 | 29.936 | 43.841 | 46.427 | 25.544 |
CondInst [40] | 35.672 | 33.45 | 52.891 | 76.27 | 38.966 | 19.575 | 17.984 | 32.736 | 9.569 | 6.922 | 30.794 | 33.166 | 40.161 | 39.004 | 27.818 |
BlendMask [41] | 36.994 | 34.104 | 51.746 | 77.669 | 37.041 | 18.773 | 18.885 | 34.141 | 11.574 | 7.098 | 33.224 | 34.672 | 38.267 | 45.473 | 26.513 |
PointRend [13] | 38.219 | 34.334 | 50.913 | 77.488 | 36.489 | 26.823 | 18.075 | 35.615 | 12.375 | 6.642 | 33.483 | 27.222 | 40.155 | 49.573 | 27.782 |
Mask Transfiner [36] | 38.177 | 34.723 | 53.458 | 77.21 | 39.663 | 26.95 | 20.053 | 33.894 | 12.338 | 5.97 | 33.664 | 29.567 | 44.053 | 48.602 | 26.469 |
QTPR-Net | 40.161 | 36.47 | 53.081 | 78.599 | 36.713 | 33.336 | 22.706 | 37.529 | 13.247 | 6.929 | 34.975 | 35.568 | 43.802 | 52.553 | 30.388 |
Table 6.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
Table 6.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
NET | Memory (MB) | | | | | | | |
---|
Baseline | 8922 M | 68.7919 | 67.7898 | 91.8704 | 72.6399 | 60.0279 | 65.9937 | 75.1733 |
QTA | 9157 M | 69.0619 | 66.947 | 91.6985 | 72.7587 | 58.4274 | 64.7118 | 75.8612 |
MultiHead | 9713 M | 68.1484 | 66.9607 | 89.9188 | 72.9379 | 54.734 | 64.508 | 76.8688 |
textbfTransQTA | 10,571 M | 69.8459 | 68.7143 | 93.0919 | 75.741 | 53.918 | 67.5924 | 76.4796 |
Table 7.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
Table 7.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
NET | Memory (MB) | | | | | | | |
---|
Baseline | 8340 M | 71.2352 | 70.4962 | 96.2620 | 87.6452 | 69.1656 | 75.6742 | 48.0363 |
QTA | 8526 M | 71.0313 | 69.9524 | 94.6564 | 88.1010 | 67.6106 | 78.2828 | 62.0198 |
MultiHead | 8044 M | 71.1569 | 70.3533 | 95.5304 | 88.2287 | 68.2050 | 77.8009 | 57.5495 |
TransQTA | 8509 M | 72.5246 | 71.5251 | 96.5326 | 89.6856 | 69.6230 | 78.6076 | 55.5446 |
Table 8.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
Table 8.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
NET | Memory (MB) | | | | | | | |
---|
Baseline | 14,673 M | 38.9323 | 34.3458 | 57.9447 | 35.7689 | 18.8701 | 41.8448 | 49.2744 |
QTA | 18,993 M | 41.9985 | 36.9622 | 60.7912 | 38.9254 | 21.7348 | 44.0669 | 53.9221 |
MultiHead | 17,224 M | 39.5658 | 34.8422 | 58.3206 | 36.4288 | 18.7572 | 41.9450 | 51.3563 |
TransQTA | 17,993 M | 42.4565 | 37.0704 | 60.9745 | 38.9419 | 22.4383 | 44.6925 | 54.7648 |
Table 9.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
Table 9.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the NWPU VHR-10 dataset.
Layer | Memory (MB) | | | | | | | |
---|
No cascade | 8922 M | 68.7919 | 67.7898 | 91.8704 | 72.6399 | 60.0279 | 65.9937 | 75.1733 |
1 | 9684 M | 68.9075 | 67.5039 | 92.3354 | 73.2900 | 56.7301 | 66.6510 | 76.3986 |
2 | 10,376 M | 69.8459 | 68.7143 | 93.0919 | 75.7410 | 53.9180 | 67.5924 | 76.4796 |
3 | 11,242 M | 68.4582 | 67.4572 | 90.7146 | 74.2923 | 59.2013 | 65.6678 | 76.0052 |
Table 10.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
Table 10.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the SSDD dataset.
Layer | Memory (MB) | | | | | | | |
---|
No cascade | 8340 M | 71.2352 | 70.4962 | 96.2620 | 87.6452 | 69.1656 | 75.6742 | 48.0363 |
1 | 9287 M | 71.9147 | 70.3033 | 96.3966 | 87.3504 | 69.7567 | 74.2963 | 40.4752 |
2 | 8509 M | 72.5246 | 71.5251 | 96.5326 | 89.6856 | 69.6230 | 78.6076 | 55.5446 |
3 | 10,767 M | 71.7811 | 70.7983 | 96.3289 | 96.3289 | 69.2145 | 76.6380 | 60.0000 |
Table 11.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
Table 11.
Comparison results of memory usage, object segmentation, and instance segmentation AP values for the iSAID dataset.
Layer | Memory (MB) | | | | | | | |
---|
No cascade | 14,673 M | 38.9323 | 34.3458 | 57.9447 | 35.7689 | 18.8701 | 41.8448 | 49.2744 |
1 | 18,111 M | 42.3277 | 36.7066 | 60.3668 | 60.3668 | 60.3668 | 44.4097 | 44.4097 |
2 | 17,993 M | 42.4565 | 37.0704 | 60.9745 | 38.9419 | 22.4383 | 44.6925 | 54.7648 |
3 | 19,024 M | 42.0979 | 36.9179 | 36.9179 | 39.0054 | 21.5603 | 21.5603 | 55.5409 |