Figure 1.
The overall architecture of MBRSNet.
Figure 1.
The overall architecture of MBRSNet.
Figure 2.
Examples of input images, ground-truth segmentation masks, and ground-truth SDF maps.
Figure 2.
Examples of input images, ground-truth segmentation masks, and ground-truth SDF maps.
Figure 3.
The architecture of CGMB.
Figure 3.
The architecture of CGMB.
Figure 4.
The architecture of BAWA.
Figure 4.
The architecture of BAWA.
Figure 5.
The architecture of SGFA.
Figure 5.
The architecture of SGFA.
Figure 6.
Visualization of the ablation results on representative samples. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 6.
Visualization of the ablation results on representative samples. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 7.
Visualization of segmentation results on representative samples. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 7.
Visualization of segmentation results on representative samples. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 8.
Examples of segmentation and SDF outputs on (a) Kvasir-SEG and (b) CVC-ClinicDB. Each sample shows the input image, ground truth, predicted mask with the predicted zero-level SDF contour marked by a white dashed line, and predicted SDF output. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 8.
Examples of segmentation and SDF outputs on (a) Kvasir-SEG and (b) CVC-ClinicDB. Each sample shows the input image, ground truth, predicted mask with the predicted zero-level SDF contour marked by a white dashed line, and predicted SDF output. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 9.
Grad-CAM visualization results of different encoder stages.
Figure 9.
Grad-CAM visualization results of different encoder stages.
Figure 10.
Visualization of boundary attention in BAWA, region attention in SGFA, and feature response heatmaps on Kvasir-SEG and CVC-ClinicDB.
Figure 10.
Visualization of boundary attention in BAWA, region attention in SGFA, and feature response heatmaps on Kvasir-SEG and CVC-ClinicDB.
Figure 11.
Examples of segmentation failure cases. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Figure 11.
Examples of segmentation failure cases. Correctly segmented regions, false positives, and false negatives are marked in cyan, magenta, and blue, respectively.
Table 1.
Configuration of the ablation study. The symbols ✓ and × indicate that the corresponding component is included and excluded, respectively.
Table 1.
Configuration of the ablation study. The symbols ✓ and × indicate that the corresponding component is included and excluded, respectively.
| Model | SDF | CGMB | SGFA | BAWA |
|---|
| M1 | × | × | × | × |
| M2 | ✓ | × | × | × |
| M3 | ✓ | ✓ | × | × |
| M4 (w/o BAWA) | ✓ | ✓ | ✓ | × |
| M5 (w/o SGFA) | ✓ | ✓ | × | ✓ |
| M6 (w/o CGMB) | ✓ | × | ✓ | ✓ |
| MBRSNet | ✓ | ✓ | ✓ | ✓ |
Table 2.
Ablation results of in-domain performance on Kvasir-SEG.
Table 2.
Ablation results of in-domain performance on Kvasir-SEG.
| Model | mDice | mIoU | | | |
|---|
| M1 | 0.8569 | 0.7707 | 0.8668 | 0.8186 | 0.8927 |
| M2 | 0.8899 | 0.8294 | 0.8922 | 0.8537 | 0.9093 |
| M3 | 0.9023 | 0.8457 | 0.8943 | 0.9015 | 0.9112 |
| M4 | 0.9073 | 0.8540 | 0.9046 | 0.9114 | 0.9380 |
| M5 | 0.9130 | 0.8571 | 0.9162 | 0.9148 | 0.9415 |
| M6 | 0.9094 | 0.8530 | 0.9054 | 0.9118 | 0.9378 |
| MBRSNet | 0.9216 | 0.8681 | 0.9231 | 0.9216 | 0.9492 |
Table 3.
Ablation results of in-domain performance on CVC-ClinicDB.
Table 3.
Ablation results of in-domain performance on CVC-ClinicDB.
| Model | mDice | mIoU | | | |
|---|
| M1 | 0.8598 | 0.7713 | 0.8316 | 0.8869 | 0.9369 |
| M2 | 0.8969 | 0.8227 | 0.8769 | 0.8946 | 0.9569 |
| M3 | 0.9137 | 0.8527 | 0.9172 | 0.9130 | 0.9698 |
| M4 | 0.9258 | 0.8686 | 0.9259 | 0.9268 | 0.9730 |
| M5 | 0.9343 | 0.8814 | 0.9347 | 0.9384 | 0.9802 |
| M6 | 0.9371 | 0.8876 | 0.9305 | 0.9407 | 0.9827 |
| MBRSNet | 0.9429 | 0.8968 | 0.9434 | 0.9478 | 0.9871 |
Table 4.
Comparison results of in-domain performance on Kvasir-SEG and CVC-ClinicDB. The best and second-best results on each dataset are marked in bold and underlined.
Table 4.
Comparison results of in-domain performance on Kvasir-SEG and CVC-ClinicDB. The best and second-best results on each dataset are marked in bold and underlined.
| Dataset | Model | mDice | mIoU | | | |
|---|
| Kvasir-SEG | UNet [40] | 0.7738 | 0.6794 | 0.7971 | 0.8340 | 0.8627 |
| UNet++ [41] | 0.8333 | 0.7538 | 0.8514 | 0.8553 | 0.8704 |
| Polyp-PVT [42] | 0.8328 | 0.7530 | 0.8365 | 0.8563 | 0.8723 |
| PraNet [20] | 0.8918 | 0.8288 | 0.8845 | 0.9098 | 0.9171 |
| CFANet [25] | 0.8193 | 0.7203 | 0.8190 | 0.8464 | 0.8489 |
| CaraNet [21] | 0.9094 | 0.8454 | 0.9167 | 0.9152 | 0.9317 |
| MISNet [24] | 0.8121 | 0.7324 | 0.8340 | 0.8423 | 0.8625 |
| MSBPNet [22] | 0.8773 | 0.8118 | 0.8979 | 0.8938 | 0.9180 |
| MNet-SAt [19] | 0.8357 | 0.7498 | 0.8537 | 0.8528 | 0.8824 |
| DEP-Net [43] | 0.8821 | 0.8195 | 0.8829 | 0.8959 | 0.9138 |
| MBRSNet | 0.9216 | 0.8681 | 0.9231 | 0.9216 | 0.9492 |
| CVC-ClinicDB | UNet [40] | 0.8225 | 0.7465 | 0.8365 | 0.8467 | 0.9103 |
| UNet++ [41] | 0.8537 | 0.7822 | 0.8671 | 0.8724 | 0.9098 |
| Polyp-PVT [42] | 0.8303 | 0.7389 | 0.8425 | 0.8399 | 0.9208 |
| PraNet [20] | 0.9255 | 0.8695 | 0.9285 | 0.9414 | 0.9604 |
| CFANet [25] | 0.8036 | 0.7065 | 0.7981 | 0.8591 | 0.8558 |
| CaraNet [21] | 0.9181 | 0.8542 | 0.9257 | 0.9383 | 0.9543 |
| MISNet [24] | 0.9078 | 0.8407 | 0.9159 | 0.9275 | 0.9393 |
| MSBPNet [22] | 0.9241 | 0.8662 | 0.9304 | 0.9383 | 0.9556 |
| MNet-SAt [19] | 0.8116 | 0.7306 | 0.8343 | 0.8498 | 0.8838 |
| DEP-Net [43] | 0.8161 | 0.7244 | 0.8430 | 0.8539 | 0.8897 |
| MBRSNet | 0.9429 | 0.8968 | 0.9434 | 0.9478 | 0.9871 |
Table 5.
Image-level standard deviation and corresponding p-values on Kvasir-SEG. The best and second-best standard deviations are marked in bold and underlined.
Table 5.
Image-level standard deviation and corresponding p-values on Kvasir-SEG. The best and second-best standard deviations are marked in bold and underlined.
| Model | Dice | IoU | | | |
|---|
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
|---|
| UNet [40] | 0.2047 | <0.001 | 0.2347 | <0.001 | 0.2129 | <0.001 | 0.1186 | <0.001 | 0.1638 | <0.001 |
| UNet++ [41] | 0.2005 | <0.001 | 0.2360 | <0.001 | 0.1811 | <0.001 | 0.1066 | <0.001 | 0.1583 | <0.001 |
| PraNet [20] | 0.1503 | <0.05 | 0.1830 | <0.05 | 0.1619 | <0.05 | 0.1010 | <0.05 | 0.1511 | <0.05 |
| Polyp-PVT [42] | 0.2424 | <0.001 | 0.2503 | <0.001 | 0.2360 | <0.001 | 0.1382 | <0.001 | 0.1640 | <0.001 |
| CFANet [25] | 0.1633 | <0.001 | 0.1955 | <0.001 | 0.1830 | <0.001 | 0.1047 | <0.001 | 0.1611 | <0.001 |
| CaraNet [21] | 0.1364 | <0.05 | 0.1441 | <0.05 | 0.1327 | <0.05 | 0.0963 | <0.05 | 0.1563 | <0.05 |
| MISNet [24] | 0.2450 | <0.001 | 0.2591 | <0.001 | 0.2422 | <0.001 | 0.1379 | <0.001 | 0.1746 | <0.001 |
| MSBPNet [22] | 0.2713 | <0.001 | 0.2719 | <0.001 | 0.2639 | <0.001 | 0.1331 | <0.001 | 0.1704 | <0.001 |
| MNet-SAt [19] | 0.2598 | <0.001 | 0.2690 | <0.001 | 0.2582 | <0.001 | 0.1435 | <0.001 | 0.1966 | <0.001 |
| DEP-Net [43] | 0.2703 | <0.001 | 0.2744 | <0.001 | 0.2660 | <0.001 | 0.1462 | <0.001 | 0.1873 | <0.001 |
| MBRSNet | 0.0655 | – | 0.1083 | – | 0.1045 | – | 0.0814 | – | 0.1123 | – |
Table 6.
Image-level standard deviation and corresponding p-values on CVC-ClinicDB. The best and second-best standard deviations are marked in bold and underlined.
Table 6.
Image-level standard deviation and corresponding p-values on CVC-ClinicDB. The best and second-best standard deviations are marked in bold and underlined.
| Model | Dice | IoU | | | |
|---|
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
|---|
| UNet [40] | 0.2347 | <0.001 | 0.2482 | <0.001 | 0.2303 | <0.001 | 0.1047 | <0.001 | 0.1328 | <0.001 |
| UNet++ [41] | 0.2018 | <0.001 | 0.2203 | <0.001 | 0.1939 | <0.01 | 0.0923 | <0.001 | 0.1344 | <0.001 |
| PraNet [20] | 0.0795 | <0.001 | 0.1110 | <0.001 | 0.0872 | <0.05 | 0.0909 | <0.05 | 0.0720 | <0.001 |
| Polyp-PVT [42] | 0.2013 | <0.001 | 0.2134 | <0.001 | 0.2008 | <0.001 | 0.1094 | <0.001 | 0.1466 | <0.001 |
| CFANet [25] | 0.1744 | <0.001 | 0.2007 | <0.001 | 0.1577 | <0.001 | 0.1019 | <0.001 | 0.0911 | <0.001 |
| CaraNet [21] | 0.0746 | <0.001 | 0.1046 | <0.001 | 0.0800 | <0.01 | 0.0783 | <0.05 | 0.0694 | <0.001 |
| MISNet [24] | 0.2227 | <0.001 | 0.2236 | <0.001 | 0.2187 | <0.001 | 0.1164 | <0.001 | 0.1271 | <0.001 |
| MSBPNet [22] | 0.2609 | <0.001 | 0.2555 | <0.001 | 0.2505 | <0.001 | 0.1006 | <0.001 | 0.1714 | <0.001 |
| MNet-SAt [19] | 0.2214 | <0.001 | 0.2345 | <0.001 | 0.2126 | <0.001 | 0.1092 | <0.001 | 0.1059 | <0.001 |
| DEP-Net [43] | 0.2323 | <0.001 | 0.2372 | <0.001 | 0.2184 | <0.001 | 0.1100 | <0.001 | 0.1005 | <0.001 |
| MBRSNet | 0.0626 | – | 0.0953 | – | 0.0618 | – | 0.0532 | – | 0.0553 | – |
Table 7.
Comparison results of different encoders. Params and FLOPs denote the number of parameters and floating-point operations, respectively.
Table 7.
Comparison results of different encoders. Params and FLOPs denote the number of parameters and floating-point operations, respectively.
| Dataset | Encoder | mDice | mIoU | | | | Params (M) | FLOPs (G) |
|---|
| Kvasir-SEG | ResNet-50 [45] | 0.8877 | 0.8242 | 0.8963 | 0.8971 | 0.9252 | 45.83 | 20.70 |
| Res2Net-50 [46] | 0.9037 | 0.8447 | 0.9100 | 0.9078 | 0.9351 | 45.97 | 20.73 |
| Swin-Tiny [47] | 0.9065 | 0.8492 | 0.9150 | 0.9106 | 0.9360 | 48.73 | 20.73 |
| PVT v2 [34] | 0.9216 | 0.8681 | 0.9231 | 0.9216 | 0.9492 | 45.62 | 19.65 |
| CVC-ClinicDB | ResNet-50 [45] | 0.9342 | 0.8833 | 0.9354 | 0.9444 | 0.9819 | 45.83 | 20.70 |
| Res2Net-50 [46] | 0.9358 | 0.8899 | 0.9374 | 0.9399 | 0.9783 | 45.97 | 20.73 |
| Swin-Tiny [47] | 0.9351 | 0.8822 | 0.9427 | 0.9380 | 0.9817 | 48.73 | 20.73 |
| PVT v2 [34] | 0.9429 | 0.8968 | 0.9434 | 0.9478 | 0.9871 | 45.62 | 19.65 |
Table 8.
Five-fold cross-validation ablation results on Kvasir-SEG and CVC-ClinicDB.
Table 8.
Five-fold cross-validation ablation results on Kvasir-SEG and CVC-ClinicDB.
| Dataset | Model | mDice | mIoU | | | |
|---|
| Kvasir-SEG | M1 | 0.8565 ± 0.0089 | 0.7749 ± 0.0103 | 0.8666 ± 0.0074 | 0.8207 ± 0.0064 | 0.8872 ± 0.0052 |
| M2 | 0.8809 ± 0.0062 | 0.8147 ± 0.0074 | 0.8897 ± 0.0072 | 0.8509 ± 0.0024 | 0.9069 ± 0.0102 |
| M3 | 0.8995 ± 0.0062 | 0.8447 ± 0.0074 | 0.8957 ± 0.0072 | 0.8909 ± 0.0024 | 0.9169 ± 0.0102 |
| M4 | 0.9044 ± 0.0061 | 0.8534 ± 0.0091 | 0.9049 ± 0.0079 | 0.9020 ± 0.0060 | 0.9201 ± 0.0094 |
| M5 | 0.9114 ± 0.0045 | 0.8586 ± 0.0065 | 0.9130 ± 0.0048 | 0.9125 ± 0.0032 | 0.9323 ± 0.0092 |
| M6 | 0.9050 ± 0.0056 | 0.8561 ± 0.0088 | 0.9025 ± 0.0054 | 0.9060 ± 0.0043 | 0.9285 ± 0.0096 |
| MBRSNet | 0.9161 ± 0.0033 | 0.8630 ± 0.0041 | 0.9184 ± 0.0040 | 0.9251 ± 0.0025 | 0.9423 ± 0.0083 |
| CVC-ClinicDB | M1 | 0.8543 ± 0.0091 | 0.7356 ± 0.0065 | 0.8501 ± 0.0072 | 0.8221 ± 0.0217 | 0.9038 ± 0.0039 |
| M2 | 0.8863 ± 0.0089 | 0.7866 ± 0.0093 | 0.8926 ± 0.0062 | 0.8592 ± 0.0182 | 0.9171 ± 0.0046 |
| M3 | 0.9050 ± 0.0085 | 0.8369 ± 0.0054 | 0.9041 ± 0.0072 | 0.8957 ± 0.0054 | 0.9282 ± 0.0056 |
| M4 | 0.9131 ± 0.0055 | 0.8436 ± 0.0102 | 0.9214 ± 0.0068 | 0.9197 ± 0.0061 | 0.9449 ± 0.0055 |
| M5 | 0.9310 ± 0.0067 | 0.8795 ± 0.0019 | 0.9296 ± 0.0055 | 0.9332 ± 0.0044 | 0.9662 ± 0.0063 |
| M6 | 0.9289 ± 0.0077 | 0.8695 ± 0.0111 | 0.9236 ± 0.0065 | 0.9232 ± 0.0054 | 0.9562 ± 0.0073 |
| MBRSNet | 0.9393 ± 0.0034 | 0.8933 ± 0.0040 | 0.9391 ± 0.0039 | 0.9414 ± 0.0019 | 0.9813 ± 0.0032 |
Table 9.
Five-fold cross-validation results on Kvasir-SEG and CVC-ClinicDB. The best and second-best results on each dataset are marked in bold and underlined.
Table 9.
Five-fold cross-validation results on Kvasir-SEG and CVC-ClinicDB. The best and second-best results on each dataset are marked in bold and underlined.
| Dataset | Model | mDice | mIoU | | | |
|---|
| Kvasir-SEG | UNet [40] | 0.7782 ± 0.0411 | 0.6882 ± 0.0457 | 0.7909 ± 0.0343 | 0.8277 ± 0.0218 | 0.8507 ± 0.0186 |
| UNet++ [41] | 0.8284 ± 0.0208 | 0.7480 ± 0.0237 | 0.8412 ± 0.0144 | 0.8621 ± 0.0120 | 0.8781 ± 0.0106 |
| Polyp-PVT [42] | 0.7982 ± 0.0092 | 0.6962 ± 0.0099 | 0.8087 ± 0.0089 | 0.8257 ± 0.0048 | 0.8449 ± 0.0126 |
| PraNet [20] | 0.8930 ± 0.0088 | 0.8301 ± 0.0104 | 0.8948 ± 0.0086 | 0.9114 ± 0.0036 | 0.9127 ± 0.0070 |
| CFANet [25] | 0.7545 ± 0.0215 | 0.6616 ± 0.0240 | 0.7738 ± 0.0198 | 0.8217 ± 0.0118 | 0.8363 ± 0.0074 |
| CaraNet [21] | 0.8703 ± 0.0108 | 0.7986 ± 0.0113 | 0.8707 ± 0.0066 | 0.8942 ± 0.0058 | 0.8913 ± 0.0140 |
| MISNet [24] | 0.7977 ± 0.0258 | 0.7118 ± 0.0265 | 0.8098 ± 0.0257 | 0.8480 ± 0.0145 | 0.8597 ± 0.0168 |
| MSBPNet [22] | 0.8656 ± 0.0300 | 0.7760 ± 0.0296 | 0.8517 ± 0.0280 | 0.8835 ± 0.0151 | 0.8791 ± 0.0150 |
| MNet-SAt [19] | 0.8206 ± 0.0126 | 0.7417 ± 0.0139 | 0.8328 ± 0.0122 | 0.8508 ± 0.0135 | 0.8760 ± 0.0149 |
| DEP-Net [43] | 0.8050 ± 0.0361 | 0.6961 ± 0.0367 | 0.6345 ± 0.0293 | 0.7360 ± 0.0187 | 0.7735 ± 0.0166 |
| MBRSNet | 0.9161 ± 0.0033 | 0.8630 ± 0.0041 | 0.9184 ± 0.0040 | 0.9251 ± 0.0025 | 0.9423 ± 0.0083 |
| CVC-ClinicDB | UNet [40] | 0.8017 ± 0.0256 | 0.7149 ± 0.0286 | 0.8137 ± 0.0196 | 0.8582 ± 0.0122 | 0.8838 ± 0.0183 |
| UNet++ [41] | 0.8279 ± 0.0157 | 0.7418 ± 0.0215 | 0.8391 ± 0.0161 | 0.8703 ± 0.0061 | 0.8955 ± 0.0131 |
| Polyp-PVT [42] | 0.8065 ± 0.0154 | 0.7078 ± 0.0147 | 0.8206 ± 0.0176 | 0.7828 ± 0.0145 | 0.8866 ± 0.0138 |
| PraNet [20] | 0.9213 ± 0.0092 | 0.8679 ± 0.0090 | 0.9260 ± 0.0087 | 0.9418 ± 0.0050 | 0.9523 ± 0.0080 |
| CFANet [25] | 0.7588 ± 0.0207 | 0.6681 ± 0.0217 | 0.7873 ± 0.0184 | 0.8462 ± 0.0105 | 0.8477 ± 0.0100 |
| CaraNet [21] | 0.9171 ± 0.0097 | 0.8612 ± 0.0106 | 0.9234 ± 0.0069 | 0.9388 ± 0.0047 | 0.9521 ± 0.0050 |
| MISNet [24] | 0.8222 ± 0.0148 | 0.7444 ± 0.0170 | 0.8361 ± 0.0130 | 0.8857 ± 0.0076 | 0.8928 ± 0.0081 |
| MSBPNet [22] | 0.8689 ± 0.0267 | 0.7823 ± 0.0248 | 0.8594 ± 0.0264 | 0.8144 ± 0.0144 | 0.8753 ± 0.0177 |
| MNet-SAt [19] | 0.8590 ± 0.0106 | 0.7892 ± 0.0105 | 0.8639 ± 0.0102 | 0.8243 ± 0.0098 | 0.9317 ± 0.0065 |
| DEP-Net [43] | 0.8237 ± 0.0413 | 0.7209 ± 0.0356 | 0.8325 ± 0.0416 | 0.8195 ± 0.0183 | 0.8957 ± 0.0224 |
| MBRSNet | 0.9393 ± 0.0034 | 0.8933 ± 0.0040 | 0.9391 ± 0.0039 | 0.9414 ± 0.0019 | 0.9813 ± 0.0032 |
Table 10.
Ablation results of cross-dataset generalization on CVC-ColonDB, CVC-300, and ETIS.
Table 10.
Ablation results of cross-dataset generalization on CVC-ColonDB, CVC-300, and ETIS.
| Dataset | Model | mDice | mIoU | | | |
|---|
| CVC-ColonDB | M1 | 0.7279 | 0.6403 | 0.7106 | 0.7302 | 0.8104 |
| M2 | 0.7554 | 0.6768 | 0.7635 | 0.8325 | 0.8785 |
| M3 | 0.7611 | 0.6770 | 0.7667 | 0.8378 | 0.8775 |
| M4 | 0.7730 | 0.6891 | 0.7760 | 0.8469 | 0.8712 |
| M5 | 0.7841 | 0.7072 | 0.7876 | 0.8502 | 0.8873 |
| M6 | 0.7629 | 0.6742 | 0.7673 | 0.8416 | 0.8841 |
| MBRSNet | 0.7919 | 0.7119 | 0.7996 | 0.8564 | 0.8935 |
| CVC-300 | M1 | 0.8351 | 0.7660 | 0.8080 | 0.8125 | 0.8834 |
| M2 | 0.8688 | 0.7985 | 0.8655 | 0.9142 | 0.9402 |
| M3 | 0.8747 | 0.8003 | 0.8519 | 0.9257 | 0.9328 |
| M4 | 0.8803 | 0.8124 | 0.8767 | 0.9316 | 0.9331 |
| M5 | 0.8916 | 0.8214 | 0.8684 | 0.9347 | 0.9440 |
| M6 | 0.8819 | 0.8133 | 0.8636 | 0.9289 | 0.9482 |
| MBRSNet | 0.9067 | 0.8392 | 0.8964 | 0.9427 | 0.9635 |
| ETIS | M1 | 0.6165 | 0.5307 | 0.5753 | 0.6683 | 0.6972 |
| M2 | 0.6941 | 0.6090 | 0.6727 | 0.7898 | 0.8191 |
| M3 | 0.7014 | 0.6240 | 0.6875 | 0.8153 | 0.8316 |
| M4 | 0.7279 | 0.6449 | 0.7082 | 0.8343 | 0.8559 |
| M5 | 0.7368 | 0.6576 | 0.7112 | 0.8330 | 0.8638 |
| M6 | 0.7145 | 0.6425 | 0.7345 | 0.8253 | 0.8599 |
| MBRSNet | 0.7655 | 0.6865 | 0.7499 | 0.8519 | 0.8821 |
Table 11.
Comparison results of cross-dataset generalization on CVC-ColonDB. The best and second-best results are marked in bold and underlined.
Table 11.
Comparison results of cross-dataset generalization on CVC-ColonDB. The best and second-best results are marked in bold and underlined.
| Model | mDice | mIoU | | | |
|---|
| UNet [40] | 0.6240 | 0.5459 | 0.6423 | 0.7639 | 0.7324 |
| UNet++ [41] | 0.4590 | 0.3759 | 0.4785 | 0.6748 | 0.6175 |
| Polyp-PVT [42] | 0.7317 | 0.6450 | 0.7448 | 0.8062 | 0.7807 |
| PraNet [20] | 0.7490 | 0.6474 | 0.7191 | 0.7512 | 0.8652 |
| CFANet [25] | 0.6718 | 0.5905 | 0.6771 | 0.7841 | 0.7465 |
| CaraNet [21] | 0.7555 | 0.6751 | 0.7218 | 0.7665 | 0.8055 |
| MISNet [24] | 0.7239 | 0.6527 | 0.7366 | 0.8067 | 0.8675 |
| MSBPNet [22] | 0.6491 | 0.5683 | 0.6555 | 0.7780 | 0.7489 |
| MNet-SAt [19] | 0.6405 | 0.5682 | 0.6427 | 0.7704 | 0.7802 |
| DEP-Net [43] | 0.6532 | 0.5680 | 0.6386 | 0.7671 | 0.7504 |
| MBRSNet | 0.7919 | 0.7119 | 0.7996 | 0.8564 | 0.8935 |
Table 12.
Comparison results of cross-dataset generalization on CVC-300. The best and second-best results are marked in bold and underlined.
Table 12.
Comparison results of cross-dataset generalization on CVC-300. The best and second-best results are marked in bold and underlined.
| Model | mDice | mIoU | | | |
|---|
| UNet [40] | 0.7157 | 0.6477 | 0.7099 | 0.8120 | 0.7439 |
| UNet++ [41] | 0.7859 | 0.6992 | 0.7943 | 0.8567 | 0.8880 |
| PraNet [20] | 0.8607 | 0.7943 | 0.8690 | 0.9058 | 0.9034 |
| Polyp-PVT [42] | 0.8230 | 0.7447 | 0.7983 | 0.8009 | 0.8800 |
| CFANet [25] | 0.8025 | 0.7167 | 0.7654 | 0.7863 | 0.8515 |
| CaraNet [21] | 0.8514 | 0.7812 | 0.8441 | 0.8877 | 0.9222 |
| MISNet [24] | 0.8637 | 0.7786 | 0.8547 | 0.9064 | 0.8320 |
| MSBPNet [22] | 0.8003 | 0.7202 | 0.7860 | 0.8783 | 0.8072 |
| MNet-SAt [19] | 0.6401 | 0.5409 | 0.6458 | 0.7428 | 0.8201 |
| DEP-Net [43] | 0.7474 | 0.6436 | 0.7294 | 0.8479 | 0.7465 |
| MBRSNet | 0.9067 | 0.8392 | 0.8964 | 0.9427 | 0.9635 |
Table 13.
Comparison results of cross-dataset generalization on ETIS. The best and second-best results are marked in bold and underlined.
Table 13.
Comparison results of cross-dataset generalization on ETIS. The best and second-best results are marked in bold and underlined.
| Model | mDice | mIoU | | | |
|---|
| UNet [40] | 0.4113 | 0.3607 | 0.4158 | 0.6742 | 0.5909 |
| UNet++ [41] | 0.4855 | 0.4236 | 0.4797 | 0.7004 | 0.6386 |
| PraNet [20] | 0.7385 | 0.6602 | 0.7360 | 0.8089 | 0.8588 |
| Polyp-PVT [42] | 0.6906 | 0.6017 | 0.6777 | 0.7801 | 0.7545 |
| CFANet [25] | 0.6780 | 0.5736 | 0.5250 | 0.7991 | 0.7932 |
| CaraNet [21] | 0.7192 | 0.6452 | 0.7244 | 0.7990 | 0.8591 |
| MISNet [24] | 0.6266 | 0.5743 | 0.6455 | 0.7700 | 0.7473 |
| MSBPNet [22] | 0.6903 | 0.5931 | 0.6612 | 0.7608 | 0.7922 |
| MNet-SAt [19] | 0.5561 | 0.4639 | 0.5226 | 0.6814 | 0.6639 |
| DEP-Net [43] | 0.6201 | 0.4943 | 0.5981 | 0.6853 | 0.7698 |
| MBRSNet | 0.7655 | 0.6865 | 0.7499 | 0.8519 | 0.8821 |
Table 14.
Image-level standard deviation and corresponding p-values on CVC-ColonDB. The best and second-best standard deviations are marked in bold and underlined.
Table 14.
Image-level standard deviation and corresponding p-values on CVC-ColonDB. The best and second-best standard deviations are marked in bold and underlined.
| Model | Dice | IoU | | | |
|---|
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
|---|
| UNet [40] | 0.3675 | <0.001 | 0.3513 | <0.001 | 0.3669 | <0.001 | 0.1829 | <0.001 | 0.2383 | <0.001 |
| UNet++ [41] | 0.3423 | <0.001 | 0.3335 | <0.001 | 0.3426 | <0.001 | 0.1803 | <0.001 | 0.2414 | <0.001 |
| PraNet [20] | 0.3693 | <0.001 | 0.3299 | <0.001 | 0.3794 | <0.001 | 0.1819 | <0.001 | 0.2342 | <0.001 |
| Polyp-PVT [42] | 0.3374 | <0.001 | 0.3216 | <0.001 | 0.3369 | <0.001 | 0.1907 | <0.001 | 0.2414 | <0.001 |
| CFANet [25] | 0.3005 | <0.001 | 0.2418 | <0.001 | 0.3369 | <0.001 | 0.1317 | <0.001 | 0.2036 | <0.001 |
| CaraNet [21] | 0.3260 | <0.001 | 0.2724 | <0.001 | 0.3434 | <0.001 | 0.1506 | <0.001 | 0.2052 | <0.001 |
| MISNet [24] | 0.3654 | <0.001 | 0.3467 | <0.001 | 0.3683 | <0.001 | 0.1959 | <0.001 | 0.2538 | <0.001 |
| MSBPNet [22] | 0.2993 | <0.001 | 0.2608 | <0.001 | 0.3358 | <0.001 | 0.1482 | <0.001 | 0.2120 | <0.001 |
| MNet-SAt [19] | 0.3525 | <0.001 | 0.3412 | <0.001 | 0.3510 | <0.001 | 0.1910 | <0.001 | 0.2428 | <0.001 |
| DEP-Net [43] | 0.3784 | <0.001 | 0.3487 | <0.001 | 0.3816 | <0.001 | 0.1922 | <0.001 | 0.2400 | <0.001 |
| MBRSNet | 0.2465 | – | 0.2433 | – | 0.2474 | – | 0.1210 | – | 0.1881 | – |
Table 15.
Image-level standard deviation and corresponding p-values on CVC-300. The best and second-best standard deviations are marked in bold and underlined.
Table 15.
Image-level standard deviation and corresponding p-values on CVC-300. The best and second-best standard deviations are marked in bold and underlined.
| Model | Dice | IoU | | | |
|---|
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
|---|
| UNet [40] | 0.3549 | <0.001 | 0.3352 | <0.001 | 0.3521 | <0.01 | 0.1590 | <0.001 | 0.1930 | <0.001 |
| UNet++ [41] | 0.2509 | <0.001 | 0.2583 | <0.001 | 0.2574 | <0.001 | 0.1243 | <0.001 | 0.1963 | <0.001 |
| PraNet [20] | 0.3555 | <0.05 | 0.3379 | <0.05 | 0.3483 | <0.05 | 0.1364 | <0.05 | 0.1691 | <0.001 |
| Polyp-PVT [42] | 0.2223 | <0.001 | 0.2354 | <0.001 | 0.2275 | <0.001 | 0.1262 | <0.001 | 0.1890 | <0.001 |
| CFANet [25] | 0.2888 | <0.001 | 0.2252 | <0.001 | 0.3297 | <0.001 | 0.1108 | <0.001 | 0.1385 | <0.001 |
| CaraNet [21] | 0.3311 | <0.05 | 0.2880 | <0.05 | 0.3270 | <0.05 | 0.1155 | <0.05 | 0.1360 | <0.001 |
| MISNet [24] | 0.1288 | <0.001 | 0.1672 | <0.001 | 0.1725 | <0.001 | 0.0790 | <0.001 | 0.1759 | <0.001 |
| MSBPNet [22] | 0.2189 | <0.001 | 0.1549 | <0.001 | 0.2733 | <0.001 | 0.1165 | <0.001 | 0.1829 | <0.001 |
| MNet-SAt [19] | 0.2506 | <0.001 | 0.2617 | <0.001 | 0.2491 | <0.001 | 0.1190 | <0.001 | 0.1836 | <0.001 |
| DEP-Net [43] | 0.2409 | <0.001 | 0.2522 | <0.001 | 0.2517 | <0.001 | 0.1134 | <0.001 | 0.1825 | <0.001 |
| MBRSNet | 0.0998 | – | 0.1354 | – | 0.1428 | – | 0.0625 | – | 0.1267 | – |
Table 16.
Image-level standard deviation and corresponding p-values on ETIS. The best and second-best standard deviations are marked in bold and underlined.
Table 16.
Image-level standard deviation and corresponding p-values on ETIS. The best and second-best standard deviations are marked in bold and underlined.
| Model | Dice | IoU | | | |
|---|
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
std
|
p
|
|---|
| UNet [40] | 0.4224 | < | 0.3873 | <0.001 | 0.4210 | <0.001 | 0.1791 | <0.001 | 0.2535 | <0.001 |
| UNet++ [41] | 0.4104 | <0.001 | 0.3858 | <0.001 | 0.4053 | <0.001 | 0.1816 | <0.001 | 0.2476 | <0.001 |
| PraNet [20] | 0.3622 | <0.001 | 0.3300 | <0.001 | 0.3641 | <0.001 | 0.1798 | <0.001 | 0.2490 | <0.001 |
| Polyp-PVT [42] | 0.3869 | <0.001 | 0.3558 | <0.001 | 0.3874 | <0.001 | 0.1920 | <0.001 | 0.2628 | <0.001 |
| CFANet [25] | 0.3389 | <0.001 | 0.1896 | <0.001 | 0.3233 | <0.001 | 0.0863 | <0.001 | 0.2192 | <0.001 |
| CaraNet [21] | 0.3271 | <0.001 | 0.2778 | <0.001 | 0.3420 | <0.01 | 0.1382 | <0.01 | 0.2352 | <0.001 |
| MISNet [24] | 0.3836 | <0.001 | 0.3458 | <0.001 | 0.3728 | <0.001 | 0.1655 | <0.001 | 0.2437 | <0.001 |
| MSBPNet [22] | 0.3313 | <0.001 | 0.2841 | <0.001 | 0.3491 | <0.001 | 0.1708 | <0.001 | 0.2392 | <0.001 |
| MNet-SAt [19] | 0.4152 | <0.001 | 0.3858 | <0.001 | 0.4125 | <0.001 | 0.1949 | <0.001 | 0.2671 | <0.001 |
| DEP-Net [43] | 0.3754 | <0.001 | 0.3309 | <0.001 | 0.3830 | <0.001 | 0.1722 | <0.001 | 0.2534 | <0.001 |
| MBRSNet | 0.3278 | – | 0.3219 | – | 0.2617 | – | 0.1254 | – | 0.1803 | – |
Table 17.
Comparison of model complexity, in-domain performance, and cross-dataset generalization. The in-domain average mDice is calculated on Kvasir-SEG and CVC-ClinicDB, while the cross-dataset average mDice is calculated on CVC-ColonDB, CVC-300, and ETIS.
Table 17.
Comparison of model complexity, in-domain performance, and cross-dataset generalization. The in-domain average mDice is calculated on Kvasir-SEG and CVC-ClinicDB, while the cross-dataset average mDice is calculated on CVC-ColonDB, CVC-300, and ETIS.
| Model | Params (M) | FLOPs (G) | In-Domain Avg. mDice | Cross-Dataset Avg. mDice |
|---|
| UNet [40] | 31.03 | 83.82 | 0.7982 | 0.5837 |
| UNet++ [41] | 36.62 | 211.46 | 0.8435 | 0.5768 |
| PraNet [20] | 30.48 | 10.65 | 0.9087 | 0.7827 |
| Polyp-PVT [42] | 25.10 | 8.11 | 0.8316 | 0.7484 |
| CFANet [25] | 25.23 | 44.84 | 0.8115 | 0.7174 |
| CaraNet [21] | 44.59 | 17.62 | 0.9138 | 0.7754 |
| MISNet [24] | 33.63 | 45.94 | 0.8600 | 0.7381 |
| MSBPNet [22] | 25.52 | 12.86 | 0.9007 | 0.7132 |
| MNet-SAt [19] | 90.63 | 38.53 | 0.8237 | 0.6122 |
| DEP-Net [43] | 28.83 | 8.47 | 0.8491 | 0.6736 |
| MBRSNet | 45.62 | 19.65 | 0.9323 | 0.8214 |
Table 18.
Comparison of model complexity between MBRSNet and MBRSNet-Lite.
Table 18.
Comparison of model complexity between MBRSNet and MBRSNet-Lite.
| Model | Params (M) | FLOPs (G) |
|---|
| MBRSNet | 45.62 | 19.65 |
| MBRSNet-Lite | 4.34 | 1.56 |
Table 19.
Internal and external testing performance of MBRSNet-Lite on five benchmark polyp segmentation datasets. Kvasir-SEG and CVC-ClinicDB are used for internal testing, while CVC-ColonDB, CVC-300, and ETIS are used for external testing.
Table 19.
Internal and external testing performance of MBRSNet-Lite on five benchmark polyp segmentation datasets. Kvasir-SEG and CVC-ClinicDB are used for internal testing, while CVC-ColonDB, CVC-300, and ETIS are used for external testing.
| Testing Type | Dataset | mDice | mIoU | | | |
|---|
| Internal testing | Kvasir-SEG | 0.9016 | 0.8388 | 0.8924 | 0.9099 | 0.9296 |
| CVC-ClinicDB | 0.9297 | 0.8785 | 0.9250 | 0.9368 | 0.9776 |
| External testing | CVC-ColonDB | 0.7742 | 0.6856 | 0.7839 | 0.8462 | 0.8856 |
| CVC-300 | 0.8657 | 0.7936 | 0.8475 | 0.9192 | 0.9452 |
| ETIS | 0.7172 | 0.6347 | 0.6919 | 0.8295 | 0.8399 |