TY - JOUR
T1 - FU-Net
T2 - fast biomedical image segmentation model based on bottleneck convolution layers
AU - Olimov, Bekhzod
AU - Sanjar, Karshiev
AU - Din, Sadia
AU - Ahmad, Awaise
AU - Paul, Anand
AU - Kim, Jeonghong
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH, DE part of Springer Nature.
PY - 2021/8
Y1 - 2021/8
N2 - Recently, the introduction of Convolutional Neural Networks (CNNs) has advanced the way image segmentation tasks are solved. Semantic image segmentation has considerably benefited from employing various CNN models. The most widely used networks in this field are U-Net and its variations. However, these models require a significant number of trainable parameters, floating-point operations per second, and great computational power to be trained. These factors make real-time semantic segmentation on low-powered devices very hard. Therefore, in the present paper, we aim to modify particular aspects of the U-Net model to improve its performance by developing a fast U-Net (FU-Net) relying on bottleneck convolution layers in the contraction and expansion paths of the model. The proposed model can be utilized in semantic segmentation applications even on devices with limited computational power and memory while ensuring state-of-the-art performance. The amount of memory required by the proposed model is reduced by a factor of 23 compared with the original U-Net. Moreover, the modifications yielded better performance. In the conducted experiments, we assessed the performance of the proposed model on two biomedical image segmentation datasets, namely 2018 Data Science Bowl and ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection. FU-Net demonstrated state-of-the-art results in biomedical image segmentation while requiring eight times fewer trainable parameters than the original U-Net model. In addition, using bottleneck layers decreased the number of computations, resulting in a nearly 30% speed-up at the training, validation, and test stages. Furthermore, despite relying on fewer parameters, FU-Net achieved a slight improvement in performance in terms of the pixel accuracy, Jaccard index, and Dice coefficient evaluation metrics.
AB - Recently, the introduction of Convolutional Neural Networks (CNNs) has advanced the way image segmentation tasks are solved. Semantic image segmentation has considerably benefited from employing various CNN models. The most widely used networks in this field are U-Net and its variations. However, these models require a significant number of trainable parameters, floating-point operations per second, and great computational power to be trained. These factors make real-time semantic segmentation on low-powered devices very hard. Therefore, in the present paper, we aim to modify particular aspects of the U-Net model to improve its performance by developing a fast U-Net (FU-Net) relying on bottleneck convolution layers in the contraction and expansion paths of the model. The proposed model can be utilized in semantic segmentation applications even on devices with limited computational power and memory while ensuring state-of-the-art performance. The amount of memory required by the proposed model is reduced by a factor of 23 compared with the original U-Net. Moreover, the modifications yielded better performance. In the conducted experiments, we assessed the performance of the proposed model on two biomedical image segmentation datasets, namely 2018 Data Science Bowl and ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection. FU-Net demonstrated state-of-the-art results in biomedical image segmentation while requiring eight times fewer trainable parameters than the original U-Net model. In addition, using bottleneck layers decreased the number of computations, resulting in a nearly 30% speed-up at the training, validation, and test stages. Furthermore, despite relying on fewer parameters, FU-Net achieved a slight improvement in performance in terms of the pixel accuracy, Jaccard index, and Dice coefficient evaluation metrics.
KW - Batch normalization
KW - Biomedical image segmentation
KW - Bottleneck convolution layers
KW - CNN
KW - U-Net
UR - http://www.scopus.com/inward/record.url?scp=85098800373&partnerID=8YFLogxK
U2 - 10.1007/s00530-020-00726-w
DO - 10.1007/s00530-020-00726-w
M3 - Article
AN - SCOPUS:85098800373
SN - 0942-4962
VL - 27
SP - 637
EP - 650
JO - Multimedia Systems
JF - Multimedia Systems
IS - 4
ER -