TY - GEN
T1 - TransUNet-Lite
T2 - 7th International Conference on Medical and Health Informatics, ICMHI 2023
AU - Khan, Muhammad Salman
AU - Ali, Shahzad
AU - Lee, Yu Rim
AU - Kang, Min Kyu
AU - Park, Soo Young
AU - Tak, Won Young
AU - Jung, Soon Ki
N1 - Publisher Copyright:
© 2023 ACM.
PY - 2023/5/12
Y1 - 2023/5/12
N2 - Deep convolutional neural networks have demonstrated superior performance in a variety of vision tasks. For biomedical applications, these methods suffer from problems such as predicting reliable segmentation masks for variable size input images, insufficient data and imbalanced datasets. This paper introduces an efficient and lightweight TransUNet, termed as TransUNet-Lite, that exploits rich feature representations produced by the convolution-based feature extractor, an external attention module instead of conventional self-attention, a fast token selector module, and skip connections from the feature extractor to the decoder to provide lost rich contextual information. The proposed network takes patches as input rather than resized images that fail to care for the original aspect ratio. For the nuclei segmentation task on the 2018 Science Bowl dataset, our TransUNet-Lite outperformed other SOTA networks, with the highest DSC of 93.08% and IoU of 87.95%. The results of our experiments provide insight into the impact of certain network design decisions. By configuring a transformer in a simplistic and efficient manner, it is possible to achieve segmentation quality that is at least equal to SOTA network architectures.
AB - Deep convolutional neural networks have demonstrated superior performance in a variety of vision tasks. For biomedical applications, these methods suffer from problems such as predicting reliable segmentation masks for variable size input images, insufficient data and imbalanced datasets. This paper introduces an efficient and lightweight TransUNet, termed as TransUNet-Lite, that exploits rich feature representations produced by the convolution-based feature extractor, an external attention module instead of conventional self-attention, a fast token selector module, and skip connections from the feature extractor to the decoder to provide lost rich contextual information. The proposed network takes patches as input rather than resized images that fail to care for the original aspect ratio. For the nuclei segmentation task on the 2018 Science Bowl dataset, our TransUNet-Lite outperformed other SOTA networks, with the highest DSC of 93.08% and IoU of 87.95%. The results of our experiments provide insight into the impact of certain network design decisions. By configuring a transformer in a simplistic and efficient manner, it is possible to achieve segmentation quality that is at least equal to SOTA network architectures.
KW - Cell nuclei segmentation
KW - External attention
KW - Lightweight TransUNet
KW - Medical image segmentation
KW - Token selection
UR - http://www.scopus.com/inward/record.url?scp=85178041928&partnerID=8YFLogxK
U2 - 10.1145/3608298.3608344
DO - 10.1145/3608298.3608344
M3 - Conference contribution
AN - SCOPUS:85178041928
T3 - ACM International Conference Proceeding Series
SP - 251
EP - 258
BT - ICMHI 2023 - 2023 the 7th International Conference on Medical and Health Informatics
PB - Association for Computing Machinery
Y2 - 12 May 2023 through 14 May 2023
ER -