Semi-supervised learning has demonstrated great potential in medical image segmentation by exploiting knowledge from unlabelled data. However, most existing approaches do not explicitly capture high-level semantic relations between distant regions, which limits their performance. In this paper, we focus on representation learning for semi-supervised segmentation, developing a novel Multi-Scale Cross Supervised Contrastive Learning (MCSC) framework to segment structures in medical images. We jointly train CNN and Transformer models, regularising their features to be semantically consistent across different scales. Our approach contrasts multi-scale features based on ground-truth and cross-predicted labels, extracting robust feature representations that reflect intra- and inter-slice relationships across the whole dataset. To tackle class imbalance, we take the prevalence of each class into account when guiding contrastive learning, ensuring that the learnt features adequately capture infrequent classes. Extensive experiments on two multi-structure medical segmentation datasets demonstrate the effectiveness of MCSC: it not only outperforms state-of-the-art semi-supervised methods by more than 3.0% in Dice, but also greatly reduces the performance gap with fully supervised methods.
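To make the class-balanced contrastive objective concrete, the sketch below implements a supervised contrastive loss in PyTorch over per-region feature vectors, where each feature's label comes from the ground truth on labelled slices or from the other model's prediction (cross-supervision) on unlabelled ones, and rare classes receive larger weight. This is a minimal sketch rather than the paper's implementation: the function name, the inverse-frequency weighting scheme, and the temperature value are illustrative assumptions.

import torch
import torch.nn.functional as F

def weighted_supervised_contrastive_loss(features, labels, temperature=0.1):
    """Class-prevalence-weighted supervised contrastive loss (sketch).

    features: (N, D) feature vectors, e.g. pooled multi-scale patch
              embeddings from the CNN or Transformer branch.
    labels:   (N,) class index per feature, from ground truth (labelled
              slices) or the other model's prediction (cross-supervision).
    """
    features = F.normalize(features, dim=1)
    sim = features @ features.t() / temperature                    # (N, N)
    logits = sim - sim.max(dim=1, keepdim=True).values.detach()    # stability

    # Positives: other samples sharing the anchor's label.
    same_class = labels.unsqueeze(0) == labels.unsqueeze(1)
    self_mask = torch.eye(len(labels), dtype=torch.bool, device=labels.device)
    pos_mask = same_class & ~self_mask

    # Log-softmax over all samples except the anchor itself.
    exp_logits = torch.exp(logits).masked_fill(self_mask, 0.0)
    log_prob = logits - torch.log(exp_logits.sum(dim=1, keepdim=True) + 1e-12)

    # Mean log-likelihood of the positives for each anchor.
    pos_count = pos_mask.sum(dim=1).clamp(min=1)
    mean_log_prob_pos = (log_prob * pos_mask).sum(dim=1) / pos_count

    # Inverse class-frequency weighting: infrequent classes count more.
    class_freq = torch.bincount(labels).float() / len(labels)
    weights = 1.0 / (class_freq[labels] + 1e-12)
    weights = weights / weights.sum()

    return -(weights * mean_log_prob_pos).sum()

In use, one such loss term would be computed per feature scale and summed with the supervised segmentation and cross-supervision losses; anchors whose class has no other positive in the batch simply contribute zero.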
@inproceedings{Liu_2023_BMVC,
  author    = {Qianying Liu and Xiao Gu and Paul Henderson and Fani Deligianni},
  title     = {Multi-Scale Cross Contrastive Learning for Semi-Supervised Medical Image Segmentation},
  booktitle = {34th British Machine Vision Conference 2023, {BMVC} 2023, Aberdeen, UK, November 20-24, 2023},
  publisher = {BMVA},
  year      = {2023},
  url       = {https://bmvc2022.mpi-inf.mpg.de/BMVC2023/0868.pdf}
}