作者投稿和查稿 主编审稿 专家审稿 编委审稿 远程编辑

计算机工程

• •    

MTM3D:融合Mamba与改进TTM的3D医学图像分析网络

  • 发布日期:2025-05-09

MTM3D:3D Medical Image Classification Network Integrating Mamba and Improved TTM

  • Published:2025-05-09

摘要: 生物医学成像在诊断和治疗多种疾病中起着至关重要的作用。将深度学习方法应用于医学图像分析能够提高医学图像的可读性,为临床决策提供更可靠的支持。然而,传统的医学图像处理方法在有效捕获三维图像中的空间特征和复杂结构信息方面存在一定局限性,尤其是在处理不同成像方式生成的复杂3D医学影像时,模型的精度和泛化能力常常受到挑战。针对这一挑战,提出了一种MTM3D模型用于医学图像分类任务,该模型结合了Mamba模型在复杂序列任务的优异性能与改进令牌图灵机(Token Turning Machines, TTM)网络的外部记忆存储功能。通过引入循环链式存储结构,MTM3D能够在记忆单元中有效交互不同空间结构的特征,从而提升对复杂空间关系的捕捉能力;此外,Mamba的引入进一步增强了记忆单元与处理单元的交互能力,使模型具备更强的泛化能力,在不同的医学影像数据集上表现出色。实验结果表明,MTM3D在MedMNIST v2数据集上的医学图像理解能力表现优异。相比现有最佳的医学图像分析网络,MTM3D的平均准确率ACC提升了3.97%,平均曲线下面积AUC提升了2.00%,充分展示了其在医学影像解读和协助医疗专业人员进行诊断与治疗规划中的巨大潜力。

Abstract: Biomedical imaging plays a crucial role in the diagnosis and treatment of various diseases. The application of deep learning methods to medical image analysis can enhance the readability of medical images and provide more reliable support for clinical decision-making. However, traditional medical image processing methods face certain limitations in effectively capturing spatial features and complex structural information in 3D images, especially when handling complex 3D medical images generated by different imaging modalities. This often challenges the model's accuracy and generalization ability. To address this challenge, an MTM3D model is proposed for medical image classification tasks. This model combines the excellent performance of the Mamba model in complex sequential tasks with the external memory storage function of the improved Token Turning Machines (TTM) network. By introducing a cyclic chain storage structure, MTM3D enables effective interaction of features from different spatial structures within memory units, thus enhancing its ability to capture complex spatial relationships. Furthermore, the incorporation of Mamba further strengthens the interaction between the memory and processing units, allowing the model to possess stronger generalization capability and perform excellently across different medical imaging datasets. Experimental results demonstrate that MTM3D exhibits outstanding medical image understanding capabilities on the MedMNIST v2 dataset. Compared to the current best medical image analysis networks, MTM3D improves the average accuracy (ACC) by 3.97% and the average area under the curve (AUC) by 2.00%, fully showcasing its tremendous potential in medical image interpretation and assisting healthcare professionals in diagnosis and treatment planning.