Medical and Health Question Classification Based on Multi-feature Fusion and Hybrid Neural Network

doi:10.19678/j.issn.1000-3428.0069817

Abstract:

In the medical and health domain, existing question classification methods represent text features weakly and, for multi-class questions, ignore the differing weights of keyword features, which reduces classification accuracy. To address these issues, a Medical Problem Classification method based on Multi-Feature Fusion and a Hybrid Neural Network (MPC-MFF-HNN) is proposed. First, the method combines the RoBERTa-wwm-ext and Word2Vec models to produce character-level and word-level vector representations of the text, obtaining rich multi-feature information. This compensates for the limitations of single-feature representations and lets the model understand and represent the semantics of complex medical and health text more fully. Second, a hybrid neural network model, MHA-APTC-BiGRU, is designed by combining a multi-head attention mechanism with an improved Text Convolutional Neural Network (TextCNN) and a Bidirectional Gated Recurrent Unit (BiGRU). Its multi-level feature extraction captures deep text features that carry keyword weights. Finally, a classifier takes the semantically enhanced feature vectors as input and predicts the question category. Experiments on real public datasets show that, compared with the baseline algorithms, the method significantly improves precision, recall, and F1 score, indicating better performance on medical and health question classification.

Key words: multi-feature fusion, hybrid neural network, multi-label text classification, attention mechanism, medical health
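
The two ideas named in the abstract, fusing character-level and word-level vectors and weighting positions by attention, can be illustrated with a toy sketch. This is not the authors' MPC-MFF-HNN implementation. It assumes that fusion is plain concatenation, that each character is aligned to the word containing it, and that pooling uses a single query vector with softmax weights. All function names, the alignment list, and the tiny hand-written vectors (stand-ins for RoBERTa-wwm-ext and Word2Vec outputs) are hypothetical.

```python
import math


def fuse_char_and_word(char_vecs, word_vecs, char_to_word):
    """Concatenate each character vector with the vector of its containing word.

    Assumption: concatenation is the fusion operator; the abstract does not say.
    """
    return [c + word_vecs[w] for c, w in zip(char_vecs, char_to_word)]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(seq, query):
    """Generic attention pooling: softmax(dot(position, query)) weighted sum."""
    scores = [sum(x * q for x, q in zip(vec, query)) for vec in seq]
    weights = softmax(scores)
    dim = len(seq[0])
    pooled = [sum(w * vec[d] for w, vec in zip(weights, seq)) for d in range(dim)]
    return pooled, weights


# Toy inputs: three characters, two words (characters 0 and 1 belong to word 0).
char_vecs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # stand-in for character-level vectors
word_vecs = [[0.5], [2.0]]                        # stand-in for word-level vectors
char_to_word = [0, 0, 1]                          # hypothetical alignment

fused = fuse_char_and_word(char_vecs, word_vecs, char_to_word)
query = [0.0, 0.0, 1.0]                           # hypothetical learned query vector
pooled, weights = attention_pool(fused, query)
```

In this sketch the position whose fused vector scores highest against the query receives the largest weight, which is one simple way a model can emphasize keyword-bearing positions before classification.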

LIU Chang, LIANG Bingxue, TIAN Rongkun, QIN Yuhua. Medical and Health Question Classification Based on Multi-feature Fusion and Hybrid Neural Network[J]. Computer Engineering, 2026, 52(2): 342-355.

https://www.ecice06.com/CN/Y2026/V52/I2/342

Figures/Tables: 17

Fig.1 MPC-MFF-HNN model architecture

Fig.2 APTC model framework

Fig.3 Attention pooling layer

Fig.4 GRU structure diagram

Fig.5 Effect of the number of hidden-layer nodes

Fig.6 Effect of the number of filters

Fig.7 Effect of the number of heads in multi-head attention

Fig.8 Classification results on the title dataset

Fig.9 Classification results on the SMP2017 dataset
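
The abstract reports results as precision, recall, and F1 score. As a minimal sketch of how these metrics could be computed for multi-label predictions, the following assumes micro-averaging over label sets. The averaging scheme used in the paper is not stated in this excerpt, and the function name and label names below are hypothetical.

```python
def micro_prf(gold, pred):
    """Micro-averaged precision, recall, and F1 over per-sample label sets."""
    tp = sum(len(g & p) for g, p in zip(gold, pred))  # labels correctly predicted
    fp = sum(len(p - g) for g, p in zip(gold, pred))  # predicted but not in gold
    fn = sum(len(g - p) for g, p in zip(gold, pred))  # in gold but not predicted
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Hypothetical gold and predicted label sets for three questions.
gold = [{"symptom"}, {"treatment", "drug"}, {"diagnosis"}]
pred = [{"symptom"}, {"treatment"}, {"drug"}]

precision, recall, f1 = micro_prf(gold, pred)
```

Micro-averaging pools true positives, false positives, and false negatives across all samples before computing the ratios, so frequent labels dominate the score; macro-averaging per label would be the alternative if rare categories should count equally.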


