Confidence Adjustment and Unbiased Optimal Transport based Partial Label Learning

doi:10.19678/j.issn.1000-3428.0260065

Abstract

Abstract: Partial label learning is a typical weakly supervised learning paradigm in which each training instance is assigned a candidate label set that contains the true label. The goal of partial label learning is to identify the ground-truth label from the candidate set for each instance. In real-world applications, partial label data usually exhibit class imbalance. This makes learning methods based on prediction confidence and label refinement prone to bias and thus degrades classification performance. This issue is more severe in long-tailed scenarios, where head classes dominate the disambiguation process and tail classes are insufficiently learned. Moreover, existing optimal transport–based label refinement methods still suffer from systematic bias in imbalanced scenarios. To address these issues, this paper proposes a method named C2DOT-PLL for long-tailed partial label learning. While preserving the global consistency advantage of optimal transport, the method first employs a dynamic confidence calibration mechanism to alleviate unfair comparisons caused by inconsistent confidence scales across classes and to reduce the impact of class imbalance on instance-level label competition. Then, an unbiased optimal transport scheme is introduced in the pseudo-label refinement stage to correct the systematic bias induced by entropic regularization, thereby producing more accurate pseudo labels. Experiments are conducted on multiple benchmark datasets with different imbalance levels. The results show that, compared with existing partial label learning methods, C2DOT-PLL achieves the best overall classification accuracy.

摘要： 偏标签学习是一类典型的弱监督学习方法，其训练样本被赋予一个包含真实标签的候选标签集合。偏标签学习的目标是在每个样本的候选标签集合中识别其真实标签。在实际应用中，偏标签数据通常呈现类别不平衡特性，使得基于预测置信度和标签细化的学习方法容易产生偏置，进而影响模型的分类性能。尤其是在长尾场景下，头部类往往在标签消歧过程中占据主导地位，尾部类难以获得有效学习。此外，现有基于最优传输的标签细化方法在不平衡场景下仍存在系统性偏置问题。针对上述问题，本文提出了一种面向长尾偏标签学习方法C2DOT-PLL。该方法在保持最优传输全局一致性优势的基础上，首先通过动态样本预测置信度校准机制缓解不同类别预测置信度尺度不一致带来的比较不公平问题，减少类别不平衡对样本级标签竞争的影响；随后，在伪标签细化阶段引入无偏最优传输，对熵正则化最优传输引入的系统性偏差进行修正，从而获得更加准确的伪标签。在多个具有不同平衡程度的基准数据集上对所提出的方法进行实验验证。实验结果表明，与现有偏标签学习方法相比，C2DOT-PLL在总体分类准确性取得了最优性能。

FAN Tianhao, QI Lianyong, YANG Yijie, LI Chong, SONG Te, ZHANG Dejiang. Confidence Adjustment and Unbiased Optimal Transport based Partial Label Learning[J]. Computer Engineering, doi: 10.19678/j.issn.1000-3428.0260065.

范天豪, 齐连永 , 杨忆杰, 李冲, 宋特, 张德江. 基于置信度调整与无偏最优传输的偏标签学习方法[J]. 计算机工程, doi: 10.19678/j.issn.1000-3428.0260065.

/ Recommend / Download Citations

URL: https://www.ecice06.com/EN/10.19678/j.issn.1000-3428.0260065

References

[1] SCHMIDHUBER J. Deep learning in neural networks: An Overview[J]. Neural Networks, 2015, 61: 85-117.
[2] JANIESCH C, ZSCHECH P, HEINRICH K. Machine learning and deep learning[J]. Electronic Markets, 2021, 31(3): 685-695.
[3] YAO B, KHOSLA A, FEI-FEI L. Combining randomization and discrimination for fine-grained image categorization[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2011: 1577-1584.
[4] COUR T, SAPP B, TASKAR B. Learning from partial labels[J]. Journal of Machine Learning Research, 2011, 12: 1501-1536.
[5] QIAO C, XU N, GENG X. Decompositional generation process for instance-dependent partial label learning[EB/OL]. arXiv preprint arXiv: 2204.03845, 2022. [2026-02-21]. https://arxiv.org/abs/2204.03845.
[6] 欧阳宵,陶红,范瑞东,等.利用标签相关性先验的弱监督多标签学习方法[J].软件学报,2023,34(04):1732-1748. OUYANG X, TAO H, FAN R D, et al. Weakly supervised multi-label learning method based on label correlation prior[J]. Journal of Software, 2023, 34(4): 1732-1748.
[7] REN X, HE W, QU M, et al. Label noise reduction in entity typing by heterogeneous partial-label embedding[C]//Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM Press, 2016: 1825-1834.
[8] CHEN C H, PATEL V M, CHELLAPPA R. Learning from ambiguously labeled face images[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 40(7): 1653-1667.
[9] LIU L, DIETTERICH T. A conditional multinomial mixture model for superset label learning. Advances in Neural Information Processing Systems, 2012, 25: 548-556.
[10] YU F, ZHANG M L. Maximum margin partial label learning[C]//Proceedings of the Asian Conference on Machine Learning. New York, USA: PMLR Press, 2016: 96-111.
[11] LYU G, FENG S, WANG T, et al. GM-PLL: Graph matching based partial label learning[J]. IEEE Transactions on Knowledge and Data Engineering, 2019, 33(2): 521-535.
[12] WANG D B, LI L, ZHANG M L. Adaptive graph guided disambiguation for partial label learning[C]//Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. New York, USA: ACM Press, 2019: 83-91.
[13] Zhang F, Feng L, Han B, et al. Exploiting class activation value for partial-label learning[C]//Proceedings of the International Conference on Learning Representations. 2021.
[14] TIAN S, WEI H, WANG Y, et al. Crosel: Cross selection of confident pseudo labels for partial-label learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2024: 19479-19488.
[15] 刘金硕,王代辰,邓娟,等.基于长尾分类算法的网络不良信息分类[J].计算机工程,2023,49(08):13-19+28. LIU J S, WANG D C, DENG J, et al. Network undesirable information classification based on long-tailed classification algorithms[J]. Computer Engineering, 2023, 49(08): 13-19+28.
[16] ANDERSON C, NISSLEY C, ANDERSON C. The long tail[M]. New York, USA: Hyperion Audiobooks, 2006.
[17] ZHANG Y, KANG B, HOOI B, et al. Deep long-tailed learning: A survey[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 45(9): 10795-10816.
[18] WANG H, XIA M, LI Y, et al. Solar: Sinkhorn label refinery for imbalanced partial-label learning. Advances in Neural Information Processing Systems, 2022, 35: 8104-8117.
[19] PEYRÉ G, CUTURI M. Computational optimal transport: With applications to data science[J]. Foundations and Trends in Machine Learning, 2019, 11(5-6): 355-607.
[20] Villani C. Optimal transport: old and new[M]. Berlin, Germany: Springer, 2008.
[21] WANG Y, CHEN H, HENG Q, et al. Freematch: Self-adaptive thresholding for semi-supervised learning[EB/OL]. arXiv preprint arXiv:2205.07246, 2022. [2026-02-21]. https://arxiv.org/abs/2205.07246.
[22] FEYDY J, SÉJOURNÉ T, VIALARD F X, et al. Interpolating between optimal transport and mmd using sinkhorn divergences[C]//Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics. Sardinia, Italy: JMLR Press, 2019: 2681-2690.
[23] XU T, DAN J. EHM: Exploring dynamic alignment and hierarchical clustering in unsupervised domain adaptation via high-order moment-guided contrastive learning[J]. Neural Networks, 2025, 185: 107188.
[24] GENEVAY A, PEYRÉ G, CUTURI M. Learning generative models with sinkhorn divergences[C]// Proceedings of the 21st International Conference on Artificial Intelligence and Statistics. Sardinia, Italy: JMLR Press, 2018: 1608-1617.
[25] LYU G, WU Y, FENG S. Deep Graph Matching for Partial Label Learning[C]//Proceedings of the 21st International Joint Conference on Artificial Intelligence. Washington D.C., USA: IEEE Press, 2022: 3306-3312.
[26] LV J, XU M, FENG L, et al. Progressive identification of true labels for partial-label learning[C]//Proceedings of International Conference on Machine Learning. New York, USA: PMLR Press, 2020: 6500-6510.
[27] HE S, YANG G, FENG L. Candidate-aware selective disambiguation based on normalized entropy for instance-dependent partial-label learning[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. Washington D.C., USA: IEEE Press, 2023: 1792-1801.
[28] WU D D, WANG D B, ZHANG M L. Revisiting consistency regularization for deep partial label learning[C]//Proceedings of International Conference on Machine Learning. New York, USA: PMLR Press, 2022: 24212-24225.
[29] WEN H, CUI J, HANG H, et al. Leveraged weighted loss for partial label learning[C]//Proceedings of International Conference on Machine Learning. New York, USA: PMLR Press, 2021: 11091-11100.
[30] WANG H, XIAO R, LI Y, et al. PiCO: Contrastive Label Disambiguation for Partial Label Learning[C]//Proceedings of the International Conference on Learning Representations, 2022.
[31] WANG Y, ZHANG W, ZHANG M L. Partial Label Causal Representation Learning for Instance-Dependent Supervision and Domain Generalization[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Washington D. C., USA: AAAI Press, 2025, 39(20): 21366-21374.
[32] HE K, ZHANG X, REN S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2016: 770-778.
[33] 曹杰彰,莫朗元,杜卿,等.基于最优传输理论的联合分布匹配方法及应用[J].计算机学报,2021,44(06):1233-1245. Cao J Z, Mo L Y, Du Q, et al. Joint distribution matching method based on optimal transport theory and its applications[J]. Chinese Journal of Computers, 2021, 44(06): 1233-1245.
[34] 段杰,宋丽娟,马子睿.基于多层次最优传输的癌症生存预测研究[J/OL].计算机工程,1-13. [2026-02-20]. https://doi.org/10.19678/j.issn.1000-3428.0252923.
Duan J, Song L J, Ma Z R. Cancer survival prediction based on multi-level optimal transport[J/OL]. Computer Engineering, 2026, 1-13. [2026-02-20]. https://doi.org/10.19678/j.issn.1000-3428.0252923.
[35] CUTURI M. Sinkhorn distances: Lightspeed computation of optimal transport. Advances in Neural Information Processing Systems, 2013, 26: 2292-2300.
[36] CAO K, WEI C, GAIDON A, et al. Learning imbalanced datasets with label-distribution-aware margin loss. Advances in Neural Information Processing Systems, 2019, 32: 1567-1578.
[37] WEI C, SOHN K, MELLINA C, et al. Crest: A class-rebalancing self-training framework for imbalanced semi-supervised learning[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2021: 10857-10866.
[38] KRIZHEVSKY A, HINTON G. Learning multiple layers of features from tiny images [EB/OL]. [2026-2-21]. https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf
[39] BRIGGS F, FERN X Z, RAICH R. Rank-loss support instance machines for MIML instance annotation[C]//Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA: ACM Press, 2012: 534-542.
[40] ZENG Z, XIAO S, JIA K, et al. Learning by associating ambiguously labeled images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Washington D.C., USA: IEEE Press, 2013: 708-715.
[41] FENG L, LV J, HAN B, et al. Provably consistent partial-label learning. Advances in Neural Information Processing Systems, 2020, 33: 10948-10960.
[42] XU N, QIAO C, GENG X, et al. Instance-dependent partial label learning. Advances in Neural Information Processing Systems, 2021, 34: 27119-27130.
[43] SUTSKEVER I, MARTENS J, DAHL G, et al. On the importance of initialization and momentum in deep learning[C]//International Conference on Machine Learning. New York, USA: PMLR Press, 2013: 1139-1147.

Please choose a citation manager

Content to export