Research on Text Representation of Video Content Based on Multi-Modal Fusion and Multi-Layer Attention
ZHAO Hong, GUO Lan, CHEN Zhiwen, ZHENG Houze
Computer Engineering. 2022, (10): 45-54. DOI: 10.19678/j.issn.1000-3428.0063294