| 1 | SARACLAR M, SPROAT R. Lattice-based search for spoken utterance retrieval[C]//Proceedings of HLT-NAACLʼ04. Washington D. C., USA: IEEE Press, 2004: 129-136. | 
																													
																							| 2 | CAN D, SARACLAR M. Lattice indexing for spoken term detection. IEEE Transactions on Audio, Speech, and Language Processing, 2011, 19 (8): 2338- 2347.  doi: 10.1109/TASL.2011.2134087
 | 
																													
																							| 3 | MAMOU J, RAMABHADRAN B, SIOHAN O. Vocabulary independent spoken term detection[C]//Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 2007: 615-622. | 
																													
																							| 4 | ROSE R C, PAUL D B. A hidden Markov model based keyword recognition system[C]//Proceedings of International Conference on Acoustics, Speech, and Signal Processing. Albuquerque, USA: IEEE Press, 1990: 129-132. | 
																													
																							| 5 | SZÖKE I, SCHWARZ P, MATĚJKA P, et al. Phoneme based acoustics keyword spotting in informal continuous speech[M]. Berlin, Germany: Springer, 2005. | 
																													
																							| 6 | PANCHAPAGESAN S, SUN M, KHARE A, et al. Multi-task learning and weighted cross-entropy for DNN-based keyword spotting[C]//Proceedings of INTERSPEECHʼ16. San Francisco, USA: [s. n.] 2016: 760-764. | 
																													
																							| 7 | SUN M, NAGARAJA V, HOFFMEISTER B, et al. Model shrinking for embedded keyword spotting[C]//Proceedings of the 14th IEEE International Conference on Machine Learning and Applications. Miami, USA: IEEE Press, 2015: 369-374. | 
																													
																							| 8 | WU M H, PANCHAPAGESAN S, SUN M, et al. Monophone-based background modeling for two-stage on-device wake word detection[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Washington D. C., USA: IEEE Press, 2018: 5494-5498. | 
																													
																							| 9 | CHEN G G, PARADA C, HEIGOLD G. Small-footprint keyword spotting using deep neural networks[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Florence, Italy: IEEE Press, 2014: 4087-4091. | 
																													
																							| 10 | SUN M, RAJU A, TUCKER G, et al. Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting[C]//Proceedings of IEEE Spoken Language Technology Workshop. San Diego, USA: IEEE Press, 2016: 474-480. | 
																													
																							| 11 | ARIK S Ö, KLIEGL M, CHILD R, et al. Convolutional recurrent neural networks for small-footprint keyword spotting[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1703.05390 . | 
																													
																							| 12 | LOPEZ-ESPEJO I, TAN Z H, HANSEN J H L, et al. Deep spoken keyword spotting: an overview. IEEE Access, 2022, 10, 4169- 4199.  doi: 10.1109/ACCESS.2021.3139508
 | 
																													
																							| 13 | WANG Z M, LI X L, ZHOU J. Small-footprint keyword spotting using deep neural network and connectionist temporal classifier[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1709.03665 . | 
																													
																							| 14 | 杨润延, 程高峰, 刘建. 基于端到端语音识别的关键词检索技术研究. 计算机科学, 2022, 49 (1): 53- 58.  URL
 | 
																													
																							|  | YANG R Y, CHENG G F, LIU J. Study on keyword search framework based on end-to-end automatic speech recognition. Computer Science, 2022, 49 (1): 53- 58.  URL
 | 
																													
																							| 15 |  | 
																													
																							| 16 | LI X, LI N, WENG C, et al. Replay and synthetic speech detection with Res2Net architecture[C]//Proceedings of 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. Toronto, Canada: IEEE Press, 2021: 6354-6358. | 
																													
																							| 17 | HAN K, WANG Y H, TIAN Q, et al. GhostNet: more features from cheap operations[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE Press, 2020: 1580-1589. | 
																													
																							| 18 | GAO S H, CHENG M M, ZHAO K, et al. Res2Net: a new multi-scale backbone architecture. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (2): 652- 662.  doi: 10.1109/TPAMI.2019.2938758
 | 
																													
																							| 19 | 徐梦龙, 张晓雷. 用于语音控制的低资源关键词检索系统. 信号处理, 2020, 36 (6): 879- 884.  URL
 | 
																													
																							|  | XU M L, ZHANG X L. A small footprint keyword spotting system for voice control. Journal of Signal Processing, 2020, 36 (6): 879- 884.  URL
 | 
																													
																							| 20 |  | 
																													
																							| 21 | SHRIVASTAVA A, GUPTA A, GIRSHICK R. Training region-based object detectors with online hard example mining[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, USA: IEEE Press, 2016: 761-769. | 
																													
																							| 22 | MCFEE B, RAFFEL C, LIANG D W, et al. Librosa: audio and music signal analysis in Python[C]//Proceedings of the 14th Python in Science Conference. Austin, USA: [s. n.], 2015: 18-25. | 
																													
																							| 23 | PARK D S, CHAN W, ZHANG Y, et al. Specaugment: a simple data augmentation method for automatic speech recognition[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1904.08779 . | 
																													
																							| 24 | 刘作桢, 吴愁, 黎塔, 等. 面向自定义语音唤醒的关键词相关的单通道语音增强. 声学学报, 2023, 48 (2): 415- 424.  URL
 | 
																													
																							|  | LIU Z Z, WU C, LI T, et al. Keyword-dependent monaural speech enhancement for open-vocabulary keyword spotting. Acta Acustica, 2023, 48 (2): 415- 424.  URL
 | 
																													
																							| 25 | HOU J Y, SHI Y Y, OSTENDORF M, et al. Mining effective negative training samples for keyword spotting[C]//Proceedings of 2020 IEEE International Conference on Acoustics, Speech and Signal Processing. Barcelona, Spain: IEEE Press, 2020: 7444-7448. | 
																													
																							| 26 | WANG Y M, LÜ H, POVEY D, et al. Wake word detection with streaming transformers[C]//Proceedings of 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. Toronto, Canada: IEEE Press, 2021: 5864-5868. | 
																													
																							| 27 | 王勇, 张连海. 基于词级DPPM的连续语音关键词检测. 计算机工程, 2014, 40 (5): 247- 251.  URL
 | 
																													
																							|  | WANG Y, ZHANG L H. Continuous speech keyword detection based on word level discriminative point process model. Computer Engineering, 2014, 40 (5): 247- 251.  URL
 |