1 |
SARACLAR M, SPROAT R. Lattice-based search for spoken utterance retrieval[C]//Proceedings of HLT-NAACLʼ04. Washington D. C., USA: IEEE Press, 2004: 129-136.
|
2 |
CAN D, SARACLAR M. Lattice indexing for spoken term detection. IEEE Transactions on Audio, Speech, and Language Processing, 2011, 19 (8): 2338- 2347.
doi: 10.1109/TASL.2011.2134087
|
3 |
MAMOU J, RAMABHADRAN B, SIOHAN O. Vocabulary independent spoken term detection[C]//Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 2007: 615-622.
|
4 |
ROSE R C, PAUL D B. A hidden Markov model based keyword recognition system[C]//Proceedings of International Conference on Acoustics, Speech, and Signal Processing. Albuquerque, USA: IEEE Press, 1990: 129-132.
|
5 |
SZÖKE I, SCHWARZ P, MATĚJKA P, et al. Phoneme based acoustics keyword spotting in informal continuous speech[M]. Berlin, Germany: Springer, 2005.
|
6 |
PANCHAPAGESAN S, SUN M, KHARE A, et al. Multi-task learning and weighted cross-entropy for DNN-based keyword spotting[C]//Proceedings of INTERSPEECHʼ16. San Francisco, USA: [s. n.] 2016: 760-764.
|
7 |
SUN M, NAGARAJA V, HOFFMEISTER B, et al. Model shrinking for embedded keyword spotting[C]//Proceedings of the 14th IEEE International Conference on Machine Learning and Applications. Miami, USA: IEEE Press, 2015: 369-374.
|
8 |
WU M H, PANCHAPAGESAN S, SUN M, et al. Monophone-based background modeling for two-stage on-device wake word detection[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Washington D. C., USA: IEEE Press, 2018: 5494-5498.
|
9 |
CHEN G G, PARADA C, HEIGOLD G. Small-footprint keyword spotting using deep neural networks[C]//Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing. Florence, Italy: IEEE Press, 2014: 4087-4091.
|
10 |
SUN M, RAJU A, TUCKER G, et al. Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting[C]//Proceedings of IEEE Spoken Language Technology Workshop. San Diego, USA: IEEE Press, 2016: 474-480.
|
11 |
ARIK S Ö, KLIEGL M, CHILD R, et al. Convolutional recurrent neural networks for small-footprint keyword spotting[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1703.05390.
|
12 |
LOPEZ-ESPEJO I, TAN Z H, HANSEN J H L, et al. Deep spoken keyword spotting: an overview. IEEE Access, 2022, 10, 4169- 4199.
doi: 10.1109/ACCESS.2021.3139508
|
13 |
WANG Z M, LI X L, ZHOU J. Small-footprint keyword spotting using deep neural network and connectionist temporal classifier[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1709.03665.
|
14 |
杨润延, 程高峰, 刘建. 基于端到端语音识别的关键词检索技术研究. 计算机科学, 2022, 49 (1): 53- 58.
URL
|
|
YANG R Y, CHENG G F, LIU J. Study on keyword search framework based on end-to-end automatic speech recognition. Computer Science, 2022, 49 (1): 53- 58.
URL
|
15 |
|
16 |
LI X, LI N, WENG C, et al. Replay and synthetic speech detection with Res2Net architecture[C]//Proceedings of 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. Toronto, Canada: IEEE Press, 2021: 6354-6358.
|
17 |
HAN K, WANG Y H, TIAN Q, et al. GhostNet: more features from cheap operations[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE Press, 2020: 1580-1589.
|
18 |
GAO S H, CHENG M M, ZHAO K, et al. Res2Net: a new multi-scale backbone architecture. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (2): 652- 662.
doi: 10.1109/TPAMI.2019.2938758
|
19 |
徐梦龙, 张晓雷. 用于语音控制的低资源关键词检索系统. 信号处理, 2020, 36 (6): 879- 884.
URL
|
|
XU M L, ZHANG X L. A small footprint keyword spotting system for voice control. Journal of Signal Processing, 2020, 36 (6): 879- 884.
URL
|
20 |
|
21 |
SHRIVASTAVA A, GUPTA A, GIRSHICK R. Training region-based object detectors with online hard example mining[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, USA: IEEE Press, 2016: 761-769.
|
22 |
MCFEE B, RAFFEL C, LIANG D W, et al. Librosa: audio and music signal analysis in Python[C]//Proceedings of the 14th Python in Science Conference. Austin, USA: [s. n.], 2015: 18-25.
|
23 |
PARK D S, CHAN W, ZHANG Y, et al. Specaugment: a simple data augmentation method for automatic speech recognition[EB/OL]. [2023-02-10]. https://arxiv.org/abs/1904.08779.
|
24 |
刘作桢, 吴愁, 黎塔, 等. 面向自定义语音唤醒的关键词相关的单通道语音增强. 声学学报, 2023, 48 (2): 415- 424.
URL
|
|
LIU Z Z, WU C, LI T, et al. Keyword-dependent monaural speech enhancement for open-vocabulary keyword spotting. Acta Acustica, 2023, 48 (2): 415- 424.
URL
|
25 |
HOU J Y, SHI Y Y, OSTENDORF M, et al. Mining effective negative training samples for keyword spotting[C]//Proceedings of 2020 IEEE International Conference on Acoustics, Speech and Signal Processing. Barcelona, Spain: IEEE Press, 2020: 7444-7448.
|
26 |
WANG Y M, LÜ H, POVEY D, et al. Wake word detection with streaming transformers[C]//Proceedings of 2021 IEEE International Conference on Acoustics, Speech and Signal Processing. Toronto, Canada: IEEE Press, 2021: 5864-5868.
|
27 |
王勇, 张连海. 基于词级DPPM的连续语音关键词检测. 计算机工程, 2014, 40 (5): 247- 251.
URL
|
|
WANG Y, ZHANG L H. Continuous speech keyword detection based on word level discriminative point process model. Computer Engineering, 2014, 40 (5): 247- 251.
URL
|