Efficient KV Cache Sparsification via Ring Buffer-Based Sliding Window and Hierarchical Sparsity Enhancement
Lin Hai, Yu Guo, Yin Zeming, Xu Xianchong, Liu Yuhai
Computer Engineering (计算机工程)
DOI: 10.19678/j.issn.1000-3428.0252452