参考文献
[1] Manning C D, Raghavan P, Schütze H. An Introduction to Information Retrieval[M]. New York, USA: Cambridge University Press, 2008.
[2] Zhang Jiangong, Long Xiaohui, Suel T. Performance of Com- pressed Inverted List Caching in Search Engines[C]//Proc. of the 17th International Conference on World Wide Web. New York, USA: ACM Press, 2008.
[3] Anh V N, Moffat A. Inverted Index Compression Using Word-aligned Binary Codes[J]. Information Retrieval, 2005, 8(1): 151-166.
[4] Anh V N, Moffat A. Improved Word-aligned Binary Compre- ssion for Text Indexing[J]. Knowledge and Data Engineering, 2006, 18(6): 857-861.
[5] Elias P. Universal Codeword Sets and Representations of the Integers[J]. IEEE Transactions on Information Theory, 1975, 21(2): 194-203.
[6] Witten I H, Moffat A, Bell T C. Managing Gigabytes: Compressing and Indexing Documents and Images[M]. 2nd ed. San Francisco, USA: Morgan Kaufmann, 1999.
[7] Rice R. Plaunt J. Adaptive Variable-length Coding for Effi- cient Compression of Spacecraft Television Data[J]. IEEE Transactions on Communication Technology, 1971, 19(6): 889-897.
[8] Walder J, Kratky M, Baca R, et al. Fast Decoding Algorithms for Variable-lengths Codes[J]. Information Sciences, 2012,
183(1): 66-91.
[9] Scholer F, Williams H, Yiannis J, et al. Compression of Inverted Indexes for Fast Query Evaluation[C]//Proc. of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. Tampere, Finland: ACM Press, 2002.
[10] Dean J. Challenges in Building Large-scale Information Re- trieval Systems: Invited Talk[C]//Proc. of the 2nd ACM International Conference on Web Search and Data Mining. New York, USA: ACM Press, 2009.
[11] Stepanov A A. SIMD-based Decoding of Posting Lists[C]//Proc. of the 20th ACM International Conference on Information and Knowledge Management. Glasgow, UK: ACM Press, 2011.
[12] Heman S. Super-scalar Database Compression Between RAM and CPU-cache[C]//Proc. of the 22nd International Conference on Data Engineering. Amsterdam, Holland: [s. n.], 2005.
[13] Delbru R, Campinas S, Tummarello G. Searching Web Data: An Entity Retrieval and High-performance Indexing Model[J]. Web Semantics, 2012, 10(1): 33-58.
[14] Anh V N, Moffat A. Index Compression Using 64-bit Words[J]. Software: Practice and Experience, 2010, 40(2): 131-147.
[15] 朱 虹, 吴 林. 倒排索引压缩及在RDBMS全文检索中的实现[J]. 华中科技大学学报: 自然科学版, 2005, 33(4): 7-9.
[16] 纪 蕾, 陈 英. 基于文档重排的索引压缩技术[J]. 清华大学学报: 自然科学版, 2005, 45(S1): 1828-1832.
[17] 王 虎, 王潜平, 对几种倒排文件压缩技术的研究与分 析[J]. 计算机工程与应用, 2006, 42(7): 169-173.
[18] Berger A L, Stephen A, Pietra D, et al. A Maximum Entropy Approach to Natural Language Processing[J]. Computational Linguistics, 1996, 22(1): 39-71.
编辑 任吉慧 |