| 1 |  REINSEL D ,  RYDNING J ,  GANTZ J F .  Worldwide global data sphere forecast, 2021—2025: the world keeps creating more data-now, what do we do with it all. International Data Corporation Research, 2021, 6 (12): 41- 61. | 
																													
																						| 2 |  XIA W ,  JIANG H ,  FENG D , et al.  A comprehensive study of the past, present, and future of data deduplication. Proceedings of the IEEE, 2016, 104 (9): 1681- 1710.  doi: 10.1109/JPROC.2016.2571298
 | 
																													
																						| 3 | SHILANE P, WALLACE G, HUANG M, et al. Delta compressed and deduplicated storage using stream-informed locality[C]//Proceedings of the 4th USENIX Workshop on Hot Topics in Storage and File Systems. Boston, USA: USENIX, 2012: 1-5. | 
																													
																						| 4 |  XIA W ,  JIANG H ,  FENG D , et al.  DARE: a deduplication-aware resemblance detection and elimination scheme for data reduction with low overheads. IEEE Transactions on Computers, 2016, 65 (6): 1692- 1705.  doi: 10.1109/TC.2015.2456015
 | 
																													
																						| 5 | ZOU X Y, XIA W, SHILANE P, et al. Building a high-performance fine-grained deduplication framework for backup storage with high deduplication ratio[C]//Proceedings of 2022 USENIX Annual Technical Conference. Carlsbad, USA: USENIX, 2022: 19-36. | 
																													
																						| 6 |  | 
																													
																						| 7 |  | 
																													
																						| 8 | MUTHITACHAROEN A, CHEN B J, MAZIōRES D. A low-bandwidth network file system[C]//Proceedings of the 18th ACM Symposium on Operating Systems Principles. New York, USA: ACM Press, 2001: 174-187. | 
																													
																						| 9 | 冯丹.  大数据时代存储相关技术研究(二). 智能物联技术, 2021, 4 (1): 1- 8. | 
																													
																						|  |  FENG D .  Storage technologies in the big data era. Technology of IoT AND AI, 2021, 4 (1): 1- 8. | 
																													
																						| 10 | 田磊, 冯丹, 岳银亮, 等.  磁盘存储系统节能技术研究综述. 智能物联技术, 2010, 37 (9): 1- 5. | 
																													
																						|  |  TIAN L ,  FENG D ,  YUE Y L , et al.  Survey on energy-saving technologies of disk-based storage systems. Technology of IoT & AI, 2010, 37 (9): 1- 5. | 
																													
																						| 11 | QUINLAN S, DORWARD S. Venti: a new approach to archival storage[C]//Proceedings of the 1st USENIX Conference on File and Storage Technologies. Monterey, USA: USENIX, 2002: 89-101. | 
																													
																						| 12 | 夏文. 数据备份系统中冗余数据的高性能消除技术研究[D]. 武汉: 华中科技大学, 2014. | 
																													
																						|  | XIA W. Research on high performance elimination technology of redundant data in data backup system[D]. Wuhan: Huazhong University of Science and Technology, 2014. (in Chinese) | 
																													
																						| 13 | WILDANI A, MILLER E L, RODEH O. HANDS: a heuristically arranged non-backup in-line deduplication system[C]//Proceedings of the 29th International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2013: 446-457. | 
																													
																						| 14 | YOU L L, POLLACK K T, LONG D D E. Deep store: an archival storage system architecture[C]//Proceedings of the 21st International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2005: 804-815. | 
																													
																						| 15 | 周玉坤, 冯丹, 夏文, 等.  面向数据去重的基于二次哈希的收敛加密策略. 计算机工程与科学, 2016, 38 (9): 1755- 1762. | 
																													
																						|  |  ZHOU Y K ,  FENG D ,  XIA W , et al.  A twice-hash based convergent encryption strategy for data deduplication. Computer Engineering & Science, 2016, 38 (9): 1755- 1762. | 
																													
																						| 16 |  | 
																													
																						| 17 | DOUGLIS F, IYENGAR A. Application-specific delta-encoding via resemblance detection[C]//Proceedings of the 2003 USENIX Annual Technical Conference. San Antonio, USA: USENIX, 2003: 113-126. | 
																													
																						| 18 | BRODER A Z. On the resemblance and containment of documents[C]//Proceedings of SEQUENCES'97. Washington D. C., USA: IEEE Press: 21-29. | 
																													
																						| 19 | MACDONALD J. File system support for delta compression[D]. Berkeley, USA: University of California at Berkeley, 2000. | 
																													
																						| 20 | ZOU X Y, DENG C, XIA W, et al. Odess: speeding up resemblance detection for redundancy elimination by fast content-defined sampling[C]//Proceedings of the IEEE 37th International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2021: 480-491. | 
																													
																						| 21 |  KARP R M ,  RABIN M O .  Efficient randomized pattern-matching algorithms. IBM Journal of Research and Development, 1987, 31 (2): 249- 260.  doi: 10.1147/rd.312.0249
 | 
																													
																						| 22 |  XIA W ,  JIANG H ,  FENG D , et al.  Ddelta: a deduplication-inspired fast delta compression approach. Performance Evaluation, 2014, 79, 258- 272.  doi: 10.1016/j.peva.2014.07.016
 | 
																													
																						| 23 | NI F, JIANG S. RapidCDC: leveraging duplicate locality to accelerate chunking in CDC-based deduplication systems[C]//Proceedings of the ACM Symposium on Cloud Computing. New York, USA: ACM Press, 2019: 220-232. | 
																													
																						| 24 | WAN B, PU L F, ZOU X Y, et al. SuperCDC: a hybrid design of high-performance content-defined chunking for fast deduplication[C]//Proceedings of the IEEE 40th International Conference on Computer Design. Washington D. C., USA: IEEE Press, 2022: 170-178. | 
																													
																						| 25 | CONSTANTINESCU C, LU M H. Quick estimation of data compression and de-duplication for large storage systems[C]//Proceedings of the 1st International Conference on Data Compression, Communications and Processing. Washington D. C., USA: IEEE Press, 2011: 98-102. | 
																													
																						| 26 | KATTAN A, POLI R. Genetic-programming based prediction of data compression saving[C]//Proceedings of International Conference on Artificial Evolution. Berlin, Germany: Springer, 2010: 182-193. | 
																													
																						| 27 |  | 
																													
																						| 28 |  ZIV J ,  LEMPEL A .  A universal algorithm for sequential data compression. IEEE Transactions on Information Theory, 1977, 23 (3): 337- 343.  doi: 10.1109/TIT.1977.1055714
 | 
																													
																						| 29 |  ZIV J ,  LEMPEL A .  Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory, 1978, 24 (5): 530- 536.  doi: 10.1109/TIT.1978.1055934
 |