1 |
REINSEL D , RYDNING J , GANTZ J F . Worldwide global data sphere forecast, 2021—2025: the world keeps creating more data-now, what do we do with it all. International Data Corporation Research, 2021, 6 (12): 41- 61.
|
2 |
XIA W , JIANG H , FENG D , et al. A comprehensive study of the past, present, and future of data deduplication. Proceedings of the IEEE, 2016, 104 (9): 1681- 1710.
doi: 10.1109/JPROC.2016.2571298
|
3 |
SHILANE P, WALLACE G, HUANG M, et al. Delta compressed and deduplicated storage using stream-informed locality[C]//Proceedings of the 4th USENIX Workshop on Hot Topics in Storage and File Systems. Boston, USA: USENIX, 2012: 1-5.
|
4 |
XIA W , JIANG H , FENG D , et al. DARE: a deduplication-aware resemblance detection and elimination scheme for data reduction with low overheads. IEEE Transactions on Computers, 2016, 65 (6): 1692- 1705.
doi: 10.1109/TC.2015.2456015
|
5 |
ZOU X Y, XIA W, SHILANE P, et al. Building a high-performance fine-grained deduplication framework for backup storage with high deduplication ratio[C]//Proceedings of 2022 USENIX Annual Technical Conference. Carlsbad, USA: USENIX, 2022: 19-36.
|
6 |
|
7 |
|
8 |
MUTHITACHAROEN A, CHEN B J, MAZIōRES D. A low-bandwidth network file system[C]//Proceedings of the 18th ACM Symposium on Operating Systems Principles. New York, USA: ACM Press, 2001: 174-187.
|
9 |
冯丹. 大数据时代存储相关技术研究(二). 智能物联技术, 2021, 4 (1): 1- 8.
|
|
FENG D . Storage technologies in the big data era. Technology of IoT AND AI, 2021, 4 (1): 1- 8.
|
10 |
田磊, 冯丹, 岳银亮, 等. 磁盘存储系统节能技术研究综述. 智能物联技术, 2010, 37 (9): 1- 5.
|
|
TIAN L , FENG D , YUE Y L , et al. Survey on energy-saving technologies of disk-based storage systems. Technology of IoT & AI, 2010, 37 (9): 1- 5.
|
11 |
QUINLAN S, DORWARD S. Venti: a new approach to archival storage[C]//Proceedings of the 1st USENIX Conference on File and Storage Technologies. Monterey, USA: USENIX, 2002: 89-101.
|
12 |
夏文. 数据备份系统中冗余数据的高性能消除技术研究[D]. 武汉: 华中科技大学, 2014.
|
|
XIA W. Research on high performance elimination technology of redundant data in data backup system[D]. Wuhan: Huazhong University of Science and Technology, 2014. (in Chinese)
|
13 |
WILDANI A, MILLER E L, RODEH O. HANDS: a heuristically arranged non-backup in-line deduplication system[C]//Proceedings of the 29th International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2013: 446-457.
|
14 |
YOU L L, POLLACK K T, LONG D D E. Deep store: an archival storage system architecture[C]//Proceedings of the 21st International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2005: 804-815.
|
15 |
周玉坤, 冯丹, 夏文, 等. 面向数据去重的基于二次哈希的收敛加密策略. 计算机工程与科学, 2016, 38 (9): 1755- 1762.
|
|
ZHOU Y K , FENG D , XIA W , et al. A twice-hash based convergent encryption strategy for data deduplication. Computer Engineering & Science, 2016, 38 (9): 1755- 1762.
|
16 |
|
17 |
DOUGLIS F, IYENGAR A. Application-specific delta-encoding via resemblance detection[C]//Proceedings of the 2003 USENIX Annual Technical Conference. San Antonio, USA: USENIX, 2003: 113-126.
|
18 |
BRODER A Z. On the resemblance and containment of documents[C]//Proceedings of SEQUENCES'97. Washington D. C., USA: IEEE Press: 21-29.
|
19 |
MACDONALD J. File system support for delta compression[D]. Berkeley, USA: University of California at Berkeley, 2000.
|
20 |
ZOU X Y, DENG C, XIA W, et al. Odess: speeding up resemblance detection for redundancy elimination by fast content-defined sampling[C]//Proceedings of the IEEE 37th International Conference on Data Engineering. Washington D. C., USA: IEEE Press, 2021: 480-491.
|
21 |
KARP R M , RABIN M O . Efficient randomized pattern-matching algorithms. IBM Journal of Research and Development, 1987, 31 (2): 249- 260.
doi: 10.1147/rd.312.0249
|
22 |
XIA W , JIANG H , FENG D , et al. Ddelta: a deduplication-inspired fast delta compression approach. Performance Evaluation, 2014, 79, 258- 272.
doi: 10.1016/j.peva.2014.07.016
|
23 |
NI F, JIANG S. RapidCDC: leveraging duplicate locality to accelerate chunking in CDC-based deduplication systems[C]//Proceedings of the ACM Symposium on Cloud Computing. New York, USA: ACM Press, 2019: 220-232.
|
24 |
WAN B, PU L F, ZOU X Y, et al. SuperCDC: a hybrid design of high-performance content-defined chunking for fast deduplication[C]//Proceedings of the IEEE 40th International Conference on Computer Design. Washington D. C., USA: IEEE Press, 2022: 170-178.
|
25 |
CONSTANTINESCU C, LU M H. Quick estimation of data compression and de-duplication for large storage systems[C]//Proceedings of the 1st International Conference on Data Compression, Communications and Processing. Washington D. C., USA: IEEE Press, 2011: 98-102.
|
26 |
KATTAN A, POLI R. Genetic-programming based prediction of data compression saving[C]//Proceedings of International Conference on Artificial Evolution. Berlin, Germany: Springer, 2010: 182-193.
|
27 |
|
28 |
ZIV J , LEMPEL A . A universal algorithm for sequential data compression. IEEE Transactions on Information Theory, 1977, 23 (3): 337- 343.
doi: 10.1109/TIT.1977.1055714
|
29 |
ZIV J , LEMPEL A . Compression of individual sequences via variable-rate coding. IEEE Transactions on Information Theory, 1978, 24 (5): 530- 536.
doi: 10.1109/TIT.1978.1055934
|