1 |
WANG J J, YU J Y, ZHAI R, et al. GMPR: a two-phase heuristic algorithm for virtual machine placement in large-scale cloud data centers. IEEE Systems Journal, 2022, 17(1): 1419-1430.
|
2 |
DING J M, LI H B, DENG B, et al. Fast mining algorithm of frequent itemset based on Spark. Journal of Software, 2023, 34(5): 2446-2464. (in Chinese)
|
3 |
SALLOUM S, DAUTOV R, CHEN X J, et al. Big data analytics on Apache Spark. International Journal of Data Science and Analytics, 2016, 1(3): 145-164.
|
4 |
AHMED N, BARCZAK A L C, SUSNJAK T, et al. A comprehensive performance analysis of Apache Hadoop and Apache Spark for large scale data sets using HiBench. Journal of Big Data, 2020, 7(1): 110.
|
5 |
QIAN W J, SHEN Q N, WU P F, et al. Research progress on privacy-preserving techniques in big data computing environment. Chinese Journal of Computers, 2022, 45(4): 669-701. (in Chinese)
|
6 |
KANG M, LEE J G. An experimental analysis of limitations of MapReduce for iterative algorithms on Spark. Cluster Computing, 2017, 20(4): 3593-3604.
|
7 |
KANG M, LEE J G. Effect of garbage collection in iterative algorithms on Spark: an experimental analysis. The Journal of Supercomputing, 2020, 76(9): 7204-7218.
|
8 |
BIAN C, XIU W R, YU J. A data skew correction scheduling strategy of heterogeneous Spark cluster. Computer Engineering & Science, 2022, 44(4): 620-630. (in Chinese)
|
9 |
|
10 |
XIA L B, LIU X Y, JIANG X W, et al. Memory optimization method for parallel computing framework based on distributed dataset. Computer Engineering, 2023, 49(4): 43-51. (in Chinese)
doi: 10.19678/j.issn.1000-3428.0066025
|
11 |
|
12 |
|
13 |
|
14 |
|
15 |
|
16 |
|
17 |
SONG Y X, YU J Y, WANG J J, et al. Memory management optimization strategy in Spark framework based on less contention. The Journal of Supercomputing, 2023, 79(2): 1504-1525.
|
18 |
WANG B, TANG J, ZHANG R, et al. A task-aware fine-grained storage selection mechanism for in-memory big data computing frameworks. International Journal of Parallel Programming, 2021, 49(1): 25-50.
|
19 |
|
20 |
JIANG K, DU S F, ZHAO F, et al. Effective data management strategy and RDD weight cache replacement strategy in Spark. Computer Communications, 2022, 194: 66-85.
|
21 |
|
22 |
GENG Y Z, SHI X H, PEI C, et al. LCS: an efficient data eviction strategy for Spark. International Journal of Parallel Programming, 2017, 45(6): 1285-1297.
|
23 |
|
24 |
|
25 |
ZHAO Y, DONG J, LIU H W, et al. Improving cache management with redundant RDDs eviction in Spark. Computers, Materials & Continua, 2021, 68(1): 727-741.
|
26 |
LI H, JI S P, ZHONG H, et al. LPW: an efficient data-aware cache replacement strategy for Apache Spark. Science China Information Sciences, 2022, 66(1): 112104.
|