1 |
VALIANT L G. A bridging model for parallel computation. Communications of the ACM, 1990, 33(8): 103- 111.
doi: 10.1145/79173.79181
|
2 |
|
3 |
WIENKE S, SPRINGER P, TERBOVEN C, et al. OpenACC—first experiences with real-world applications[M]//KAKLAMANIS C, PAPATHEODOROU T, SPIRAKIS P G. Euro-Par 2012 parallel processing. Berlin, Germany: Springer, 2012: 859-870.
|
4 |
BLUMOFE R D, JOERG C F, KUSZMAUL B C, et al. Cilk: an efficient multithreaded runtime system. ACM SIGPLAN Notices, 1995, 30(8): 207- 216.
doi: 10.1145/209937.209958
|
5 |
DURAN A, FERRER R, AYGUADÉ E, et al. A proposal to extend the OpenMP tasking model with dependent tasks. International Journal of Parallel Programming, 2009, 37(3): 292- 305.
doi: 10.1007/s10766-009-0101-1
|
6 |
VOSS M, ASENJO ZEGERS R, REINDERS J. Pro TBB: C++ parallel programming with threading building blocks. Berkeley, USA: Apress, 2019.
|
7 |
REINDERS J, ASHBAUGH B, BRODMAN J, et al. Data parallel C++: mastering DPC++ for programming of heterogeneous systems using C++ and SYCL. Berkeley, USA: Apress, 2021.
|
8 |
HALBWACHS N, CASPI P, RAYMOND P, et al. The synchronous data flow programming language LUSTRE. Proceedings of the IEEE, 1991, 79(9): 1305- 1320.
doi: 10.1109/5.97300
|
9 |
LEE E A, MESSERSCHMITT D G. Synchronous data flow. Proceedings of the IEEE, 1987, 75(9): 1235- 1245.
doi: 10.1109/PROC.1987.13876
|
10 |
LEE E A, MESSERSCHMITT D G. Static scheduling of synchronous data flow programs for digital signal processing. IEEE Transactions on Computers, 1987, 36(1): 24- 35.
|
11 |
RASKAR S. Dataflow software pipelining for Codelet model using hardware-software co-design[D]. Newark, USA: University of Delaware, 2021.
|
12 |
PEI S W, WANG J K, CUI W Y, et al. Codelet scheduling by genetic algorithm[C]//Proceedings of IEEE Trustcom/BigDataSE/ISPA. Washington D. C., USA: IEEE Press, 2017: 1492-1499.
|
13 |
JOSHUA S. DARTS: a runtime based on the Codelet execution model[D]. Newark, USA: University of Delaware, 2014.
|
14 |
ZUCKERMAN S, SUETTERLEIN J, KNAUERHASE R, et al. Using a "Codelet" program execution model for exascale machines: position paper[C]//Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era. New York, USA: ACM Press, 2011: 64-69.
|
15 |
CHEN C, WU Y, SUETTERLEIN J, et al. Automatic locality exploitation in the Codelet model[C]//Proceedings of the 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications. Washington D. C., USA: IEEE Press, 2013: 853-862.
|
16 |
ZUCKERMAN S, LANDWEHR A, LIVINGSTON K, et al. Toward a self-aware Codelet execution model[C]//Proceedings of the 4th Workshop on Data-Flow Execution Models for Extreme Scale Computing. Washington D. C., USA: IEEE Press, 2015: 26-29.
|
17 |
高光荣. 大数据的流动之美——数据流与大数据: 挑战与机遇. 中国计算机学会通讯, 2013, 9(12): 16- 18.
|
|
GAO G R. The joy of big data flow——dataflow and big data: challenges and opportunities. Communications of the CCF, 2013, 9(12): 16- 18.
|
18 |
LAUDERDALE C, KHAN R. Towards a Codelet-based runtime for exascale computing: position paper[C]//Proceedings of the 2nd International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era. New York, USA: ACM Press, 2012: 21-26.
|
19 |
HOQUE R, HERAULT T, BOSILCA G, et al. Dynamic task discovery in PaRSEC: a data-flow task-based runtime[C]//Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems. New York, USA: ACM Press, 2017: 1-8.
|
20 |
HUANG T W, LIN D L, LIN C X, et al. Taskflow: a lightweight parallel and heterogeneous task graph computing system. IEEE Transactions on Parallel and Distributed Systems, 2022, 33(6): 1303- 1320.
doi: 10.1109/TPDS.2021.3104255
|
21 |
HUANG T W, LIN Y B, LIN C X, et al. Cpp-taskflow: a general-purpose parallel task programming system at scale. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2021, 40(8): 1687- 1700.
doi: 10.1109/TCAD.2020.3025075
|
22 |
高希然. "神威·太湖之光"上任务图并行调度优化研究[D]. 青岛: 山东科技大学, 2018.
|
|
GAO X R. Research on parallel scheduling optimization of task graph on "Sunway TaihuLight"[D]. Qingdao: Shandong University of Science and Technology, 2018. (in Chinese)
|
23 |
SU Z C, CHEN J S, LIN H, et al. A dataflow-based runtime support on a 100P actual system[C]//Proceedings of IEEE International Symposium on Parallel and Distributed Processing with Applications and IEEE International Conference on Ubiquitous Computing and Communications. Washington D. C., USA: IEEE Press, 2018: 599-606.
|
24 |
苏志超. 神威·太湖之光上数据流编程模型的设计与实现[D]. 合肥: 中国科学技术大学, 2018.
|
|
SU Z C. Design and implementation of data stream programming model on Sunway TaihuLight[D]. Hefei: University of Science and Technology of China, 2018. (in Chinese)
|
25 |
|