1 Qichen Chen, "smCompactor: A Workload-aware Fine-grained Resource Management Framework for GPGPUs" 2021
2 Xu, Qiumin, "Warped-slicer: efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming" IEEE 2016
3 Yijie Huangfu, "Warp-Based Load/Store Reordering to Improve GPU Time Predictability" 11 (11): 58-68, 2017
4 Wang, Zhenning, "Simultaneous multikernel GPU: Multi-tasking throughput processors via fine-grained sharing" IEEE 2016
5 Che, Shuai, "Rodinia: A benchmark suite for heterogeneous computing" IEEE 2009
6 Stratton, John A., "Parboil: A revised benchmark suite for scientific and commercial throughput computing" Center for Reliable and High-Performance Computing 2012
7 "NVIDIA profiler"
8 "NVIDIA Multi Process Service (MPS)"
9 "NVIDIA Hyper-Q technology"
10 "NVIDIA CUDA Sample"
11 Ukidave, Yash, "Mystic: Predictive scheduling for GPU based cloud servers using machine learning" IEEE 2016
12 Zhao, Xia, "HSM: A Hybrid Slowdown Model for Multitasking GPUs" 2020
13 Kim, Gwang Bok, "Latency Hiding Based Warp Scheduling for Improving GPU Performance" Korea Society of Computer and Information 24 (24): 1-9, 2019
14 Bao, Yixin, "Deep Learning-based Job Placement in Distributed Machine Learning Clusters" IEEE 2019