TY - GEN
T1 - Exploring the problem of GPU programming for data-intensive applications
T2 - 5th Symposium on Information and Communication Technology, SoICT 2014
AU - Kitsukawa, Yuki
AU - Hirabayashi, Manato
AU - Kato, Shinpei
AU - Edahiro, Masato
PY - 2014/12/4
Y1 - 2014/12/4
N2 - Recently, General-Purpose Computing on Graphics Processing Units (GPGPU) has been used to reduce the processing time of various applications, but the degree of acceleration by the Graphics Processing Unit (GPU) depends on the application. This study focuses on data analysis as an application example of GPGPU, specifically, the design and implementation of GPGPU computation libraries for data-intensive workloads. The effects of efficient memory allocation and high-speed read-only memories on the execution time are evaluated. In addition to employing a single GPU, the scalability using multiple GPUs is also evaluated. Compared to a Central Processing Unit (CPU) alone, the memory allocation method reduces the execution time for memory copies by approximately 60% when a GPU is used, while utilizing read-only memories results in an approximately 20% reduction in the overall program execution time. Moreover, expanding the number of GPUs from one to four reduces the execution time by approximately 10%.
KW - GPGPU
KW - GPU
KW - Many-core
KW - Parallelization
UR - http://www.scopus.com/inward/record.url?scp=84962600997&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84962600997&partnerID=8YFLogxK
U2 - 10.1145/2676585.2676616
DO - 10.1145/2676585.2676616
M3 - Conference contribution
AN - SCOPUS:84962600997
T3 - ACM International Conference Proceeding Series
SP - 256
EP - 262
BT - Proceedings of the 5th Symposium on Information and Communication Technology, SoICT 2014
PB - Association for Computing Machinery
Y2 - 4 December 2014 through 5 December 2014
ER -