Static Coarse Grain Task Scheduling with Cache Optimization Using OpenMP

Hirofumi Nakano, Kazuhisa Ishizaka, Motoki Obata, Keiji Kimura, Hironori Kasahara

Research output: Contribution to journal › Article

2 Citations (Scopus)

Abstract

Effective use of cache memory is becoming more important as the gap between processor speed and memory access speed widens. Multigrain parallelism is likewise becoming more important for improving effective performance beyond the limits of loop-iteration-level parallelism. Considering these factors, this paper proposes a static scheduling scheme for coarse grain tasks that takes cache optimization into account. After tasks and data are decomposed at compile time with the cache size in mind, the proposed scheme schedules coarse grain tasks to threads so that data shared among coarse grain tasks can be passed via the cache. The scheme is implemented in the OSCAR Fortran multigrain parallelizing compiler and evaluated on a Sun Ultra80 four-processor SMP workstation using Swim and Tomcatv from the SPEC fp 95 benchmarks. On 4 processors, the proposed scheme gives a 4.56 times speedup for Swim and a 2.37 times speedup for Tomcatv against the Sun Forte HPC Ver. 6 update 1 loop parallelizing compiler.
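The idea of keeping tasks that share data on the same thread can be illustrated with a small sketch. This is not the OSCAR compiler's algorithm; it is a simplified greedy heuristic under stated assumptions: each task carries a hypothetical cost and a hypothetical "data group" label (tasks in one group touch the same cache-sized chunk of data), and whole groups are assigned to the least-loaded thread so that their tasks run back to back. The names `schedule`, `TASKS`, and the task labels are invented for illustration.

```python
from collections import defaultdict


def schedule(tasks, num_threads):
    """Statically assign tasks to threads, one whole data group at a time.

    tasks: list of (name, cost, group) tuples (all hypothetical inputs).
    Returns (plan, loads): per-thread ordered task names and total costs.
    """
    # Collect the tasks of each data group, preserving their given order.
    groups = defaultdict(list)
    for name, cost, group in tasks:
        groups[group].append((name, cost))

    # Place the most expensive groups first; break ties by group label.
    ordered = sorted(
        groups.items(),
        key=lambda item: (-sum(cost for _, cost in item[1]), item[0]),
    )

    loads = [0] * num_threads
    plan = [[] for _ in range(num_threads)]
    for _, members in ordered:
        # Pick the currently least-loaded thread (lowest index on ties).
        thread = loads.index(min(loads))
        # Tasks of one group run consecutively on one thread, so data
        # written by the first task may still be in cache for the next.
        plan[thread].extend(name for name, _ in members)
        loads[thread] += sum(cost for _, cost in members)
    return plan, loads


# Hypothetical workload: two loops split over data chunks A and B,
# plus two smaller independent tasks on chunks C and D.
TASKS = [
    ("loop1_a", 4, "A"), ("loop2_a", 3, "A"),
    ("loop1_b", 4, "B"), ("loop2_b", 3, "B"),
    ("loop1_c", 2, "C"), ("loop1_d", 2, "D"),
]

if __name__ == "__main__":
    plan, loads = schedule(TASKS, 2)
    for thread, names in enumerate(plan):
        print(f"thread {thread}: {names} (load {loads[thread]})")
```

In this sketch `loop1_a` and `loop2_a` always land on the same thread in consecutive positions, which is the property the abstract describes; a real scheme would also have to respect task dependences and the actual cache size.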

Original language: English
Pages (from-to): 211-223
Number of pages: 13
Journal: International Journal of Parallel Programming
Volume: 31
Issue number: 3
Publication status: Published - 2003 Jun 1

Keywords

  • Cache optimization
  • Coarse grain task parallelization
  • OpenMP
  • Scheduling algorithm

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Information Systems
