TY - JOUR
T1 - A 45-nm 37.3 GOPS/W heterogeneous multi-core SOC with 16/32 bit instruction-set general-purpose core
AU - Nishii, Osamu
AU - Yuyama, Yoichi
AU - Ito, Masayuki
AU - Kiyoshige, Yoshikazu
AU - Nitta, Yusuke
AU - Ishikawa, Makoto
AU - Yamada, Tetsuya
AU - Miyakoshi, Junichi
AU - Wada, Yasutaka
AU - Kimura, Keiji
AU - Kasahara, Hironori
AU - Maejima, Hideo
PY - 2011/4
Y1 - 2011/4
N2 - We built a 12.4mm× 12.4mm, 45-nm CMOS, chip that integrates eight 648-MHz general purpose cores, two matrix processor (MX-2) cores, four flexible engine (FE) cores and media IP (VPU5) to establish heterogeneous multi-core chip architecture. The general purpose core had its IPC (instructions per cycle) performance enhanced by adding 32-bit instructions to the existing 16-bit fixed-length instruction set and executing up to two 32-bit instructions per cycle. Considering these five-toseven years of embedded LSI and increasing trend of access-master within LSI, we predict that the memory usage of single core will not exceed 32-bit physical area (i.e. 4 GB), but chip-total memory usage will exceed 4GB. Based on this prediction, the physical address was expanded from 32-bit to 40-bit. The fabricated chip was tested and a parallel operation of eight general purpose cores and four FE cores and eight data transfer units (DTU) is obtained on AAC (Advanced Audio Coding) encode processing.
AB - We built a 12.4mm× 12.4mm, 45-nm CMOS, chip that integrates eight 648-MHz general purpose cores, two matrix processor (MX-2) cores, four flexible engine (FE) cores and media IP (VPU5) to establish heterogeneous multi-core chip architecture. The general purpose core had its IPC (instructions per cycle) performance enhanced by adding 32-bit instructions to the existing 16-bit fixed-length instruction set and executing up to two 32-bit instructions per cycle. Considering these five-toseven years of embedded LSI and increasing trend of access-master within LSI, we predict that the memory usage of single core will not exceed 32-bit physical area (i.e. 4 GB), but chip-total memory usage will exceed 4GB. Based on this prediction, the physical address was expanded from 32-bit to 40-bit. The fabricated chip was tested and a parallel operation of eight general purpose cores and four FE cores and eight data transfer units (DTU) is obtained on AAC (Advanced Audio Coding) encode processing.
KW - Heterogeneous
KW - Instruction set
KW - MMU
UR - http://www.scopus.com/inward/record.url?scp=79953331729&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79953331729&partnerID=8YFLogxK
U2 - 10.1587/transele.E94.C.663
DO - 10.1587/transele.E94.C.663
M3 - Article
AN - SCOPUS:79953331729
SN - 0916-8524
VL - E94-C
SP - 663
EP - 669
JO - IEICE Transactions on Electronics
JF - IEICE Transactions on Electronics
IS - 4
ER -