Architecture and Code Optimization (TACO)


ACM Transactions on Architecture and Code Optimization (TACO), Volume 7 Issue 4, December 2010

Impact of high-level transformations within the ROCCC framework
Betul Buyukkurt, John Cortes, Jason Villarreal, Walid A. Najjar
Article No.: 17
DOI: 10.1145/1880043.1880044

Reconfigurable computers, where one or more FPGAs are attached to a conventional microprocessor, are promising platforms for code acceleration. Despite their advantages, programmability concerns and the lack of efficient design tools/compilers for...

DisIRer: Converting a retargetable compiler into a multiplatform binary translator
Yuan-Shin Hwang, Tzong-Yen Lin, Rong-Guey Chang
Article No.: 18
DOI: 10.1145/1880043.1880045

This article proposes an alternative yet effective way of constructing a multiplatform binary translator, by converting a retargetable compiler into a binary translator. The rationale is that a retargetable compiler usually parses source programs...

Federation: Boosting per-thread performance of throughput-oriented manycore architectures
Michael Boyer, David Tarjan, Kevin Skadron
Article No.: 19
DOI: 10.1145/1880043.1880046

Manycore architectures designed for parallel workloads are likely to use simple, highly multithreaded, in-order cores. This maximizes throughput, but only with enough threads to keep hardware utilized. For applications or phases with more limited...

Collective optimization: A practical collaborative approach
Grigori Fursin, Olivier Temam
Article No.: 20
DOI: 10.1145/1880043.1880047

Iterative optimization is a popular and efficient research approach to optimize programs using feedback-directed compilation. However, one of the key limitations that prevented widespread use in production compilers and day-to-day practice is the...

Understanding the behavior and implications of context switch misses
Fang Liu, Yan Solihin
Article No.: 21
DOI: 10.1145/1880043.1880048

One of the essential features in modern computer systems is context switching, which allows multiple threads of execution to time-share a limited number of processors. While very useful, context switching can introduce high performance overheads,...