Hybrid analytical modeling of pending cache hits, data prefetching, and MSHRs
Xi E. Chen, Tor M. Aamodt
Article No.: 10
This article proposes techniques to predict the performance impact of pending cache hits, hardware prefetching, and miss status holding register resources on superscalar microprocessors using hybrid analytical models. The proposed models focus on...
CATCH: A mechanism for dynamically detecting cache-content-duplication in instruction caches
Marios Kleanthous, Yiannakis Sazeides
Article No.: 11
Cache-content-duplication (CCD) occurs when there is a miss for a block in a cache and the entire content of the missed block is already in the cache in a block with a different tag. Caches aware of content-duplication can have lower miss penalty...
Managing SMT resource usage through speculative instruction window weighting
Hans Vandierendonck, André Seznec
Article No.: 12
Simultaneous multithreading processors dynamically share processor resources between multiple threads. In general, shared SMT resources may be managed explicitly, for instance, by dynamically setting queue occupation bounds for each thread as in...
As technology continues to shrink, reducing leakage is critical to achieving energy efficiency. Previous studies on low-power GPUs (Graphics Processing Units) focused on techniques for dynamic power reduction, such as DVFS (Dynamic Voltage and...
In this article, we propose a new cache replacement policy that makes the replacement decision based on the reuse information of the cache lines and the requested data. We present the architectural support and evaluate the performance of our...
Evaluating placement policies for managing capacity sharing in CMP architectures with private caches
Ahmad Samih, Yan Solihin, Anil Krishna
Article No.: 15
Chip Multiprocessors (CMP) with distributed L2 caches suffer from a cache fragmentation problem; some caches may be overutilized while others may be underutilized. To avoid such fragmentation, researchers have proposed capacity sharing mechanisms...
Maintaining performance on power gating of microprocessor functional units by using a predictive pre-wakeup strategy
Chang-Ching Yeh, Kuei-Chung Chang, Tien-Fu Chen, Chingwei Yeh
Article No.: 16
Power gating is an effective technique for reducing leakage power in deep submicron CMOS technology. Microarchitectural techniques for power gating of functional units have been developed by detecting suitable idle regions and turning them off to...
DEFCAM: A design and evaluation framework for defect-tolerant cache memories
Hyunjin Lee, Sangyeun Cho, Bruce R. Childers
Article No.: 17
Advances in deep submicron technology call for a careful review of existing cache designs and design practices in terms of yield, area, and performance. This article presents a Design and Evaluation Framework for defect-tolerant Cache Memories...