Loading...
Please wait, while we are loading the content...
Similar Documents
The design of cost-effective stride-prefetching for modern processors
| Content Provider | CiteSeerX |
|---|---|
| Author | Al-Sukhni, Hassan Holt, James Connors, Daniel A. Snyder, Mike Smittle, Matt Grayson, Brian |
| Abstract | Abstract—Data prefetching of regular access patterns is an effective mechanism to hide the memory latency for modern microprocessors. However, to be included in an architecture design, prefetching systems must be cost-effective and have little impact to the microarchitecture. For example, while many proposed prefetching systems use the full program counter (PC) to help detect patterns with arbitrary strides, such systems are impractical and prohibitive. To overcome the issues related to using the entire PC for effective prefetching, this paper combines other instruction attributes with a small subset of the PC to help detect the regularity in program data accesses. Such detection is enabled by a finite state machine that resolves data stream allocation, maintains prefetch priorities, and manages prefetch run-ahead. The experimental results suggest that as little as 4 bits of the PC are sufficient to achieve within 1 % of the same prefetching effectiveness as using the full PC. I. |
| File Format | |
| Language | English |
| Access Restriction | Open |
| Subject Keyword | Modern Processor Cost-effective Stride-prefetching Prefetch Priority Full Program Counter Experimental Result Regular Access Pattern Entire Pc Memory Latency Modern Microprocessor Prefetching Effectiveness Finite State Machine Arbitrary Stride Data Stream Allocation Effective Prefetching Full Pc Abstract Data Prefetching Architecture Design Program Data Access Effective Mechanism Small Subset Little Impact |
| Content Type | Text |
| Resource Type | Technical Report |