CO C4 The Processor

RECALL: Problems with the single-cycle implementation

The longest delay determines the clock period.
- Critical path: the load instruction
- Instruction memory → register file → ALU → data memory → register file
It is not feasible to vary the clock period for different instructions.
This violates the design principle of making the common case fast.
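The critical-path argument can be sketched numerically. The per-unit delays below are illustrative assumptions (they do not come from these notes), and the names `DELAY_PS`, `PATHS`, and `path_delay` are hypothetical helpers:

```python
# Assumed per-unit delays in picoseconds (illustrative values only).
DELAY_PS = {
    "instr_mem": 200,
    "reg_read": 100,
    "alu": 200,
    "data_mem": 200,
    "reg_write": 100,
}

# Functional units each instruction class passes through on a single-cycle datapath.
PATHS = {
    "load":   ["instr_mem", "reg_read", "alu", "data_mem", "reg_write"],
    "store":  ["instr_mem", "reg_read", "alu", "data_mem"],
    "r_type": ["instr_mem", "reg_read", "alu", "reg_write"],
    "branch": ["instr_mem", "reg_read", "alu"],
}

def path_delay(instr):
    """Total delay of the units used by one instruction class."""
    return sum(DELAY_PS[unit] for unit in PATHS[instr])

def single_cycle_period():
    """The clock must be long enough for the slowest instruction."""
    return max(path_delay(instr) for instr in PATHS)

critical = max(PATHS, key=path_delay)
print(critical, single_cycle_period())
```

Under these assumed delays the load path is the longest, so every instruction, including a much shorter branch, pays the load's cycle time.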

Performance can be improved by pipelining.

Pipelining

Pipelining is an implementation technique in which multiple instructions are overlapped in execution, each occupying a different stage at the same time. It is a key technique for making CPUs fast.
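As a minimal sketch of what "overlapped in execution" means, the snippet below assumes the classic five-stage pipeline (IF, ID, EX, MEM, WB) with no stalls. The stage names and the helpers `stage_at` and `diagram` are assumptions for illustration:

```python
STAGES = ["IF", "ID", "EX", "MEM", "WB"]  # assumed five-stage pipeline

def stage_at(instr_index, cycle):
    """Stage occupied by instruction `instr_index` in `cycle` (both 0-based), or None."""
    s = cycle - instr_index  # instruction i enters the pipeline in cycle i
    return STAGES[s] if 0 <= s < len(STAGES) else None

def diagram(n_instr):
    """One text row per instruction, one column per clock cycle."""
    total_cycles = n_instr + len(STAGES) - 1
    rows = []
    for i in range(n_instr):
        cells = [(stage_at(i, c) or "").ljust(4) for c in range(total_cycles)]
        rows.append(f"i{i}: " + "".join(cells).rstrip())
    return rows

for row in diagram(3):
    print(row)
```

In cycle 2, for example, instruction 0 is in EX while instruction 1 is in ID and instruction 2 is in IF: three instructions are in flight at once, each using a different part of the datapath.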

Balanced stages make the speedup ideal:
$\rm Time\ between\ instructions_{pipelined} = \frac{Time\ between\ instructions_{nonpipelined}}{N_{stages}}$
The ideal speedup is $N_{stages}$, i.e. the number of pipeline stages.
If the stages are not balanced, the speedup is less.
The speedup comes from increased throughput. Latency (the time for each instruction) does not decrease, and may even increase.
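The throughput and latency claims can be checked with a small calculation. This is a sketch under stated assumptions: the stage delays are illustrative, the non-pipelined design spends the sum of all stage delays on every instruction, and the pipelined clock is set by the slowest stage. `pipeline_timing` is a hypothetical helper:

```python
def pipeline_timing(stage_delays_ps, n_instr):
    """Compare a non-pipelined and a pipelined design with the same stages."""
    n_stages = len(stage_delays_ps)
    nonpipelined_period = sum(stage_delays_ps)  # one long cycle per instruction
    pipelined_period = max(stage_delays_ps)     # clock set by the slowest stage
    total_nonpipelined = n_instr * nonpipelined_period
    # The first instruction needs n_stages cycles; each later one finishes a cycle after.
    total_pipelined = (n_stages + n_instr - 1) * pipelined_period
    return {
        "speedup": total_nonpipelined / total_pipelined,
        "latency_nonpipelined": nonpipelined_period,
        "latency_pipelined": n_stages * pipelined_period,
    }

balanced = pipeline_timing([200, 200, 200, 200, 200], 1_000_000)
unbalanced = pipeline_timing([200, 100, 200, 200, 100], 1_000_000)
print(balanced)
print(unbalanced)
```

With balanced stages the speedup approaches the stage count of 5 for a long instruction stream. With the unbalanced stages it approaches only 4, and the per-instruction latency rises from 800 ps to 1000 ps because every stage now takes a full 200 ps cycle.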

Instruction set design affects the complexity of the pipeline implementation.
The RISC-V ISA is designed for pipelining:

All instructions have a fixed length of 32 bits
- Easier to fetch and decode in one cycle
- c.f. x86: 1- to 15-byte instructions
Few and regular instruction formats
- Can decode and read registers in one step
Load/store addressing
- Can calculate the address in the 3rd stage and access memory in the 4th stage
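To illustrate why fixed-length, regular formats make decoding cheap, here is a sketch that slices a 32-bit RISC-V instruction word into its fixed-position fields. The function name `decode_fields` and the two sample encodings are chosen for illustration. The bit positions follow the RV32I base formats:

```python
def decode_fields(instr):
    """Split a 32-bit RISC-V instruction word into its fixed-position fields."""
    return {
        "opcode": instr & 0x7F,          # bits 6..0
        "rd":     (instr >> 7) & 0x1F,   # bits 11..7
        "funct3": (instr >> 12) & 0x7,   # bits 14..12
        "rs1":    (instr >> 15) & 0x1F,  # bits 19..15
        "rs2":    (instr >> 20) & 0x1F,  # bits 24..20
        "funct7": (instr >> 25) & 0x7F,  # bits 31..25
    }

ADD_X1_X2_X3 = 0x003100B3  # add x1, x2, x3  (R-type)
LW_X5_8_X6 = 0x00832283    # lw  x5, 8(x6)   (I-type)

add_fields = decode_fields(ADD_X1_X2_X3)
lw_fields = decode_fields(LW_X5_8_X6)
print(add_fields)
print(lw_fields["rs1"], lw_fields["rd"])
```

The `rs1` and `rd` fields sit in the same bit positions in both formats, so the register file can be read before the instruction type is fully known. For the I-type `lw`, bits 31..20 hold the immediate, so the `rs2` and `funct7` slices are not meaningful there.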

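In the pipelined datapath, information normally flows from left to right. The two right-to-left flows (the write-back to the register file and the selection of the next PC) lead to hazards.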

To keep the data of different instructions from mixing, pipeline registers (latches) are placed between stages to hold the information produced in the previous cycle.
However, a pipeline cannot be divided into arbitrarily fine (shorter in time) stages: some computations do not split any further, and the latches are not free, since they cost area and add delay.

$\rm Cycle_{Machine} \ge latency_{latch} + clock\ skew$
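The bound above implies diminishing returns from deeper pipelines. The sketch below assumes, optimistically, that the logic divides perfectly evenly across stages. The delay values and the helpers `cycle_time_ps` and `speedup` are illustrative assumptions:

```python
def cycle_time_ps(total_logic_ps, n_stages, latch_ps, skew_ps):
    """Cycle time when the logic splits evenly and each stage pays latch + skew overhead."""
    return total_logic_ps / n_stages + latch_ps + skew_ps

def speedup(total_logic_ps, n_stages, latch_ps, skew_ps):
    """Throughput gain over an unpipelined design taking total_logic_ps per instruction."""
    return total_logic_ps / cycle_time_ps(total_logic_ps, n_stages, latch_ps, skew_ps)

# Assumed values: 1000 ps of logic, 20 ps latch latency, 10 ps clock skew.
for n in (5, 10, 50, 500):
    print(n, cycle_time_ps(1000, n, 20, 10), round(speedup(1000, n, 20, 10), 2))
```

However many stages are added, the cycle time never drops below the 30 ps of latch latency plus skew, so the speedup stays below 1000 / 30 even in this idealized model.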


http://example.com/2023/04/26/CO-5/

Author

Tekhne Chen

Posted on

April 26, 2023
