Architecture
Introduction
• AMD K7 is the first 7th-generation PC CPU. The first six generations
were the 8086, 80286, 80386, 80486, Pentium (AMD K5/K6) and
Pentium II (AMD K6-2/K6-3). It is designed to operate above
500 MHz.
• Its competitor, the Intel Pentium III, was released in the same
year, and the two processors are compared wherever possible
throughout the presentation.
Main Features
• Out-of-order, 3-way superscalar x86 microprocessor
• 9 independent execution pipelines, with a 10-stage
integer pipeline and a 15-stage FP pipeline:
– 3 Integer Execution Units
– 3 Address Calculation Units
– 3 Floating Point Execution Units
• 64 KB instruction and 64 KB data L1 caches
• Integrated L2 cache controller supporting up to 8 MB
• Extended 3DNow! instructions
Main Features
• K7 uses the Digital™ Alpha™ EV6 system bus
interface. This is probably the most important
architectural difference from the previous
generations. EV6 provides:
- Data transfer on both rising and falling clock edges,
doubling the effective bus speed
- Scalability beyond a 200 MHz clock (beyond 400 MHz effective bus speed)
- Highest bandwidth of that time:
Athlon, 100 MHz (x2): 1.60 GB/s
PIII, 133 MHz: 1.06 GB/s
- 72-bit data bus (64 data + 8 ECC)
- Independent address bus able to address 8 terabytes
- Independent snoop bus
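The bandwidth and address-space figures in the list above follow from simple arithmetic. The sketch below is only illustrative: the function names are hypothetical, 1 GB is taken as 10^9 bytes, only the 64 data bits of the bus are counted (the 8 ECC bits carry no payload), and the 43-bit address width is an assumption derived from the 8-terabyte figure rather than something stated on the slide.

```python
def peak_bandwidth_gb_s(clock_mhz, transfers_per_clock, data_bits=64):
    """Nominal peak bus bandwidth in GB/s (1 GB = 10**9 bytes)."""
    bytes_per_transfer = data_bits // 8
    return clock_mhz * 1e6 * transfers_per_clock * bytes_per_transfer / 1e9


def addressable_bytes(address_bits):
    """Size of the address space reachable with the given number of address bits."""
    return 2 ** address_bits


# EV6 on the Athlon: 100 MHz clock, data on both clock edges (2 transfers per clock).
athlon_bw = peak_bandwidth_gb_s(100, 2)
# Pentium III front-side bus: 133 MHz clock, one transfer per clock.
piii_bw = peak_bandwidth_gb_s(133, 1)

# Assumption: 8 terabytes corresponds to 43 address bits (2**43 bytes).
ASSUMED_ADDRESS_BITS = 43
address_space_tb = addressable_bytes(ASSUMED_ADDRESS_BITS) / 2 ** 40

print(f"Athlon EV6: {athlon_bw:.2f} GB/s")
print(f"Pentium III: {piii_bw:.2f} GB/s")
print(f"Address space: {address_space_tb:.0f} TB")
```

The doubling from using both clock edges is what puts a 100 MHz EV6 bus ahead of a 133 MHz single-data-rate bus.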
Main Features – EV6 cont.
- Low-voltage signaling for low-cost motherboard
implementations: motherboards with GeForce, Dolby and
Ethernet are available below $80.
• AMD chose the first option, which achieves most of the performance
benefit with significantly less area and power. Since no additional
registers are used, no new state is introduced, which preserves
compatibility with existing OSs.
• The second option is implemented in the PowerPC G4 under the name
AltiVec.
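To make the "no additional registers" point concrete, here is a minimal sketch of how two single-precision values can share one existing 64-bit register, which is the data layout 3DNow! uses. It models the register as a Python integer; the helper names `pack2`, `unpack2` and `packed_add` are hypothetical, and the lane order is an assumption chosen for illustration, not a statement about the hardware encoding.

```python
import struct
from typing import Tuple


def pack2(lo: float, hi: float) -> int:
    """Pack two single-precision floats into one 64-bit value (low lane first)."""
    return int.from_bytes(struct.pack("<2f", lo, hi), "little")


def unpack2(reg: int) -> Tuple[float, float]:
    """Split a 64-bit value back into its two single-precision lanes."""
    return struct.unpack("<2f", reg.to_bytes(8, "little"))


def packed_add(a: int, b: int) -> int:
    """Lane-wise add of two packed registers, in the spirit of a packed FP add."""
    a_lo, a_hi = unpack2(a)
    b_lo, b_hi = unpack2(b)
    return pack2(a_lo + b_lo, a_hi + b_hi)


x = pack2(1.5, 2.0)
y = pack2(0.25, 4.0)
result = unpack2(packed_add(x, y))
print(result)
```

Because both operands fit in a register width the processor already has, an operating system that saves and restores the existing register state needs no changes.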
3DNow! Technology
• Instead of division and square root, reciprocal and reciprocal square
root are implemented in AMD K7, since they are encountered more often in
multimedia applications.
• MMX and 3DNow! instructions have a latency of at most 4 cycles (reached
only by 3DNow! Add and Mul) and a throughput of 1 cycle. This is much
faster than single-precision FP division (13 cycles) and sqrt (16 cycles).
• Using 2 FP pipelines simultaneously, the maximum throughput is 4
FP operations per cycle.
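The points above can be illustrated with a short sketch of how a quotient is obtained from a reciprocal estimate refined by Newton-Raphson steps, and how the peak figure of 4 FP operations per cycle arises. This is a generic textbook formulation, not the K7's actual lookup tables or iteration count; the seed values, step count and function names are hypothetical, and the 2 lanes per pipeline assume two single-precision values per 64-bit register.

```python
def refine_reciprocal(b, x):
    """One Newton-Raphson step toward 1/b, starting from estimate x."""
    return x * (2.0 - b * x)


def refine_rsqrt(b, y):
    """One Newton-Raphson step toward 1/sqrt(b), starting from estimate y."""
    return y * (1.5 - 0.5 * b * y * y)


def divide_via_reciprocal(a, b, seed, steps=3):
    """Approximate a / b as a * (1/b), refining a rough seed for 1/b."""
    x = seed
    for _ in range(steps):
        x = refine_reciprocal(b, x)
    return a * x


def rsqrt(b, seed, steps=3):
    """Approximate 1/sqrt(b) by refining a rough seed."""
    y = seed
    for _ in range(steps):
        y = refine_rsqrt(b, y)
    return y


def peak_fp_ops_per_cycle(pipelines=2, lanes_per_pipeline=2):
    """Peak rate when every pipeline issues one packed operation per cycle."""
    return pipelines * lanes_per_pipeline


quotient = divide_via_reciprocal(10.0, 3.0, seed=0.3)  # seed is a rough guess for 1/3
inv_root = rsqrt(4.0, seed=0.45)                       # seed is a rough guess for 1/2
print(quotient, inv_root, peak_fp_ops_per_cycle())
```

Each refinement step roughly squares the relative error, so a coarse estimate reaches single-precision accuracy in very few multiply and subtract operations, all of which run on the fast pipelines.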
Integer Performance of AMD Athlon
Floating Point Performance of AMD Athlon
Conclusion
• As the first 7th-generation CPU, AMD K7
was a major leap forward in CPU
history.
• It had both performance and cost advantages
over the Intel PIII, and it started
the competition that led to today's
AMD Athlon XP and P4 processors.
Questions?