Recent from talks
Nothing was collected or created yet.
Pentium III
View on Wikipedia
| General information | |
|---|---|
| Launched | February 28, 1999 |
| Discontinued | April 23, 2004 (for desktop units) May 18, 2007 (for mobile units)[1] July 14, 2009 (discontinuation and end of life) |
| Marketed by | Intel |
| Designed by | Intel |
| Common manufacturer |
|
| Product code |
|
| Performance | |
| Max. CPU clock rate | 400 MHz to 1.4 GHz |
| FSB speeds | 100 MT/s to 133 MT/s |
| Cache | |
| L1 cache | 32 KB (16 KB data + 16 KB instructions) |
| L2 cache | 128–512 KB |
| Architecture and classification | |
| Technology node | 250 nm to 130 nm |
| Microarchitecture | P6 |
| Instruction set | IA-32 |
| Extensions | |
| Physical specifications | |
| Transistors |
|
| Cores |
|
| Sockets | |
| Products, models, variants | |
| Core names |
|
| Variant | |
| History | |
| Predecessor | Pentium II |
| Successor | Pentium 4 |
| Support status | |
| Unsupported | |
The Pentium III[2] (marketed as Intel Pentium III Processor, informally PIII or P3) brand refers to Intel's 32-bit x86 desktop and mobile CPUs based on the sixth-generation P6 microarchitecture introduced on February 28, 1999.[citation needed] The brand's initial processors were very similar to the earlier Pentium II-branded processors. The most notable differences were the addition of the Streaming SIMD Extensions (SSE) instruction set (to accelerate floating point and parallel calculations), and the introduction of a controversial serial number embedded in the chip during manufacturing.
Even after the release of the Pentium 4 in late 2000, the Pentium III continued to be produced with new models introduced up until early 2003. They were then discontinued in April 2004 for desktop units[3] and May 2007 for mobile units.[1] They were leaving of support and removed from the official price lists in July 2009.
Processor cores
[edit]Similarly to the Pentium II it superseded, the Pentium III was also accompanied by the Celeron brand for lower-end versions, and the Xeon for high-end (server and workstation) derivatives. The Pentium III was eventually superseded by the Pentium 4, but its Tualatin core also served as the basis for the Pentium M CPUs, which used many ideas from the P6 microarchitecture. Subsequently, it was the Pentium M microarchitecture of Pentium M branded CPUs, and not the NetBurst found in Pentium 4 processors, that formed the basis for Intel's energy-efficient Core microarchitecture of CPUs branded Core 2, Pentium Dual-Core, Celeron (Core), and Xeon.
| Intel Pentium III processor family | |||||||
|---|---|---|---|---|---|---|---|
| Standard logo (1999–2003) |
Desktop | Mobile logo (2001–2003) |
Mobile | ||||
| Code-named | Node | Date released | Code-named | Node | Date released | ||
| Katmai Coppermine Coppermine T Tualatin |
250 nm 180 nm 180 nm 130 nm |
February 1999 October 1999 June 2001 June 2001 |
Coppermine Tualatin |
180 nm 130 nm |
October 1999 July 2001 | ||
| List of desktop processors | List of mobile processors | ||||||
Katmai
[edit]

The first Pentium III variant was the Katmai (Intel product code 80525). It was a further development of the Deschutes Pentium II. The Pentium III saw an increase of 2 million transistors over the Pentium II. The differences were the addition of execution units and SSE instruction support, and an improved L1 cache controller[citation needed] (the L2 cache controller was left unchanged, as it would be fully redesigned for Coppermine anyway), which were responsible for the minor performance improvements over the "Deschutes" Pentium IIs. It was first released at speeds of 450 and 500 MHz on February 28, 1999. Two more versions were released: 550 MHz on May 17, 1999, and 600 MHz on August 2, 1999. On September 27, 1999, Intel released the 533B and 600B running at 533 & 600 MHz respectively. The 'B' suffix indicated that it featured a 133 MT/s FSB, instead of the 100 MT/s FSB of prior models.
The Katmai contains 9.5 million transistors, not including the 512 Kbytes L2 cache (which adds 25 million transistors), and has dimensions of 12.3 mm by 10.4 mm (128 mm2). It is fabricated in Intel's P856.5 process, a 250 nm complementary metal–oxide–semiconductor (CMOS) process with five levels of aluminum interconnect.[4] The Katmai used the same slot-based design as the Pentium II but with the newer Slot 1 Single Edge Contact Cartridge (SECC) 2 that allowed direct CPU core contact with the heat sink. There have been some early models of the Pentium III with 450 and 500 MHz packaged in an older SECC cartridge intended for original equipment manufacturers (OEMs).
A notable stepping level for enthusiasts was SL35D. This version of Katmai was officially rated for 450 MHz, but often contained cache chips for the 600 MHz model and thus usually can run at 600 MHz.
Coppermine
[edit]

The second version, codenamed Coppermine (Intel product code: 80526), was released on October 25, 1999, running at 500, 533, 550, 600, 650, 667, 700, and 733 MHz. From December 1999 to May 2000, Intel released Pentium IIIs running at speeds of 750, 800, 850, 866, 900, 933 and 1000 MHz (1 GHz). Both 100 MT/s FSB and 133 MT/s FSB models were made. For models that were already available with the same frequency, an "E" was appended to the model name to indicate cores using the new 180 nm fabrication process. An additional "B" was later appended to designate 133 MHz FSB models, resulting in an "EB" suffix. In overall performance, Coppermine had a small advantage over the Advanced Micro Devices (AMD) Athlons it was released against, which was reversed when AMD applied their own die shrink and added an on-die L2 cache to the Athlon. Athlon held the advantage in floating-point intensive code, while the Coppermine could perform better when SSE optimizations were used, but in practical terms there was little difference in how the two chips performed, clock-for-clock. However, AMD were able to clock the Athlon higher, reaching speeds of 1.2 GHz before the launch of the Pentium 4.
In performance, Coppermine arguably marked a bigger step than Katmai by introducing an on-chip L2 cache, which Intel names Advanced Transfer Cache (ATC). The ATC operates at the core clock rate and has a capacity of 256 KB, twice that of the on-chip cache formerly on Mendocino Celerons. It is eight-way set-associative and is accessed via a Double Quad Word Wide 256-bit bus, four times as wide as Katmai's. Further, latency was dropped to a quarter compared to Katmai. Another marketing term by Intel was Advanced System Buffering, which encompassed improvements to better take advantage of a 133 MT/s system bus. These include 6 fill buffers (vs. 4 on Katmai), 8 bus queue entries (vs. 4 on Katmai) and 4 write-back buffers (vs. 1 on Katmai).[5] Under competitive pressure from the AMD Athlon, Intel reworked the internals, finally removing some well-known pipeline stalls.[citation needed] As a result, applications affected by the stalls ran faster on Coppermine by up to 30%.[citation needed] The Coppermine contained 29 million transistors and was fabricated in a 180 nm process.
The Coppermine was available in 370-pin FC-PGA or FC-PGA2 for use with Socket 370, or in SECC2 for Slot 1 (all speeds except 900 and 1100). FC-PGA and Slot 1 Coppermine CPUs have an exposed die, however most higher frequency SKUs starting with the 866 MHz model were also produced in FC-PGA2 variants that feature an integrated heat spreader (IHS). This in itself did not improve thermal conductivity, since it added another layer of metal and thermal paste between the die and the heatsink, but it greatly assisted in holding the heatsink flat against the die. Earlier Coppermines without the IHS made heatsink mounting challenging.[6] If the heatsink was not situated flat against the die, heat transfer efficiency was greatly reduced. Some heatsink manufacturers began providing pads on their products, similar to what AMD did with the "Thunderbird" Athlon to ensure that the heatsink was mounted flatly. The enthusiast community went so far as to create shims to assist in maintaining a flat interface.[7]
A 1.13 GHz version (S-Spec SL4HH) was released in mid-2000 but famously recalled after a collaboration between HardOCP and Tom's Hardware[8] discovered various instabilities with the operation of the new CPU speed grade. The Coppermine core was unable to reliably reach the 1.13 GHz speed without various tweaks to the processor's microcode, effective cooling, higher voltage (1.75 V vs. 1.65 V), and specifically validated platforms.[8] Intel only officially supported the processor on its own VC820 i820-based motherboard, but even this motherboard displayed instability in the independent tests of the hardware review sites. In benchmarks that were stable, performance was shown to be sub-par, with the 1.13 GHz CPU equalling a 1.0 GHz model. Tom's Hardware attributed this performance deficit to relaxed tuning of the CPU and motherboard to improve stability.[9] Intel needed at least six months to resolve the problems using a new cD0 stepping and re-released 1.1 GHz and 1.13 GHz versions in 2001.
Microsoft's Xbox game console uses a variant of the Pentium III/Mobile Celeron family in a Micro-PGA2 form factor. The sSpec designator of the chips is SL5Sx, which makes it more similar to the Mobile Celeron Coppermine-128 processor. It shares with the Coppermine-128 Celeron its 128 KB L2 cache, and 180 nm process technology, but keeps the 8-way cache associativity from the Pentium III.[10]
Although its codename could give the impression that it used copper interconnects, in reality, its interconnects were aluminium.
Coppermine T
[edit]This revision is an intermediate step between Coppermine and Tualatin, with support for lower-voltage system logic present on the latter but core power within previously defined voltage specs of the former so it could work in older system boards.
Intel used the latest FC-PGA2 Coppermines with the cD0 stepping and modified them so that they worked with low voltage system bus operation at 1.25 V AGTL as well as normal 1.5 V AGTL+ signal levels, and would auto detect differential or single-ended clocking. This modification made them compatible to the latest generation Socket 370 boards supporting Tualatin CPUs while maintaining compatibility with older Socket 370 boards. The Coppermine T also had two way symmetrical multiprocessing capabilities, but only in Tualatin boards.
They can be distinguished from Tualatin processors by their part numbers, which include the digits "80533", e.g. the 1133 MHz SL5QK P/N is RK80533PZ006256, while the 1000 MHz SL5QJ P/N is RK80533PZ001256.[11]
Tualatin
[edit]

The third revision, Tualatin (80530), was a trial for Intel's new 130 nm process. Tualatin-based Pentium IIIs were released during 2001 until early 2002 at speeds of 1.0, 1.13, 1.2, 1.26, 1.33 and 1.4 GHz. A basic shrink of Coppermine, no new features were added, except for added data prefetch logic similar to Pentium 4 and Athlon XP for potentially better use of the L2 cache, although its use compared to these newer CPUs is limited due to the relatively smaller FSB bandwidth (FSB was still kept at 133 MHz).[12] Variants with 256 and 512 KB L2 cache were produced, the latter being dubbed Pentium III-S; this variant was mainly intended for low-power consumption servers and also exclusively featured SMP support within the Tualatin line.
Although the Socket 370 designation was kept, the use of 1.25 AGTL signaling in place of 1.5 V AGTL+ rendered prior motherboards incompatible.[12] This confusion carried over to the chipset naming, where only the B-stepping of the i815 chipset was compatible with Tualatin processors.[13] A new VRM guideline was also designed by Intel, version 8.5, which required finer voltage steps and debuted load line Vcore (in place of fixed voltage regardless of current on 8.4).[14][15][16] Some motherboard manufacturers would mark the change with blue sockets (instead of white), and were often also backwards compatible with Coppermine CPUs.
The Tualatin also formed the basis for the highly popular Pentium III-M mobile processor, which became Intel's front-line mobile chip (the Pentium 4 drew significantly more power, and so was not well-suited for this role) for the next two years. The chip offered a good balance between power consumption and performance, thus finding a place in both performance notebooks and the "thin and light" category.
The Tualatin-based Pentium III performed well in some applications compared to the fastest Willamette-based Pentium 4, and even the Thunderbird-based Athlons. Despite this, its appeal was limited due to the aforementioned incompatibility with existing systems, and Intel's only officially supported chipset for Tualatins, the i815, could only handle 512 MB RAM as opposed to 1 GB of registered RAM with the older, incompatible 440BX chipset. However, the enthusiast community found a way to run Tualatins on then-ubiquitous BX chipset based boards, although it was often a non-trivial task and required some degree of technical skills.
Tualatin-based Pentium III CPUs can usually be visually distinguished from Coppermine-based processors by the metal integrated heat-spreader (IHS) fixed on top of the package. However, the last models of Coppermine Pentium IIIs also featured the IHS — the integrated heat spreader is actually what distinguishes the FC-PGA2 package from the FC-PGA — both are for Socket 370 motherboards.[17]
Before the addition of the heat spreader, it was sometimes difficult to install a heatsink on a Pentium III. One had to be careful not to put force on the core at an angle because doing so would cause the edges and corners of the core to crack and could destroy the CPU. It was also sometimes difficult to achieve a flat mating of the CPU and heatsink surfaces, a factor of critical importance to good heat transfer. This became increasingly challenging with the Socket 370 CPUs, compared with their Slot 1 predecessors, because of the force required to mount a socket-based cooler and the narrower, 2-sided mounting mechanism (Slot 1 featured 4-point mounting). As such, and because the 130 nm Tualatin had an even smaller core surface area than the 180 nm Coppermine, Intel installed the metal heatspreader on Tualatin and all future desktop processors.
The Tualatin core was named after the Tualatin Valley and Tualatin River in Oregon, where Intel has large manufacturing and design facilities.
New features
[edit]Streaming SIMD Extensions
[edit]
Since Katmai was built in the same 250 nm process as Pentium II "Deschutes", it had to implement Streaming SIMD Extensions (SSE) using minimal silicon.[18] To achieve this goal, Intel implemented the 128-bit architecture by double-cycling the existing 64-bit data paths and by merging the SIMD-FP multiplier unit with the x87 scalar FPU multiplier into a single unit. To utilize the existing 64-bit data paths, Katmai issues each SIMD-FP instruction as two μops. To compensate partially for implementing only half of SSE's architectural width, Katmai implements the SIMD-FP adder as a separate unit on the second dispatch port. This organization allows one half of a SIMD multiply and one half of an independent SIMD add to be issued together bringing the peak throughput back to four floating point operations per cycle — at least for code with an even distribution of multiplies and adds.[4][19]
The issue was that Katmai's hardware-implementation contradicted the parallelism model implied by the SSE instruction-set. Programmers faced a code-scheduling dilemma: "Should the SSE-code be tuned for Katmai's limited execution resources, or should it be tuned for a future processor with more resources?" Katmai-specific SSE optimizations yielded the best possible performance from the Pentium III family but was suboptimal for Coppermine onwards as well as future Intel processors, such as the Pentium 4 and Core series.
Processor Serial Number
[edit]The Pentium III was the first x86 CPU to include a unique, retrievable identification number called Processor Serial Number (PSN). A Pentium III's PSN can be read by software[20] through the CPUID instruction if this feature has not been disabled through the BIOS.
On November 29, 1999, the Science and Technology Options Assessment (STOA) Panel of the European Parliament, following their report on electronic surveillance techniques asked parliamentary committee members to consider legal measures that would "prevent these chips from being installed in the computers of European citizens."[21]
Intel eventually removed the PSN feature from Tualatin-based Pentium IIIs, and the feature was absent in Pentium 4 and Pentium M.
A largely equivalent feature, the Protected Processor Identification Number (PPIN) was later added to x86 CPUs with little public notice, starting with Intel's Ivy Bridge architecture and compatible Zen 2 AMD CPUs. It is implemented as a set of model-specific registers and is useful for machine check exception handling.[22]
Hardware random number generator
[edit]A new feature was added to the Pentium III: a hardware-based random number generator.[23][24] This hardware RNG uses multiple oscillators that mix their signals; the resulting unusual waveform is then sampled at irregular intervals.[25]
Core specifications
[edit]| Code- name |
First Re- lease |
Pro- cess size |
Cache | Instruc- tions |
Package | VCore (V) | Clock Rate (MHz) |
Front side bus (MT/s) | |
|---|---|---|---|---|---|---|---|---|---|
| L1 (data + instr.) |
L2 | ||||||||
| Katmai | Feb 26 1999 |
250 nm | 16+16 KB | 512 KB, half speed |
MMX, SSE |
Slot 1 (SECC, SECC2) |
2.00 (<600 MHz) 2.05 (600 MHz) |
450, 500, 550, 600 (450, 500, 550, 600) |
100 |
| 533, 600 (533B, 600B) |
133 | ||||||||
| Copper- mine |
Oct 25 1999 |
180 nm | 256 KB, full speed |
Slot 1 (SECC2), Socket 370 (FC-PGA, FC-PGA2) |
1.60,1.65, 1.70,1.75 |
500,550,600,650,700,750, 800,850,900,1000,1100 (E-Models) |
100 | ||
| 533,600,667,733,800,866, 933,1000,1133 (EB-Models) |
133 | ||||||||
| Copper- mine T |
Aug 2000 |
Socket 370 (FC-PGA, FC-PGA2) |
1.75 | 800, 866, 933, 1000, 1133 | 133 | ||||
| Tualatin | June 2001 |
130 nm | 256 KB, full speed |
MMX, SSE, Hardware Prefetch |
Socket 370 (FC-PGA2) |
1.45/1.475 | 1000,1100,1200,1300,1400 (Celeron) | ||
| 1000,1133,1200,1333,1400 (Pentium III) | |||||||||
| 512 KB, full speed |
1133, 1266, 1400 (Pentium III-S) | ||||||||
| List of Intel Pentium III processors | |||||||||

See also
[edit]References
[edit]- ^ a b "Product Change Notification #104109-00" (PDF). Intel. May 14, 2004. Archived from the original (PDF) on July 19, 2004. Retrieved October 14, 2019.
- ^ Microprocessor Hall of Fame, Intel Corporation, archived from the original on April 6, 2008, retrieved August 11, 2007
- ^ "Product Change Notification #102839-00" (PDF). Intel. October 14, 2002. Archived from the original (PDF) on March 22, 2003. Retrieved October 14, 2019.
- ^ a b Diefendorff, Keith (March 8, 1999). "Pentium III = Pentium II + SSE: Internet SSE Architecture Boosts Multimedia Performance" (PDF). Microprocessor Report. 13 (3). Retrieved September 1, 2017.
- ^ Pabst, Thomas (October 25, 1999). "Coppermine's New Enhancements". Retrieved September 1, 2017.
- ^ "Alpha FC-PAL35T & POP66T Cooler Review". The Tech Zone. April 12, 2000. Archived from the original on June 27, 2006.
- ^ Verbist, Tim (December 3, 2000). "Copper Shims". Overclockers Online.
- ^ a b Pabst, Thomas (August 28, 2000). "Intel Admits Problems With Pentium III 1.13 GHz: Production and Shipments Halted". Tom's Hardware.
- ^ Pabst, Thomas (August 28, 2000). "Latest Update On Intel's 1.13 GHz Pentium III". Tom's Hardware.
- ^ "VHJ: More on the Xbox CPU". Van's Hardware Journal.
- ^ Intel Pentium III Coppermine-T core, retrieved July 8, 2010
- ^ a b Shimpi, Anand Lal (July 30, 2001). "Intel Pentium III 1.2GHz 0.13-micron Tualatin: The Celeron of the Future". Anandtech. Archived from the original on May 12, 2011. Retrieved April 5, 2018.
- ^ "B-Stepping With The I815/Solano - Last Passing Maneuver: Tualatin 1266 With 512 kB Versus Athlon And P4". Tom's Hardware. September 19, 2001. Retrieved April 5, 2018.
- ^ "Server Tualatin CPU review". iXBT Labs. Retrieved April 5, 2018.
- ^ "iXBT Labs Review - Tualatin based Intel Celeron 1.2 GHz for Socket 370". iXBT Labs. Retrieved April 5, 2018.
- ^ VRM 8.5 DC–DC Converter Design Guidelines. Intel. July 2001.
- ^ Lal Shimpi, Anand. Intel Pentium III 1.2 GHz 0.13-micron Tualatin: The Celeron of the Future, Anandtech, July 30, 2001.
- ^ Jagannath Keshava, Vladimir Pentkovski (1999). "Pentium III Processor Implementation Tradeoffs" (PDF). Intel Technology Journal. Retrieved September 1, 2017.
- ^ "Intel Architecture Optimization - Reference Manual" (PDF). 1999. Retrieved September 1, 2017.
- ^ "P3 Serial Un-Support Page".
- ^ "Advisory group asks EU to consider Pentium III ban". CNN. November 29, 1999.
- ^ Larabel, Michael (March 19, 2020). "AMD Plumbing Linux Support For Reading the CPU's Protected Processor Identification Number (PPIN)". Phoronix. Retrieved March 20, 2020.
- ^ Robert Moscowitz (July 12, 1999). "Privacy's Random Nature". Network Computing.
- ^ "Hardwiring Security". Wired. January 1999.
- ^ Terry Ritter (January 21, 1999). "The Pentium III RNG".
External links
[edit]Pentium III
View on GrokipediaDevelopment and Release History
Origins and Design Goals
The Pentium III processor emerged as a direct successor to the Pentium II within Intel's sixth-generation P6 microarchitecture family, with development centered on the Katmai core to extend the Deschutes core's capabilities. Originating in the late 1990s amid rising demand for multimedia computing, the design retained the Pentium II's Slot 1 packaging and back-side L2 cache configuration while incorporating architectural modifications to address limitations in handling internet-era workloads. This evolution was necessitated by competitive pressures and the shift toward data-intensive applications, building on the P6 pipeline's established efficiency without a full redesign.[7] Key design objectives prioritized the addition of Streaming SIMD Extensions (SSE), initially branded as Internet SSE, to enhance single-instruction, multiple-data operations for floating-point and packed integer computations, targeting accelerations in streaming media, 3D rendering, audio/video processing, and advanced imaging. Intel engineers aimed to integrate these 128-bit SIMD capabilities by adapting the existing 64-bit data paths through double-cycling and merging SIMD floating-point adders with multimedia extensions, thereby enabling richer internet experiences without fundamentally altering the core execution units.[8][9][10] To balance performance gains with economic viability, development goals emphasized constraining the die size increase to approximately 10% over the Pentium II—adding about 2 million transistors primarily for SSE support—while preserving a frequency scaling roadmap comparable to prior models, thus minimizing manufacturing costs and power demands. This approach allowed for rapid production ramp-up, positioning the Pentium III as an optimized platform for media-rich personal computing both online and offline, as articulated in Intel's launch strategy. The inclusion of a unique Processor Serial Number (PSN) further supported goals for secure e-commerce identification and personalized services, though it introduced later debates on privacy implications.[1][11][8]Launch and Production Timeline
The Intel Pentium III processor was launched on February 26, 1999, with the initial Katmai core models available at clock speeds of 450 MHz and 500 MHz in the Single Edge Connector Cartridge 2 (SECC2) package for Slot 1 motherboards.[1] The 550 MHz variant followed in the second quarter of 1999.[1] These processors were fabricated using a 250 nm CMOS process and featured 512 KB of L2 cache running at half the core frequency.[12] Katmai production spanned from the launch through September 1999, during which Intel released higher-speed variants up to 600 MHz, including models with 1 MB L2 cache in the 600EB variant.[12] In October 1999, Intel introduced the Coppermine core, shifting to a 180 nm process and Flip-Chip Pin Grid Array (FCPGA) packaging compatible with Socket 370, starting at 500 MHz with full-speed on-die 256 KB L2 cache.[4] Coppermine production continued into 2001, encompassing clock speeds from 500 MHz to 1.13 GHz and supporting both Slot 1 via adapters and Socket 370.[6] An intermediate Coppermine T core variant appeared in 2001, retaining the 180 nm process but optimized for Socket 370 with minor efficiency improvements.[4] The final major revision, the Tualatin core, debuted in 2001 on a 130 nm process, offering 512 KB L2 cache and speeds up to 1.4 GHz, primarily for Socket 370 systems and extending into early 2002.[13] Desktop Pentium III production phased out by 2003 as Intel shifted focus to the Pentium 4, though mobile variants persisted longer until around 2004.[2]Microarchitecture and Core Variants
Katmai Core (1999)
The Katmai core served as the debut microarchitecture for Intel's Pentium III processor line, retaining the sixth-generation P6 design foundation from the preceding Pentium II Deschutes core while introducing key enhancements. Fabricated on a 0.25 μm CMOS process with five layers of aluminum interconnect, it encompassed 9.5 million transistors in the primary die, excluding the separate L2 cache components.[14][7] Initial Katmai processors launched on February 26, 1999, at clock speeds of 450 MHz and 500 MHz, paired with a 100 MHz front-side bus (FSB) and operating at core voltages of approximately 2.0 V.[15][7] Subsequent variants extended frequencies to 550 MHz in May 1999 and 600 MHz by August 1999, with later models supporting a 133 MHz FSB for improved bandwidth.[12][16] The core's principal innovation over its predecessor lay in the integration of Streaming SIMD Extensions (SSE), comprising 70 new instructions that enabled 128-bit packed single-precision floating-point processing across four elements simultaneously, aimed at accelerating multimedia and 3D graphics workloads.[7] This addition necessitated minimal die area adjustments due to the shared manufacturing process, preserving compatibility with existing Slot 1 motherboards via the Single Edge Connector Cartridge 2 (SECC2) packaging.[7] Katmai processors featured 16 KB of L1 instruction cache and 16 KB of L1 data cache, alongside a 512 KB unified off-die L2 cache running at half the core clock speed to balance performance and power efficiency.[17] The design supported MMX instructions inherited from prior generations, but SSE required software optimization for meaningful gains, as initial application support was limited.[7] Production emphasized backward compatibility, allowing Katmai to function as a near drop-in upgrade for Pentium II systems, though thermal design power ranged from 25 W to 32 W depending on speed.[18]Coppermine Core (2000)
The Coppermine core, the second generation microarchitecture variant of the Pentium III processor, was fabricated on a 0.18 micrometer process node, enabling the integration of 256 KB of L2 cache directly on the processor die operating at full core clock speed. This design shift from the Katmai core's external, half-speed L2 cache significantly reduced latency and improved overall performance, particularly in cache-intensive workloads.[19] Initial Coppermine processors launched on October 25, 1999, with clock speeds ranging from 500 MHz to 733 MHz, supporting front-side bus frequencies of 100 MHz. Subsequent releases in 2000 expanded the lineup to higher frequencies, including models up to 1.13 GHz by mid-year, all featuring 28 million transistors and compatibility with Socket 370 via the FC-PGA package for desktop variants.[3] The core retained the Katmai pipeline with 9 execution units, including support for Streaming SIMD Extensions (SSE), but benefited from the smaller process for lower power consumption and higher clock scalability.[20] Early Coppermine models initially used Slot 1 packaging, but the transition to Socket 370 facilitated smaller form factors and easier motherboard integration. Performance gains over Katmai were estimated at 10-20% in integer and floating-point benchmarks due to the unified on-die cache, though real-world benefits varied by application.[21] Intel addressed initial yield issues from the process shrink through revisions, enabling sustained production into 2001 before the Coppermine T refinement.[22] The core's Advanced Transfer Cache mechanism optimized bandwidth for simultaneous reads and writes, enhancing efficiency in multiprocessor environments.[3]Coppermine T and Tualatin Cores (2001–2003)
![KL Intel Pentium III Tualatin][float-right] The Coppermine T core represented a refinement of the earlier Coppermine architecture, maintaining the 180 nm process while introducing the FC-PGA2 package with an integrated heat spreader for improved thermal management.[23] These processors operated at clock speeds from 800 MHz to 1.13 GHz with a 133 MHz front-side bus and 256 KB of on-die full-speed L2 cache. Designed for Socket 370 compatibility with existing motherboards, Coppermine T variants supported AGTL signaling and core voltages of 1.5-1.75 V, enabling higher frequencies without requiring chipset changes.[3] In contrast, the Tualatin core marked Intel's shift to a 130 nm process node, enhancing transistor density and power efficiency over prior Pentium III iterations.[24] Released starting in mid-2001, Tualatin-based desktop Pentium III processors reached speeds up to 1.4 GHz, paired with 256 KB L2 cache and 133 MHz FSB, while server variants under the Pentium III-S branding featured 512 KB L2 cache.[6] Key architectural updates included adoption of AGTL+ signaling and a reduced core voltage of approximately 1.25 V, necessitating compatible chipsets like Intel's 82815 or later for proper operation, as earlier boards lacked support for the revised electrical specifications.[25] Tualatin processors omitted the Processor Serial Number feature present in earlier cores, addressing privacy concerns raised during Pentium III's lifecycle. Production extended into 2003 primarily for embedded and low-volume markets, as Intel prioritized the NetBurst architecture with Pentium 4, though Tualatin offered superior per-watt performance in legacy systems due to its P6 pipeline optimizations and smaller die.[26] Both cores retained SSE instructions and other Pentium III hallmarks, but Tualatin's process shrink yielded marginal improvements in overclocking potential and heat dissipation, with typical TDPs around 28-33 W at peak frequencies.[27]Technical Features and Innovations
Streaming SIMD Extensions (SSE)
Streaming SIMD Extensions (SSE), initially developed under the codename Katmai New Instructions (KNI) and later marketed as Internet Streaming SIMD Extensions, represent Intel's extension to the IA-32 architecture for accelerating multimedia and scientific computing workloads.[28] Introduced with the Pentium III Katmai core processors launched on February 28, 1999, SSE added 70 new instructions focused on single-precision floating-point SIMD operations, building on the earlier MMX integer SIMD capabilities to address demands for enhanced 3D graphics and video processing in personal computers.[7][29] Development of these extensions began in late 1995, targeting performance scaling for visual computing applications akin to console and workstation capabilities by the 1999 timeframe.[28] SSE introduced eight dedicated 128-bit XMM registers (XMM0 through XMM7), separate from the existing x87 FPU and MMX registers, along with a 32-bit MXCSR control and status register for managing exception masks, rounding modes, and flags in SIMD floating-point operations.[28] These registers enable packed data processing of four 32-bit single-precision floating-point elements simultaneously, supporting arithmetic (add, subtract, multiply, divide), logical operations, comparisons, and conversions, with capabilities for data shuffling, packing/unpacking, and reciprocal approximations to optimize common multimedia algorithms.[7] The instruction set also includes new-media integer extensions for enhanced 128-bit packed byte/word/dword operations, complementing the floating-point focus.[7] A core innovation in SSE is its support for memory streaming, designed to handle sequential data access patterns in bandwidth-intensive tasks without polluting the data cache.[28] This includes non-temporal store instructions like MOVNTPS (for aligned single-precision floats) and MOVNTQ (for 64-bit MMX data), which bypass cache write-back and allocate minimally in write-combining buffers—expanded to four such buffers in Pentium III for up to 20% higher bus write bandwidth.[9][7] Prefetch instructions (PREFETCHNTA, PREFETCHT0, PREFETCHT1, PREFETCHT2) allow software-controlled data movement from system memory into cache levels, reducing latency for streaming workloads.[28] Hardware implementation in the Katmai core leveraged the existing 64-bit datapath by splitting 128-bit SSE instructions into two micro-operations, adding specialized execution units such as a packed single-precision adder on port 1, a merged multiplier for x87 and SSE on port 0, and dedicated shuffle/logical units, while increasing die area by approximately 10% over the Pentium II Deschutes core.[9] In terms of performance, SSE enabled up to 2x speedup in 3D geometry pipelines and 74% improvement in 3D WinBench 99 transform and lighting tests compared to Pentium II, with peak throughput reaching 2.2 GFLOPS at 550 MHz through simultaneous dual-issue addition and multiplication.[28][7][9] Targeted applications included real-time MPEG-2 video encoding/decoding, 3D rendering, image processing, speech recognition (up to 60x faster training), and audio processing, where the extensions facilitated efficient vectorized computation without the overhead of scalar x87 floating-point emulation.[28][7] Precise exception handling was maintained via a Check Next Micro-Operation (CNU) mechanism to ensure compatibility with legacy IA-32 software.[9] Subsequent Pentium III cores like Coppermine and Tualatin retained full SSE compatibility, with SSE serving as the foundation for later extensions such as SSE2 in Pentium 4.[29]Processor Serial Number (PSN)
The Processor Serial Number (PSN) was a hardware-embedded unique identifier introduced in Intel's Pentium III processors with the Katmai core, launched on February 26, 1999.[1] This 96-bit value, comprising processor family/model details and a unique serial portion, was irreversibly programmed into the processor die using polysilicon fuses during manufacturing at wafer sort, leveraging salicide agglomeration technology for reliability and redundancy.[30][31] The fuses created a 64-bit unique component derived from on-die elements, ensuring each processor's identifier remained distinct and tamper-proof against software or system alterations.[31] Software access to the PSN occurred via the CPUID instruction, with feature support signaled by bit 18 set in the EDX register when executing CPUID with EAX=1.[31] Retrieval involved executing CPUID with EAX=3, populating registers such as ECX and EDX with segments of the 96-bit number; this was permissible at all privilege levels (PL0–PL3) unless disabled.[31][32] A dedicated disable bit in a model-specific register (MSR), writable only at PL0 by supervisory code like BIOS, provided a "sticky" lockout mechanism—once set, it prevented PSN reads until a hardware reset.[31] Intel positioned the PSN as a technical enabler for precise system identification in enterprise environments, such as asset inventory across networks or authentication in e-commerce transactions, where traditional identifiers like MAC addresses could change.[31] The feature was standard in Katmai and Coppermine cores but excluded from subsequent Tualatin implementations, reflecting shifts in design priorities.[12] By default, BIOS settings enabled PSN readability upon system initialization, allowing operating systems and applications to query it directly.[31]Additional Hardware Capabilities
The Pentium III processor supported symmetric multiprocessing (SMP) configurations, enabling up to two CPUs in compatible motherboards and chipsets, which facilitated improved performance in multi-threaded applications for workstations and small servers.[3] This hardware capability built on the P6 architecture's existing bus protocols, allowing shared memory access while maintaining cache coherency through Intel's MESI protocol extensions.[3] A key innovation in later core revisions, such as Coppermine, was the Advanced Transfer Cache (ATC) for the on-die L2 cache, which utilized a 256-bit wide interface to the execution core for doubled bandwidth compared to prior designs.[3] This architecture permitted concurrent cache fills and victim evictions, minimizing stalls during data prefetching and boosting hit rates in bandwidth-limited scenarios by up to 20% in synthetic tests. The ATC's non-blocking nature complemented the processor's dual independent buses, optimizing instruction and data throughput independently.[33]Specifications and Performance Characteristics
Clock Speeds, Cache, and Power Consumption
The Katmai core, introduced in 1999 on a 250 nm process, operated at clock speeds from 450 MHz to 600 MHz for desktop variants in Slot 1 packaging. It featured 512 KB of external L2 cache running at half the core clock speed, with 16 KB L1 instruction and 16 KB L1 data caches integrated on-die.[34] Power consumption reached maximum dissipation levels of approximately 34 W at 450 MHz and up to 37.5 W at 500 MHz, with thermal design power (TDP) around 25-28 W depending on the specific model; core voltage was fixed at 2.0 V.[15][35] The Coppermine core, released in 2000 on a 180 nm process, extended clock speeds to 500 MHz through 1.13 GHz, supporting both 100 MHz and 133 MHz front-side bus (FSB) frequencies.[3] L2 cache was reduced to 256 KB but integrated on-die with full core speed operation via Intel's Advanced Transfer Cache, improving latency over Katmai's external design while maintaining the same 32 KB L1 configuration.[3] Core voltage dropped to 1.60-1.76 V, enabling lower power draw: TDP ranged from 19.6 W at 600 MHz to 29 W at 1 GHz, with higher models like 1.13 GHz approaching 37.5 W maximum dissipation under load.[36][3] Tualatin core variants, produced from 2001 on a 130 nm process, achieved the highest clock speeds of 1.0 GHz to 1.4 GHz, primarily for Socket 370 packaging with 133 MHz FSB support.[37] L2 cache options included 256 KB for standard desktop models and 512 KB for server-oriented Pentium III-S designations, both on-die at full speed to enhance efficiency.[37] Voltage further decreased to around 1.25-1.45 V, yielding TDP values near 30-32 W for top-end 1.4 GHz models with 512 KB cache, reflecting denser integration and process shrinks that balanced higher frequencies against modest power increases.[38]| Core Variant | Clock Speed Range (MHz) | L2 Cache | Typical TDP (W) | Core Voltage (V) |
|---|---|---|---|---|
| Katmai | 450–600 | 512 KB external, half-speed | 25–28 | 2.0 |
| Coppermine | 500–1,130 | 256 KB on-die, full-speed | 20–29 | 1.6–1.76 |
| Tualatin | 1,000–1,400 | 256/512 KB on-die, full-speed | 30–32 | 1.25–1.45 |
Benchmark Comparisons and Real-World Efficiency
In synthetic benchmarks such as SPEC CPU2000, the Pentium III exhibited scalable performance closely tied to clock speed, with the Coppermine core showing measurable gains over the Katmai due to on-die L2 cache integration, which reduced latency and improved data throughput. A 1.0 GHz Coppermine Pentium III in a Dell Precision Workstation 420 configuration yielded a SPECint_base2000 score of 418 and a SPECfp_base2000 score of 329, reflecting balanced integer and floating-point execution suitable for scientific and multimedia workloads of the era.[39] Higher-clock variants further extended this, with a 1.13 GHz model achieving SPECfp_base2000 of 377 and a 1.4 GHz Tualatin-based unit reaching 437, underscoring the core's efficiency in FP-intensive tasks leveraging SSE instructions.[40][41]| Clock Speed | Core | SPECint_base2000 | SPECfp_base2000 | System Example |
|---|---|---|---|---|
| 800 MHz | Coppermine | 361 | N/A | Dell Precision 420[42] |
| 1.0 GHz | Coppermine | 418 | 329 | Dell Precision 420[39] |
| 1.13 GHz | Coppermine | N/A | 377 | Dell PowerEdge 2550[40] |
| 1.26 GHz | Tualatin | 611 | 422 | Dell PowerEdge 1500SC[43][44] |
| 1.4 GHz | Tualatin | 648 | 437 | Dell PowerEdge 1500SC[45][41] |