Hubbry Logo
Pentium IIIPentium IIIMain
Open search
Pentium III
Community hub
Pentium III
logo
8 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
Pentium III
Pentium III
from Wikipedia

Pentium III
General information
LaunchedFebruary 28, 1999
DiscontinuedApril 23, 2004 (for desktop units)
May 18, 2007 (for mobile units)[1]
July 14, 2009 (discontinuation and end of life)
Marketed byIntel
Designed byIntel
Common manufacturer
  • Intel
Product code
  • Katmai: 80525
  • Coppermine: 80526
  • Coppermine T: 80533
  • Tualatin: 80530
Performance
Max. CPU clock rate400 MHz to 1.4 GHz
FSB speeds100 MT/s to 133 MT/s
Cache
L1 cache32 KB (16 KB data + 16 KB instructions)
L2 cache128–512 KB
Architecture and classification
Technology node250 nm to 130 nm
MicroarchitectureP6
Instruction setIA-32
Extensions
Physical specifications
Transistors
  • Katmai: 9.5 million
  • Coppermine: 28 million
  • Tualatin: 47 million
Cores
  • 1
Sockets
Products, models, variants
Core names
  • Katmai (desktop only)
  • Coppermine
  • Coppermine T (desktop only)
  • Tualatin
Variant
History
PredecessorPentium II
SuccessorPentium 4
Support status
Unsupported

The Pentium III[2] (marketed as Intel Pentium III Processor, informally PIII or P3) brand refers to Intel's 32-bit x86 desktop and mobile CPUs based on the sixth-generation P6 microarchitecture introduced on February 28, 1999.[citation needed] The brand's initial processors were very similar to the earlier Pentium II-branded processors. The most notable differences were the addition of the Streaming SIMD Extensions (SSE) instruction set (to accelerate floating point and parallel calculations), and the introduction of a controversial serial number embedded in the chip during manufacturing.

Even after the release of the Pentium 4 in late 2000, the Pentium III continued to be produced with new models introduced up until early 2003. They were then discontinued in April 2004 for desktop units[3] and May 2007 for mobile units.[1] They were leaving of support and removed from the official price lists in July 2009.

Processor cores

[edit]

Similarly to the Pentium II it superseded, the Pentium III was also accompanied by the Celeron brand for lower-end versions, and the Xeon for high-end (server and workstation) derivatives. The Pentium III was eventually superseded by the Pentium 4, but its Tualatin core also served as the basis for the Pentium M CPUs, which used many ideas from the P6 microarchitecture. Subsequently, it was the Pentium M microarchitecture of Pentium M branded CPUs, and not the NetBurst found in Pentium 4 processors, that formed the basis for Intel's energy-efficient Core microarchitecture of CPUs branded Core 2, Pentium Dual-Core, Celeron (Core), and Xeon.

Intel Pentium III processor family
Standard logo
(1999–2003)
Desktop Mobile logo
(2001–2003)
Mobile
Code-named Node Date released Code-named Node Date released
Pentium III logo (1999–2003) Katmai
Coppermine
Coppermine T
Tualatin
250 nm
180 nm
180 nm
130 nm
February 1999
October 1999
June 2001
June 2001
Pentium III-M Logo (1999–2003) Coppermine
Tualatin
180 nm
130 nm
October 1999
July 2001
List of desktop processors List of mobile processors

Katmai

[edit]
A Pentium III Katmai SECC2 cartridge with heatsink removed.
Katmai Die shot

The first Pentium III variant was the Katmai (Intel product code 80525). It was a further development of the Deschutes Pentium II. The Pentium III saw an increase of 2 million transistors over the Pentium II. The differences were the addition of execution units and SSE instruction support, and an improved L1 cache controller[citation needed] (the L2 cache controller was left unchanged, as it would be fully redesigned for Coppermine anyway), which were responsible for the minor performance improvements over the "Deschutes" Pentium IIs. It was first released at speeds of 450 and 500 MHz on February 28, 1999. Two more versions were released: 550 MHz on May 17, 1999, and 600 MHz on August 2, 1999. On September 27, 1999, Intel released the 533B and 600B running at 533 & 600 MHz respectively. The 'B' suffix indicated that it featured a 133 MT/s FSB, instead of the 100 MT/s FSB of prior models.

The Katmai contains 9.5 million transistors, not including the 512 Kbytes L2 cache (which adds 25 million transistors), and has dimensions of 12.3 mm by 10.4 mm (128 mm2). It is fabricated in Intel's P856.5 process, a 250 nm complementary metal–oxide–semiconductor (CMOS) process with five levels of aluminum interconnect.[4] The Katmai used the same slot-based design as the Pentium II but with the newer Slot 1 Single Edge Contact Cartridge (SECC) 2 that allowed direct CPU core contact with the heat sink. There have been some early models of the Pentium III with 450 and 500 MHz packaged in an older SECC cartridge intended for original equipment manufacturers (OEMs).

A notable stepping level for enthusiasts was SL35D. This version of Katmai was officially rated for 450 MHz, but often contained cache chips for the 600 MHz model and thus usually can run at 600 MHz.

Coppermine

[edit]
A 900 MHz Coppermine FC-PGA Pentium III.
Coppermine Die shot

The second version, codenamed Coppermine (Intel product code: 80526), was released on October 25, 1999, running at 500, 533, 550, 600, 650, 667, 700, and 733 MHz. From December 1999 to May 2000, Intel released Pentium IIIs running at speeds of 750, 800, 850, 866, 900, 933 and 1000 MHz (1 GHz). Both 100 MT/s FSB and 133 MT/s FSB models were made. For models that were already available with the same frequency, an "E" was appended to the model name to indicate cores using the new 180 nm fabrication process. An additional "B" was later appended to designate 133 MHz FSB models, resulting in an "EB" suffix. In overall performance, Coppermine had a small advantage over the Advanced Micro Devices (AMD) Athlons it was released against, which was reversed when AMD applied their own die shrink and added an on-die L2 cache to the Athlon. Athlon held the advantage in floating-point intensive code, while the Coppermine could perform better when SSE optimizations were used, but in practical terms there was little difference in how the two chips performed, clock-for-clock. However, AMD were able to clock the Athlon higher, reaching speeds of 1.2 GHz before the launch of the Pentium 4.

In performance, Coppermine arguably marked a bigger step than Katmai by introducing an on-chip L2 cache, which Intel names Advanced Transfer Cache (ATC). The ATC operates at the core clock rate and has a capacity of 256 KB, twice that of the on-chip cache formerly on Mendocino Celerons. It is eight-way set-associative and is accessed via a Double Quad Word Wide 256-bit bus, four times as wide as Katmai's. Further, latency was dropped to a quarter compared to Katmai. Another marketing term by Intel was Advanced System Buffering, which encompassed improvements to better take advantage of a 133 MT/s system bus. These include 6 fill buffers (vs. 4 on Katmai), 8 bus queue entries (vs. 4 on Katmai) and 4 write-back buffers (vs. 1 on Katmai).[5] Under competitive pressure from the AMD Athlon, Intel reworked the internals, finally removing some well-known pipeline stalls.[citation needed] As a result, applications affected by the stalls ran faster on Coppermine by up to 30%.[citation needed] The Coppermine contained 29 million transistors and was fabricated in a 180 nm process.

The Coppermine was available in 370-pin FC-PGA or FC-PGA2 for use with Socket 370, or in SECC2 for Slot 1 (all speeds except 900 and 1100). FC-PGA and Slot 1 Coppermine CPUs have an exposed die, however most higher frequency SKUs starting with the 866 MHz model were also produced in FC-PGA2 variants that feature an integrated heat spreader (IHS). This in itself did not improve thermal conductivity, since it added another layer of metal and thermal paste between the die and the heatsink, but it greatly assisted in holding the heatsink flat against the die. Earlier Coppermines without the IHS made heatsink mounting challenging.[6] If the heatsink was not situated flat against the die, heat transfer efficiency was greatly reduced. Some heatsink manufacturers began providing pads on their products, similar to what AMD did with the "Thunderbird" Athlon to ensure that the heatsink was mounted flatly. The enthusiast community went so far as to create shims to assist in maintaining a flat interface.[7]

A 1.13 GHz version (S-Spec SL4HH) was released in mid-2000 but famously recalled after a collaboration between HardOCP and Tom's Hardware[8] discovered various instabilities with the operation of the new CPU speed grade. The Coppermine core was unable to reliably reach the 1.13 GHz speed without various tweaks to the processor's microcode, effective cooling, higher voltage (1.75 V vs. 1.65 V), and specifically validated platforms.[8] Intel only officially supported the processor on its own VC820 i820-based motherboard, but even this motherboard displayed instability in the independent tests of the hardware review sites. In benchmarks that were stable, performance was shown to be sub-par, with the 1.13 GHz CPU equalling a 1.0 GHz model. Tom's Hardware attributed this performance deficit to relaxed tuning of the CPU and motherboard to improve stability.[9] Intel needed at least six months to resolve the problems using a new cD0 stepping and re-released 1.1 GHz and 1.13 GHz versions in 2001.

Microsoft's Xbox game console uses a variant of the Pentium III/Mobile Celeron family in a Micro-PGA2 form factor. The sSpec designator of the chips is SL5Sx, which makes it more similar to the Mobile Celeron Coppermine-128 processor. It shares with the Coppermine-128 Celeron its 128 KB L2 cache, and 180 nm process technology, but keeps the 8-way cache associativity from the Pentium III.[10]

Although its codename could give the impression that it used copper interconnects, in reality, its interconnects were aluminium.

Coppermine T

[edit]

This revision is an intermediate step between Coppermine and Tualatin, with support for lower-voltage system logic present on the latter but core power within previously defined voltage specs of the former so it could work in older system boards.

Intel used the latest FC-PGA2 Coppermines with the cD0 stepping and modified them so that they worked with low voltage system bus operation at 1.25 V AGTL as well as normal 1.5 V AGTL+ signal levels, and would auto detect differential or single-ended clocking. This modification made them compatible to the latest generation Socket 370 boards supporting Tualatin CPUs while maintaining compatibility with older Socket 370 boards. The Coppermine T also had two way symmetrical multiprocessing capabilities, but only in Tualatin boards.

They can be distinguished from Tualatin processors by their part numbers, which include the digits "80533", e.g. the 1133 MHz SL5QK P/N is RK80533PZ006256, while the 1000 MHz SL5QJ P/N is RK80533PZ001256.[11]

Tualatin

[edit]
A 1.13 GHz FC-PGA2 Tualatin-256 Intel Pentium III-T.
Tualatin die shot

The third revision, Tualatin (80530), was a trial for Intel's new 130 nm process. Tualatin-based Pentium IIIs were released during 2001 until early 2002 at speeds of 1.0, 1.13, 1.2, 1.26, 1.33 and 1.4 GHz. A basic shrink of Coppermine, no new features were added, except for added data prefetch logic similar to Pentium 4 and Athlon XP for potentially better use of the L2 cache, although its use compared to these newer CPUs is limited due to the relatively smaller FSB bandwidth (FSB was still kept at 133 MHz).[12] Variants with 256 and 512 KB L2 cache were produced, the latter being dubbed Pentium III-S; this variant was mainly intended for low-power consumption servers and also exclusively featured SMP support within the Tualatin line.

Although the Socket 370 designation was kept, the use of 1.25 AGTL signaling in place of 1.5 V AGTL+ rendered prior motherboards incompatible.[12] This confusion carried over to the chipset naming, where only the B-stepping of the i815 chipset was compatible with Tualatin processors.[13] A new VRM guideline was also designed by Intel, version 8.5, which required finer voltage steps and debuted load line Vcore (in place of fixed voltage regardless of current on 8.4).[14][15][16] Some motherboard manufacturers would mark the change with blue sockets (instead of white), and were often also backwards compatible with Coppermine CPUs.

The Tualatin also formed the basis for the highly popular Pentium III-M mobile processor, which became Intel's front-line mobile chip (the Pentium 4 drew significantly more power, and so was not well-suited for this role) for the next two years. The chip offered a good balance between power consumption and performance, thus finding a place in both performance notebooks and the "thin and light" category.

The Tualatin-based Pentium III performed well in some applications compared to the fastest Willamette-based Pentium 4, and even the Thunderbird-based Athlons. Despite this, its appeal was limited due to the aforementioned incompatibility with existing systems, and Intel's only officially supported chipset for Tualatins, the i815, could only handle 512 MB RAM as opposed to 1 GB of registered RAM with the older, incompatible 440BX chipset. However, the enthusiast community found a way to run Tualatins on then-ubiquitous BX chipset based boards, although it was often a non-trivial task and required some degree of technical skills.

Tualatin-based Pentium III CPUs can usually be visually distinguished from Coppermine-based processors by the metal integrated heat-spreader (IHS) fixed on top of the package. However, the last models of Coppermine Pentium IIIs also featured the IHS — the integrated heat spreader is actually what distinguishes the FC-PGA2 package from the FC-PGA — both are for Socket 370 motherboards.[17]

Before the addition of the heat spreader, it was sometimes difficult to install a heatsink on a Pentium III. One had to be careful not to put force on the core at an angle because doing so would cause the edges and corners of the core to crack and could destroy the CPU. It was also sometimes difficult to achieve a flat mating of the CPU and heatsink surfaces, a factor of critical importance to good heat transfer. This became increasingly challenging with the Socket 370 CPUs, compared with their Slot 1 predecessors, because of the force required to mount a socket-based cooler and the narrower, 2-sided mounting mechanism (Slot 1 featured 4-point mounting). As such, and because the 130 nm Tualatin had an even smaller core surface area than the 180 nm Coppermine, Intel installed the metal heatspreader on Tualatin and all future desktop processors.

The Tualatin core was named after the Tualatin Valley and Tualatin River in Oregon, where Intel has large manufacturing and design facilities.

New features

[edit]

Streaming SIMD Extensions

[edit]
Slot 1 Pentium III CPU mounted on a motherboard

Since Katmai was built in the same 250 nm process as Pentium II "Deschutes", it had to implement Streaming SIMD Extensions (SSE) using minimal silicon.[18] To achieve this goal, Intel implemented the 128-bit architecture by double-cycling the existing 64-bit data paths and by merging the SIMD-FP multiplier unit with the x87 scalar FPU multiplier into a single unit. To utilize the existing 64-bit data paths, Katmai issues each SIMD-FP instruction as two μops. To compensate partially for implementing only half of SSE's architectural width, Katmai implements the SIMD-FP adder as a separate unit on the second dispatch port. This organization allows one half of a SIMD multiply and one half of an independent SIMD add to be issued together bringing the peak throughput back to four floating point operations per cycle — at least for code with an even distribution of multiplies and adds.[4][19]

The issue was that Katmai's hardware-implementation contradicted the parallelism model implied by the SSE instruction-set. Programmers faced a code-scheduling dilemma: "Should the SSE-code be tuned for Katmai's limited execution resources, or should it be tuned for a future processor with more resources?" Katmai-specific SSE optimizations yielded the best possible performance from the Pentium III family but was suboptimal for Coppermine onwards as well as future Intel processors, such as the Pentium 4 and Core series.

Processor Serial Number

[edit]

The Pentium III was the first x86 CPU to include a unique, retrievable identification number called Processor Serial Number (PSN). A Pentium III's PSN can be read by software[20] through the CPUID instruction if this feature has not been disabled through the BIOS.

On November 29, 1999, the Science and Technology Options Assessment (STOA) Panel of the European Parliament, following their report on electronic surveillance techniques asked parliamentary committee members to consider legal measures that would "prevent these chips from being installed in the computers of European citizens."[21]

Intel eventually removed the PSN feature from Tualatin-based Pentium IIIs, and the feature was absent in Pentium 4 and Pentium M.

A largely equivalent feature, the Protected Processor Identification Number (PPIN) was later added to x86 CPUs with little public notice, starting with Intel's Ivy Bridge architecture and compatible Zen 2 AMD CPUs. It is implemented as a set of model-specific registers and is useful for machine check exception handling.[22]

Hardware random number generator

[edit]

A new feature was added to the Pentium III: a hardware-based random number generator.[23][24] This hardware RNG uses multiple oscillators that mix their signals; the resulting unusual waveform is then sampled at irregular intervals.[25]

Core specifications

[edit]
Code-
name
First
Re-
lease
Pro-
cess
size
Cache Instruc-
tions
Package VCore (V) Clock Rate
(MHz)
Front
side
bus

(MT/s)
L1
(data +
instr.)
L2
Katmai Feb 26
1999
250 nm 16+16 KB 512 KB,
half speed
MMX,
SSE
Slot 1
(SECC,
SECC2)
2.00
(<600 MHz)
2.05
(600 MHz)
450, 500, 550, 600
(450, 500, 550, 600)
100
533, 600
(533B, 600B)
133
Copper-
mine
Oct 25
1999
180 nm 256 KB,
full speed
Slot 1
(SECC2),
 
Socket 370
(FC-PGA,
FC-PGA2)
1.60,1.65,
1.70,1.75
500,550,600,650,700,750,
800,850,900,1000,1100
(E-Models)
100
533,600,667,733,800,866,
933,1000,1133
(EB-Models)
133
Copper-
mine T
Aug
2000
Socket 370
(FC-PGA,
FC-PGA2)
1.75 800, 866, 933, 1000, 1133 133
Tualatin June
2001
130 nm 256 KB,
full speed
MMX,
SSE,
Hardware
Prefetch
Socket 370
(FC-PGA2)
1.45/1.475 1000,1100,1200,1300,1400
(Celeron)
1000,1133,1200,1333,1400
(Pentium III)
512 KB,
full speed
1133, 1266, 1400
(Pentium III-S)
List of Intel Pentium III processors
Size comparison of Pentium III dies

See also

[edit]

References

[edit]
[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The Pentium III is Intel's designation for a lineage of 32-bit x86 microprocessors anchored in the sixth-generation P6 microarchitecture, launched on February 26, 1999, as a successor to the Pentium II for desktop, mobile, and server platforms. The inaugural Katmai core, etched on a 0.25-micrometer process with 9.5 million transistors, debuted in single-edge processor packaging (SEPP) for Slot 1, delivering initial clock rates from 450 to 600 MHz alongside an external L2 cache. A hallmark innovation was the advent of Streaming SIMD Extensions (SSE), enabling parallel processing of 128-bit vector operations to boost efficiency in 3D graphics, video encoding, and internet streaming tasks. Evolving variants amplified these capabilities: the Coppermine core, introduced in late 1999 on 0.18-micrometer silicon, embedded 256 KB of full-speed L2 cache on-die for latency reductions and uplifts, supporting Socket 370 interfaces and frequencies up to 1.13 GHz. The culminating Tualatin revision in 2001 shrank to 0.13 micrometers, doubling L2 cache to 512 KB in select models, attaining peaks of 1.4 GHz, and incorporating power optimizations for sustained viability in legacy systems post the architecture's debut. Early implementations embedded a Processor (PSN) for hardware , yet privacy apprehensions from civil liberties advocates compelled to disable it by default via controls and firmware updates. These processors underpinned binary compatibility with antecedent x86 offerings, powering workstations, consumer PCs, and embedded applications through the early 2000s.

Development and Release History

Origins and Design Goals

The Pentium III processor emerged as a direct successor to the Pentium II within Intel's sixth-generation P6 microarchitecture family, with development centered on the Katmai core to extend the Deschutes core's capabilities. Originating in the late 1990s amid rising demand for multimedia computing, the design retained the Pentium II's Slot 1 packaging and back-side L2 cache configuration while incorporating architectural modifications to address limitations in handling internet-era workloads. This evolution was necessitated by competitive pressures and the shift toward data-intensive applications, building on the P6 pipeline's established efficiency without a full redesign. Key design objectives prioritized the addition of (SSE), initially branded as Internet SSE, to enhance single-instruction, multiple-data operations for floating-point and packed integer computations, targeting accelerations in , , audio/, and advanced . Intel engineers aimed to integrate these 128-bit SIMD capabilities by adapting the existing 64-bit data paths through double-cycling and merging SIMD floating-point adders with multimedia extensions, thereby enabling richer experiences without fundamentally altering the core execution units. To balance performance gains with economic viability, development goals emphasized constraining the die size increase to approximately 10% over the —adding about 2 million transistors primarily for SSE support—while preserving a roadmap comparable to prior models, thus minimizing manufacturing costs and power demands. This approach allowed for rapid production ramp-up, positioning the Pentium III as an optimized platform for media-rich personal computing both , as articulated in Intel's launch strategy. The inclusion of a unique Processor Serial Number (PSN) further supported goals for secure identification and personalized services, though it introduced later debates on implications.

Launch and Production Timeline

The Pentium III processor was launched on February 26, 1999, with the initial Katmai core models available at clock speeds of 450 MHz and 500 MHz in the Single Edge Connector Cartridge 2 (SECC2) package for motherboards. The 550 MHz variant followed in the second quarter of 1999. These processors were fabricated using a 250 nm process and featured 512 KB of L2 cache running at half the core frequency. Katmai production spanned from the launch through September 1999, during which released higher-speed variants up to 600 MHz, including models with 1 MB L2 cache in the 600EB variant. In October 1999, introduced the Coppermine core, shifting to a and Flip-Chip (FCPGA) packaging compatible with Socket 370, starting at 500 MHz with full-speed on-die 256 KB L2 cache. Coppermine production continued into 2001, encompassing clock speeds from 500 MHz to 1.13 GHz and supporting both via adapters and Socket 370. An intermediate Coppermine T core variant appeared in 2001, retaining the 180 nm process but optimized for Socket 370 with minor efficiency improvements. The final major revision, the Tualatin core, debuted in 2001 on a , offering 512 KB L2 cache and speeds up to 1.4 GHz, primarily for Socket 370 systems and extending into early 2002. Desktop Pentium III production phased out by 2003 as Intel shifted focus to the , though mobile variants persisted longer until around 2004.

Microarchitecture and Core Variants

Katmai Core (1999)

The Katmai core served as the debut microarchitecture for Intel's Pentium III processor line, retaining the sixth-generation P6 design foundation from the preceding Deschutes core while introducing key enhancements. Fabricated on a 0.25 μm process with five layers of aluminum interconnect, it encompassed 9.5 million transistors in the primary die, excluding the separate L2 cache components. Initial Katmai processors launched on February 26, 1999, at clock speeds of 450 MHz and 500 MHz, paired with a 100 MHz (FSB) and operating at core voltages of approximately 2.0 V. Subsequent variants extended frequencies to 550 MHz in May 1999 and 600 MHz by August 1999, with later models supporting a 133 MHz FSB for improved bandwidth. The core's principal innovation over its predecessor lay in the integration of (SSE), comprising 70 new instructions that enabled 128-bit packed single-precision floating-point processing across four elements simultaneously, aimed at accelerating and 3D workloads. This addition necessitated minimal die area adjustments due to the shared manufacturing process, preserving compatibility with existing motherboards via the Single Edge Connector Cartridge 2 (SECC2) packaging. Katmai processors featured 16 KB of L1 instruction cache and 16 KB of L1 data cache, alongside a 512 KB unified off-die L2 cache running at half the core clock speed to balance performance and power efficiency. The design supported MMX instructions inherited from prior generations, but SSE required software optimization for meaningful gains, as initial application support was limited. Production emphasized , allowing Katmai to function as a near drop-in upgrade for systems, though ranged from 25 W to 32 W depending on speed.

Coppermine Core (2000)

The Coppermine core, the second generation variant of the Pentium III processor, was fabricated on a 0.18 micrometer process node, enabling the integration of 256 KB of L2 cache directly on the processor die operating at full core clock speed. This design shift from the Katmai core's external, half-speed L2 cache significantly reduced latency and improved overall , particularly in cache-intensive workloads. Initial Coppermine processors launched on October 25, 1999, with clock speeds ranging from 500 MHz to 733 MHz, supporting frequencies of 100 MHz. Subsequent releases in 2000 expanded the lineup to higher frequencies, including models up to 1.13 GHz by mid-year, all featuring 28 million transistors and compatibility with via the FC-PGA package for desktop variants. The core retained the Katmai pipeline with 9 execution units, including support for (SSE), but benefited from the smaller process for lower power consumption and higher clock scalability. Early Coppermine models initially used packaging, but the transition to Socket 370 facilitated smaller form factors and easier integration. Performance gains over Katmai were estimated at 10-20% in and floating-point benchmarks due to the unified on-die cache, though real-world benefits varied by application. Intel addressed initial yield issues from the process shrink through revisions, enabling sustained production into 2001 before the Coppermine T refinement. The core's Advanced Transfer Cache mechanism optimized bandwidth for simultaneous reads and writes, enhancing efficiency in multiprocessor environments.

Coppermine T and Tualatin Cores (2001–2003)

![KL Intel Pentium III Tualatin][float-right] The Coppermine T core represented a refinement of the earlier Coppermine architecture, maintaining the 180 nm process while introducing the FC-PGA2 package with an integrated heat spreader for improved thermal management. These processors operated at clock speeds from 800 MHz to 1.13 GHz with a 133 MHz front-side bus and 256 KB of on-die full-speed L2 cache. Designed for Socket 370 compatibility with existing motherboards, Coppermine T variants supported AGTL signaling and core voltages of 1.5-1.75 V, enabling higher frequencies without requiring chipset changes. In contrast, the Tualatin core marked Intel's shift to a 130 nm process node, enhancing transistor density and power efficiency over prior Pentium III iterations. Released starting in mid-2001, Tualatin-based desktop Pentium III processors reached speeds up to 1.4 GHz, paired with 256 KB L2 cache and 133 MHz FSB, while server variants under the Pentium III-S branding featured 512 KB L2 cache. Key architectural updates included adoption of AGTL+ signaling and a reduced core voltage of approximately 1.25 V, necessitating compatible chipsets like Intel's 82815 or later for proper operation, as earlier boards lacked support for the revised electrical specifications. Tualatin processors omitted the Processor Serial Number feature present in earlier cores, addressing concerns raised during Pentium III's lifecycle. Production extended into 2003 primarily for embedded and low-volume markets, as Intel prioritized the NetBurst architecture with , though Tualatin offered superior per-watt performance in legacy systems due to its P6 optimizations and smaller die. Both cores retained SSE instructions and other Pentium III hallmarks, but Tualatin's process shrink yielded marginal improvements in potential and heat dissipation, with typical TDPs around 28-33 W at peak frequencies.

Technical Features and Innovations

Streaming SIMD Extensions (SSE)

Streaming SIMD Extensions (SSE), initially developed under the codename Katmai New Instructions (KNI) and later marketed as , represent Intel's extension to the architecture for accelerating multimedia and scientific computing workloads. Introduced with the Pentium III Katmai core processors launched on February 28, 1999, SSE added 70 new instructions focused on single-precision floating-point SIMD operations, building on the earlier MMX integer SIMD capabilities to address demands for enhanced 3D and in personal computers. Development of these extensions began in late 1995, targeting performance scaling for visual computing applications akin to console and capabilities by the 1999 timeframe. SSE introduced eight dedicated 128-bit XMM registers (XMM0 through XMM7), separate from the existing FPU and MMX registers, along with a 32-bit MXCSR control and status register for managing exception masks, rounding modes, and flags in SIMD floating-point operations. These registers enable packed data processing of four 32-bit single-precision floating-point elements simultaneously, supporting arithmetic (add, subtract, multiply, divide), logical operations, comparisons, and conversions, with capabilities for data shuffling, packing/unpacking, and reciprocal approximations to optimize common algorithms. The instruction set also includes new-media integer extensions for enhanced 128-bit packed byte/word/dword operations, complementing the floating-point focus. A core innovation in SSE is its support for memory streaming, designed to handle sequential data access patterns in bandwidth-intensive tasks without polluting the data cache. This includes non-temporal store instructions like MOVNTPS (for aligned single-precision floats) and MOVNTQ (for 64-bit MMX data), which bypass cache write-back and allocate minimally in write-combining buffers—expanded to four such buffers in Pentium III for up to 20% higher bus write bandwidth. Prefetch instructions (PREFETCHNTA, PREFETCHT0, PREFETCHT1, PREFETCHT2) allow software-controlled data movement from system memory into cache levels, reducing latency for streaming workloads. Hardware implementation in the Katmai core leveraged the existing 64-bit datapath by splitting 128-bit SSE instructions into two micro-operations, adding specialized execution units such as a packed single-precision adder on port 1, a merged multiplier for x87 and SSE on port 0, and dedicated shuffle/logical units, while increasing die area by approximately 10% over the Pentium II Deschutes core. In terms of performance, SSE enabled up to 2x speedup in 3D geometry pipelines and 74% improvement in 3D WinBench 99 transform and lighting tests compared to , with peak throughput reaching 2.2 GFLOPS at 550 MHz through simultaneous dual-issue addition and multiplication. Targeted applications included real-time video encoding/decoding, , image processing, (up to 60x faster training), and audio processing, where the extensions facilitated efficient vectorized computation without the overhead of scalar floating-point emulation. Precise exception handling was maintained via a Check Next Micro-Operation (CNU) mechanism to ensure compatibility with legacy software. Subsequent Pentium III cores like Coppermine and Tualatin retained full SSE compatibility, with SSE serving as the foundation for later extensions such as in Pentium 4.

Processor Serial Number (PSN)

The Processor Serial Number (PSN) was a hardware-embedded unique identifier introduced in Intel's Pentium III processors with the Katmai core, launched on February 26, 1999. This 96-bit value, comprising processor family/model details and a unique serial portion, was irreversibly programmed into the processor die using polysilicon fuses during manufacturing at wafer sort, leveraging salicide agglomeration technology for reliability and redundancy. The fuses created a 64-bit unique component derived from on-die elements, ensuring each processor's identifier remained distinct and tamper-proof against software or system alterations. Software access to the PSN occurred via the instruction, with feature support signaled by bit 18 set in the register when executing with EAX=1. Retrieval involved executing with EAX=3, populating registers such as ECX and with segments of the 96-bit number; this was permissible at all privilege levels (PL0–PL3) unless disabled. A dedicated disable bit in a (MSR), writable only at PL0 by supervisory code like , provided a "sticky" lockout mechanism—once set, it prevented PSN reads until a hardware reset. Intel positioned the PSN as a technical enabler for precise in enterprise environments, such as asset inventory across networks or in transactions, where traditional identifiers like MAC addresses could change. The feature was standard in Katmai and Coppermine cores but excluded from subsequent Tualatin implementations, reflecting shifts in design priorities. By default, BIOS settings enabled PSN readability upon system initialization, allowing operating systems and applications to query it directly.

Additional Hardware Capabilities

The Pentium III processor supported (SMP) configurations, enabling up to two CPUs in compatible motherboards and chipsets, which facilitated improved performance in multi-threaded applications for workstations and small servers. This hardware capability built on the P6 architecture's existing bus protocols, allowing access while maintaining cache coherency through Intel's extensions. A key innovation in later core revisions, such as Coppermine, was the Advanced Transfer Cache (ATC) for the on-die L2 cache, which utilized a 256-bit wide interface to the execution core for doubled bandwidth compared to prior designs. This permitted concurrent cache fills and victim evictions, minimizing stalls during data prefetching and boosting hit rates in bandwidth-limited scenarios by up to 20% in synthetic tests. The ATC's non-blocking nature complemented the processor's dual independent buses, optimizing instruction and data throughput independently.

Specifications and Performance Characteristics

Clock Speeds, Cache, and Power Consumption

The Katmai core, introduced in 1999 on a , operated at clock speeds from MHz to 600 MHz for desktop variants in packaging. It featured 512 KB of external L2 cache running at half the core clock speed, with 16 KB L1 instruction and 16 KB L1 data caches integrated on-die. Power consumption reached maximum dissipation levels of approximately at MHz and up to at 500 MHz, with (TDP) around 25-28 W depending on the specific model; core voltage was fixed at 2.0 V. The Coppermine core, released in on a , extended clock speeds to 500 MHz through 1.13 GHz, supporting both 100 MHz and 133 MHz (FSB) frequencies. L2 cache was reduced to 256 KB but integrated on-die with full core speed operation via Intel's Advanced Transfer Cache, improving latency over Katmai's external design while maintaining the same 32 KB L1 configuration. Core voltage dropped to 1.60-1.76 V, enabling lower power draw: TDP ranged from 19.6 W at 600 MHz to 29 W at 1 GHz, with higher models like 1.13 GHz approaching 37.5 W maximum dissipation under load. Tualatin core variants, produced from on a , achieved the highest clock speeds of 1.0 GHz to 1.4 GHz, primarily for Socket 370 packaging with 133 MHz FSB support. L2 cache options included 256 KB for standard desktop models and 512 KB for server-oriented Pentium III-S designations, both on-die at full speed to enhance efficiency. Voltage further decreased to around 1.25-1.45 V, yielding TDP values near 30-32 W for top-end 1.4 GHz models with 512 KB cache, reflecting denser integration and process shrinks that balanced higher frequencies against modest power increases.
Core VariantClock Speed Range (MHz)L2 CacheTypical TDP (W)Core Voltage (V)
Katmai450–600512 KB external, half-speed25–282.0
Coppermine500–1,130256 KB on-die, full-speed20–291.6–1.76
Tualatin1,000–1,400256/512 KB on-die, full-speed30–321.25–1.45

Benchmark Comparisons and Real-World Efficiency

In synthetic benchmarks such as SPEC CPU2000, the Pentium III exhibited scalable performance closely tied to clock speed, with the Coppermine core showing measurable gains over the Katmai due to on-die L2 cache integration, which reduced latency and improved throughput. A 1.0 GHz Coppermine Pentium III in a Workstation 420 configuration yielded a SPECint_base2000 score of 418 and a SPECfp_base2000 score of 329, reflecting balanced and floating-point execution suitable for scientific and workloads of the era. Higher-clock variants further extended this, with a 1.13 GHz model achieving SPECfp_base2000 of 377 and a 1.4 GHz Tualatin-based unit reaching 437, underscoring the core's efficiency in FP-intensive tasks leveraging SSE instructions.
Clock SpeedCoreSPECint_base2000SPECfp_base2000System Example
800 MHzCoppermine361N/A Precision 420
1.0 GHzCoppermine418329 Precision 420
1.13 GHzCoppermineN/A377 2550
1.26 GHzTualatin611422 1500SC
1.4 GHzTualatin648437 1500SC
Against competitors like the AMD Athlon, the Pentium III trailed in raw integer performance per clock cycle owing to the Athlon's superior branch prediction and larger L1 caches, with reviews noting Athlon processors outperforming equivalent-speed Pentium III units by 10-20% in multi-threaded or branch-heavy applications. However, in SSE-optimized floating-point scenarios, such as or early video encoding, the Pentium III closed the gap, particularly with Coppermine and Tualatin cores benefiting from 256-512 KB on-die L2 cache versus the Athlon's off-chip designs in initial models. Real-world efficiency favored the later Tualatin core, fabricated on a with reduced voltage (1.475 V versus 1.65-1.7 V for Coppermine), enabling lower thermal output around 28 TDP at 1.2-1.4 GHz compared to 30-35 for prior variants, which translated to sustained performance in office productivity and light without aggressive cooling. Cache-sensitive tasks like file compression highlighted Tualatin's advantages, where a 1.4 GHz unit outperformed Coppermine equivalents by up to 15% due to doubled L2 capacity (512 KB), enhancing instructions-per-clock (IPC) in bandwidth-limited workloads. Overall, the Pentium III's power efficiency supported its prevalence in enterprise servers and workstations, where stability trumped peak throughput against AMD's more volatile high-end offerings.

Market Reception and Competitive Landscape

Commercial Deployment and Sales Performance

The Intel Pentium III processor entered commercial deployment on February 26, 1999, with initial availability of 450 MHz and 500 MHz models for desktop and mobile applications. The 550 MHz variant followed in the second quarter of 1999, enabling rapid scaling to higher clock speeds amid growing demand for and internet-enabled computing. allocated $300 million for marketing, emphasizing the processor's for enhanced 3D graphics and in consumer and systems. Deployment extended to workstations and entry-level servers through the Pentium III Xeon, which supported multi-processor configurations for data center use. By March 1999, systems incorporating the Pentium III accounted for about 9% of U.S. retail PC sales, reflecting quick OEM adoption by manufacturers like and despite premium pricing—$823 to $899 for 500 MHz units. Strong initial uptake led to supply shortages, notably for 750 MHz and 800 MHz models in late 1999 and early 2000, as production ramped to meet volume commitments. Sales performance bolstered Intel's overall revenue, contributing to a 1999 total of $29.4 billion—a 12% year-over-year increase primarily from shipments. Intel periodically reduced prices to sustain momentum, such as a 26% cut on the 550 MHz desktop model to $487 in August 1999, which broadened accessibility amid competition. Specific unit volumes for the Pentium III were not itemized in public filings, but the processor's ecosystem integration and manufacturing scale helped Intel preserve an 80-85% share of the x86 CPU market through 2003, even as AMD's gained traction with superior integer performance in benchmarks. This dominance stemmed from Intel's supply chain reliability and partnerships, offsetting AMD's clock speed advantages in enthusiast segments.

Rivalry with AMD Athlon and Other Competitors

The AMD , launched in June 1999, directly challenged the Pentium III's market position by delivering higher performance in key benchmarks shortly after the Pentium III Katmai core's February 1999 debut. Early Athlon models, such as the 600 MHz variant using Slot A packaging, outperformed equivalent Pentium III processors in gaming workloads, achieving approximately 10 frames per second more in the Crusher test compared to the Pentium III 600 MHz. In broader application testing, the Athlon demonstrated a 10 percent edge in standard Windows productivity tasks and up to 20 percent in gaming scenarios over contemporaneous Pentium III chips. This superiority stemmed from the Athlon's K7 architecture, featuring deeper pipelining, a larger 512 KB external L2 cache, and enhanced execution units, which provided better per-clock efficiency in non-floating-point operations than the Pentium III's initial Katmai design. Intel responded with the Coppermine core in October 1999, integrating 256 KB of L2 cache on-die at 180 nm, which narrowed the gap in overall system performance and excelled in (SSE)-accelerated floating-point workloads where the Pentium III held an advantage over 's 3DNow! instructions. By early 2000, at 1 GHz clock speeds, the Pentium III demonstrated a slight lead over the 1 GHz in select mixed benchmarks, though retained edges in integer-heavy tasks. AMD's subsequent Thunderbird core, introduced in 2000 with and integrated 256 KB L2 cache, sustained competitiveness through higher clock scalability, while the Pentium III Tualatin revision in 2001 added 512 KB L2 but remained constrained by 's focus shifting toward the Pentium 4. processors up to 800 MHz were generally faster and priced lower than matching Pentium III equivalents, appealing to performance-per-dollar buyers. The rivalry extended to pricing and availability, where AMD capitalized on Intel's 1999-2000 production shortages—exacerbated by delays in chipsets like the i820—to gain traction, with Athlon systems often undercutting Intel equivalents by 10-20 percent while matching or exceeding speeds. This value proposition helped AMD expand its x86 microprocessor market share, rising to challenge Intel's dominance; Intel's desktop CPU share declined from 82.2 percent in 2000 to 78.7 percent in 2001, ceding ground primarily to AMD. Intel retained overall leadership through superior supply chains, OEM partnerships, and brand loyalty, but the Athlon forced price reductions and architectural refinements in Pentium III lines, intensifying industry competition. Beyond , lesser competitors like Cyrix's MediaGX and VIA's C3 processors targeted low-end segments with sub-500 MHz speeds and minimal performance, posing no significant threat to the Pentium III or in mainstream or high-end desktops during 1999-2003; these alternatives focused on embedded or budget embedded applications rather than direct rivalry in consumer performance computing. The -Pentium III contest ultimately highlighted AMD's architectural innovations against Intel's manufacturing scale, setting the stage for sustained x86 competition into the era.

Controversies and Criticisms

PSN Privacy Debates and Technical Responses

The Processor Serial Number (PSN), a 96-bit embedded in each Pentium III processor, was introduced by on February 26, 1999, primarily to facilitate secure in electronic commerce and network applications by verifying processor identity without relying solely on software-based methods. Privacy advocates argued that the PSN posed risks to user anonymity, as websites could potentially access and log the number via applets or controls, enabling persistent tracking across sessions and devices, even if users cleared or changed IP addresses, thereby undermining online protections. Criticism intensified in early 1999, with groups including the (EFF), Center for Democracy & Technology (CDT), and (ACLU) contending that the feature could facilitate unauthorized profiling by marketers or governments, as the PSN's hardware-level permanence made evasion difficult without specialized tools. On January 25, 1999, these organizations called for a of products until the PSN was permanently disabled, highlighting that even with user consent requirements, the identifier's accessibility via standard programming interfaces created inherent vulnerabilities. Demonstrations emerged showing software capable of retrieving the PSN without explicit user notification, fueling claims that 's safeguards were insufficient against surreptitious collection. In response to mounting pressure, announced on January 26, 1999, that PSN-equipped Pentium III processors would ship with the feature disabled by default, requiring users to enable it explicitly through provided software utilities or settings to mitigate unauthorized access risks. The company further developed detection tools allowing users to verify if PSN transmission was occurring and emphasized that the feature required affirmative for any legitimate use, such as in digital signatures, while asserting it did not inherently enable tracking without application-level . Critics, however, dismissed these measures as inadequate, arguing that the mere presence of the hardware ID invited abuse and that -level disabling was not universally implemented across motherboards, leaving non-technical users exposed. Ultimately, adoption of PSN remained low due to the controversy, and ceased including it in subsequent processor generations, such as the , reflecting a concession to priorities over specialized needs.

Architectural Limitations and Transition Challenges

The P6 microarchitecture of the Pentium III, originating from designs dating to the mid-1990s, excelled in instructions per clock (IPC) efficiency through its relatively short 10- to 14-stage pipeline and out-of-order execution, but encountered fundamental barriers to further clock frequency scaling as process nodes shrank from 250 nm to 130 nm. Stall penalties from branch mispredictions and cache misses did not diminish proportionally with reduced cycle times, while clock setup and hold requirements consumed a growing fraction of each cycle, limiting stable operation beyond approximately 1.4 GHz in desktop variants. For example, the 1.13 GHz Coppermine-core Pentium III suffered from instability, failing to complete benchmarks on non-Intel reference motherboards due to electrical and thermal constraints inherent to the aging pipeline design. Integration of (SSE) in the Katmai and subsequent cores represented an incremental addition to the Deschutes-era pipeline rather than a ground-up redesign, constraining opportunities for deeper optimizations in floating-point and vector workloads compared to rivals like AMD's , which featured larger on-die caches and superior integer throughput. These limitations manifested in diminishing performance returns per process generation; Tualatin-core Pentium IIIs at 130 nm achieved only marginal gains over Coppermine predecessors despite the shrink, underscoring the architecture's exhaustion for sustaining Moore's Law-like improvements in single-threaded performance. The transition to the 's in November 2000 aimed to circumvent P6's frequency ceiling via a deeper 20-stage , targeting clocks exceeding 2 GHz, but introduced severe challenges including a 20-40% IPC reduction relative to in branch-heavy and integer tasks, causing initial 1.3-1.5 GHz models to lag behind 1.13-1.4 GHz equivalents in real-world applications like compilation and workloads. This architectural pivot, motivated by projections of 10 GHz scalability through aggressive clocking, overlooked near-term power and heat dissipation barriers on 180 nm processes, with Willamette-core s drawing up to 75 W versus 30-50 W for comparable s, exacerbating cooling demands and system costs. Compatibility issues compounded the shift, as Socket 423 (later 478) platforms discarded Socket 370 support, requiring full motherboard replacements and software recompilation for exploitation, while Intel's GHz-centric marketing created mismatched consumer expectations for seamless upgrades. Ultimately, these trade-offs delayed 's viability until Northwood revisions and in 2002, validating P6 derivatives like the as more efficient stopgaps.

Successors, Evolution, and Enduring Legacy

Shift to Pentium 4 and Influence on Pentium M

The introduction of the Pentium 4 processor on November 20, 2000, initiated Intel's shift away from the Pentium III's P6 microarchitecture toward the new NetBurst architecture, which emphasized megahertz scaling through deeper pipelines and higher clock frequencies to achieve performance gains. Initial Pentium 4 models, such as the 1.3–1.5 GHz Willamette cores on 180 nm process, exhibited higher power consumption and thermal output compared to equivalent Pentium III variants, leading to mixed early reception despite marketing focused on clock speed milestones like the first 2 GHz desktop CPU in 2001. Pentium III desktop production, including the 130 nm Tualatin variants reaching up to 1.4 GHz, continued into early 2002 to support lingering demand in cost-sensitive segments, after which Intel fully transitioned desktop volume to Pentium 4 iterations like Northwood. This architectural pivot prioritized speculative execution and instruction-level parallelism over the Pentium III's balanced efficiency, but NetBurst's long pipelines resulted in diminished instructions per clock (IPC) efficiency, particularly evident in workloads favoring integer performance where Pentium III held advantages until Pentium 4 optimizations matured around 2002–2003. The transition accelerated amid competitive pressures from AMD's Athlon, prompting Intel to phase out Pentium III socket support (e.g., Socket 370) in favor of Socket 423/478 for Pentium 4, though legacy systems sustained some Pentium III deployments in embedded and low-end markets beyond 2002. The Pentium III's influence proved pivotal in mobile computing via the Pentium M series, launched in March 2003 as the Banias core, which reverted to a derivative of the P6 microarchitecture—specifically enhancing the Tualatin Pentium III's design with doubled L2 cache (up to 1 MB), improved branch prediction, and micro-op fusion to boost IPC without NetBurst's power penalties. Intel developed Pentium M in response to the mobile Pentium 4-M's excessive 30–50 W TDP and heat dissipation, which hindered battery life and thermal management in laptops, whereas Pentium M achieved comparable or superior performance per watt through P6-derived optimizations like enhanced decode logic and power gating absent in early NetBurst mobile implementations. This lineage extended Pentium III principles of and on-die caching into a mobile-optimized paradigm, incorporating select elements like trace cache for instruction fetching while rejecting its high-frequency focus, ultimately enabling (and successors like Dothan) to outperform mobile in real-world efficiency metrics such as SPECint and battery endurance tests by 2003–2004. The approach validated the P6 core's enduring viability for power-constrained environments, influencing Intel's later Core microarchitecture shift away from by 2006.

Role in Embedded Systems and Retro Computing Applications

The Intel Pentium III processor was adapted for embedded computing through specialized variants, including low-voltage models designed for power-constrained and thermally sensitive environments such as industrial controls, network appliances, and point-of-sale systems. In February 2000, Intel released dedicated Pentium III processors targeting applications like automated teller machines (ATMs) and smart network devices, featuring extended temperature ranges and compatibility with embedded motherboards for . These processors incorporated features like 512 KB on-die L2 cache and data prefetch logic to enhance performance in real-time tasks, with support for differential clocking and in select configurations. Low-power Pentium III iterations, such as the 800 MHz model (RJ80530KZ800512), were engineered for embedded platforms requiring minimal thermal output, often integrated into single-board computers (SBCs) like the NEPTUNE-P3 based on the chipset, which supported up to Pentium III or CPUs for rugged applications. Dual-processor boards, including the CPCI-786 from Force Computers introduced in 2002, utilized low-voltage Pentium III cores for and packet-switching backplanes, delivering up to 1.1 GHz per socket with enhanced reliability for mission-critical systems. The Tualatin-core variants extended availability into embedded markets until their discontinuation in 2012, with last orders in July 2012 and shipments ceasing in January 2013, underscoring their role in long-lifecycle industrial deployments. In retro computing, the Pentium III maintains niche appeal among hardware enthusiasts for reconstructing late-1990s to early-2000s PC configurations, owing to its architectural efficiency and compatibility with legacy x86 software ecosystems. Systems featuring overclocked Tualatin-core Pentium III processors, reaching up to 1.4 GHz, outperform early models (e.g., 1.5 GHz Willamette) in integer-heavy workloads and legacy applications, making them suitable for running unmodified , , or XP alongside period-specific games and without emulation overhead. Enthusiast communities value the processor's reliability, with no inherent failure modes reported beyond typical age-related degradation, enabling sustained operation in custom builds like dual- or quad-socket motherboards for authentic retro gaming rigs. Its Socket 370 form factor facilitates modular upgrades in vintage chassis, contributing to ongoing projects focused on preserving pre-multicore x86 heritage.

References

Add your contribution
Related Hubs
User Avatar
No comments yet.