Recent from talks
Nothing was collected or created yet.
AMD APU
View on Wikipedia
![]() A-series APU | |
| Release date | 2011 (Original); 2017 (Zen based) |
|---|---|
| Codename | Fusion Desna Ontario Zacate Llano Hondo Trinity Weatherford Richland Kaveri Godavari Kabini Temash Carrizo Bristol Ridge Raven Ridge Picasso Renoir Cezanne Phoenix IGP Wrestler WinterPark BeaverCreek |
| Architecture | AMD64 |
| Models |
|
| Cores | 1 to 8 |
| Transistors |
|
| API support | |
| OpenCL | 1.2 |
| OpenGL | 4.1+ |
| DirectX | Direct3D 11 Direct3D 12 |
| History | |
| Predecessor | Athlon II Sempron |
| Successor | Ryzen Zen-based Athlon |
AMD Accelerated Processing Unit (APU), formerly known as Fusion, is a series of 64-bit microprocessors from Advanced Micro Devices (AMD), combining a general-purpose AMD64 central processing unit (CPU) and 3D integrated graphics processing unit (IGPU) on a single die.
AMD announced the first generation APUs, Llano for high-performance and Brazos for low-power devices, in January 2011 and launched the first units on June 14.[1][2] The second generation Trinity for high-performance and Brazos-2 for low-power devices were announced in June 2012. The third generation Kaveri for high performance devices were launched in January 2014, while Kabini and Temash for low-power devices were announced in the summer of 2013. Since the launch of the Zen microarchitecture, Ryzen and Athlon APUs have released to the global market as Raven Ridge on the DDR4 platform, after Bristol Ridge a year prior.
AMD has also supplied semi-custom APUs for consoles starting with the release of Sony PlayStation 4 and Microsoft Xbox One eighth generation video game consoles.
History
[edit]The AMD Fusion project started in 2006 with the aim of developing a system on a chip that combined a CPU with a GPU on a single die. This effort was moved forward by AMD's acquisition of graphics chipset manufacturer ATI[3] in 2006. The project reportedly required three internal iterations of the Fusion concept to create a product deemed worthy of release.[3] Reasons contributing to the delay of the project include the technical difficulties of combining a CPU and GPU on the same die at a 45 nm process, and conflicting views on what the role of the CPU and GPU should be within the project.[4]
The first generation desktop and laptop APU, codenamed Llano, was announced on 4 January 2011 at the 2011 Consumer Electronics Show in Las Vegas and released shortly thereafter.[5][6] It featured K10 CPU cores and a Radeon HD 6000 series GPU on the same die on the FM1 socket. An APU for low-power devices was announced as the Brazos platform, based on the Bobcat microarchitecture and a Radeon HD 6000 series GPU on the same die.[7]
At a conference in January 2012, corporate fellow Phil Rogers announced that AMD would re-brand the Fusion platform as the Heterogeneous System Architecture (HSA), stating that "it's only fitting that the name of this evolving architecture and platform be representative of the entire, technical community that is leading the way in this very important area of technology and programming development."[8] However, it was later revealed that AMD had been the subject of a trademark infringement lawsuit by the Swiss company Arctic, who used the name "Fusion" for a line of power supply products.[9]
The second generation desktop and laptop APU, codenamed Trinity, was announced at AMD's 2010 Financial Analyst Day[10][11] and released in October 2012.[12] It featured Piledriver CPU cores and Radeon HD 7000 series GPU cores on the FM2 socket.[13] AMD released a new APU based on the Piledriver microarchitecture on 12 March 2013 for Laptops/Mobile and on 4 June 2013 for desktops under the codename Richland.[14] The second generation APU for low-power devices, Brazos 2.0, used exactly the same APU chip, but ran at higher clock speed and rebranded the GPU as Radeon HD 7000 series and used a new I/O controller chip.
Semi-custom chips were introduced in the Microsoft Xbox One and Sony PlayStation 4 video game consoles,[15][16] and subsequently in the Microsoft Xbox Series X|S and Sony PlayStation 5 consoles.
A third generation of the technology was released on 14 January 2014, featuring greater integration between CPU and GPU. The desktop and laptop variant is codenamed Kaveri, based on the Steamroller architecture, while the low-power variants, codenamed Kabini and Temash, are based on the Jaguar architecture.[17]
Since the introduction of Zen-based processors, AMD renamed their APUs as the Ryzen with Radeon Graphics and Athlon with Radeon Graphics, with desktop units assigned with G suffix on their model numbers (e.g. Ryzen 5 3400G & Athlon 3000G) to distinguish them from regular processors or with basic graphics and also to differentiate away from their former Bulldozer era A-series APUs. The mobile counterparts were always paired with Radeon Graphics regardless of suffixes.
Features
[edit]Heterogeneous System Architecture
[edit]AMD is a founding member of the Heterogeneous System Architecture (HSA) Foundation and is consequently actively working on developing HSA in cooperation with other members. The following hardware and software implementations are available in AMD's APU-branded products:
| Type | HSA feature | First implemented | Notes |
|---|---|---|---|
| Optimized Platform | GPU Compute C++ Support | 2012 Trinity APUs |
Support OpenCL C++ directions and Microsoft's C++ AMP language extension. This eases programming of both CPU and GPU working together to process support parallel workloads. |
| HSA-aware MMU | GPU can access the entire system memory through the translation services and page fault management of the HSA MMU. | ||
| Shared Power Management | CPU and GPU now share the power budget. Priority goes to the processor most suited to the current tasks. | ||
| Architectural Integration | Heterogeneous Memory Management: the CPU's MMU and the GPU's IOMMU share the same address space.[18][19] | 2014 PlayStation 4, Kaveri APUs |
CPU and GPU now access the memory with the same address space. Pointers can now be freely passed between CPU and GPU, hence enabling zero-copy. |
| Fully coherent memory between CPU and GPU | GPU can now access and cache data from coherent memory regions in the system memory, and also reference the data from CPU's cache. Cache coherency is maintained. | ||
| GPU uses pageable system memory via CPU pointers | GPU can take advantage of the shared virtual memory between CPU and GPU, and pageable system memory can now be referenced directly by the GPU, instead of being copied or pinned before accessing. | ||
| System Integration | GPU compute context switch | 2015 Carrizo APU |
Compute tasks on GPU can be context switched, allowing a multi-tasking environment and also faster interpretation between applications, compute and graphics. |
| GPU graphics pre-emption | Long-running graphics tasks can be pre-empted so processes have low latency access to the GPU. | ||
| Quality of service[18] | In addition to context switch and pre-emption, hardware resources can be either equalized or prioritized among multiple users and applications. |
Feature overview
[edit]The following table shows features of AMD's processors with 3D graphics, including APUs (see also: List of AMD processors with 3D graphics).
| Platform | High, standard and low power | Low and ultra-low power | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Codename | Server | Basic | Toronto | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Micro | Kyoto | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Desktop | Performance | Raphael | Phoenix | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Mainstream | Llano | Trinity | Richland | Kaveri | Kaveri Refresh (Godavari) | Carrizo | Bristol Ridge | Raven Ridge | Picasso | Renoir | Cezanne | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Entry | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Basic | Kabini | Dalí | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Mobile | Performance | Renoir | Cezanne | Rembrandt | Dragon Range | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Mainstream | Llano | Trinity | Richland | Kaveri | Carrizo | Bristol Ridge | Raven Ridge | Picasso | Renoir Lucienne |
Cezanne Barceló |
Phoenix | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Entry | Dalí | Mendocino | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Basic | Desna, Ontario, Zacate | Kabini, Temash | Beema, Mullins | Carrizo-L | Stoney Ridge | Pollock | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Embedded | Trinity | Bald Eagle | Merlin Falcon, Brown Falcon |
Great Horned Owl | Grey Hawk | Ontario, Zacate | Kabini | Steppe Eagle, Crowned Eagle, LX-Family |
Prairie Falcon | Banded Kestrel | River Hawk | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Released | Aug 2011 | Oct 2012 | Jun 2013 | Jan 2014 | 2015 | Jun 2015 | Jun 2016 | Oct 2017 | Jan 2019 | Mar 2020 | Jan 2021 | Jan 2022 | Sep 2022 | Jan 2023 | Jan 2011 | May 2013 | Apr 2014 | May 2015 | Feb 2016 | Apr 2019 | Jul 2020 | Jun 2022 | Nov 2022 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| CPU microarchitecture | K10 | Piledriver | Steamroller | Excavator | "Excavator+"[20] | Zen | Zen+ | Zen 2 | Zen 3 | Zen 3+ | Zen 4 | Bobcat | Jaguar | Puma | Puma+[21] | "Excavator+" | Zen | Zen+ | "Zen 2+" | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ISA | x86-64 v1 | x86-64 v2 | x86-64 v3 | x86-64 v4 | x86-64 v1 | x86-64 v2 | x86-64 v3 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Socket | Desktop | Performance | — | AM5 | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Mainstream | — | AM4 | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Entry | FM1 | FM2 | FM2+ | FM2+[a], AM4 | AM4 | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Basic | — | — | AM1 | — | FP5 | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Other | FS1 | FS1+, FP2 | FP3 | FP4 | FP5 | FP6 | FP7 | FL1 | FP7 FP7r2 FP8 |
FT1 | FT3 | FT3b | FP4 | FP5 | FT5 | FP5 | FT6 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| PCI Express version | 2.0 | 3.0 | 4.0 | 5.0 | 4.0 | 2.0 | 3.0 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| CXL | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Fab. (nm) | GF 32SHP (HKMG SOI) |
GF 28SHP (HKMG bulk) |
GF 14LPP (FinFET bulk) |
GF 12LP (FinFET bulk) |
TSMC N7 (FinFET bulk) |
TSMC N6 (FinFET bulk) |
CCD: TSMC N5 (FinFET bulk) cIOD: TSMC N6 (FinFET bulk) |
TSMC 4nm (FinFET bulk) |
TSMC N40 (bulk) |
TSMC N28 (HKMG bulk) |
GF 28SHP (HKMG bulk) |
GF 14LPP (FinFET bulk) |
GF 12LP (FinFET bulk) |
TSMC N6 (FinFET bulk) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Die area (mm2) | 228 | 246 | 245 | 245 | 250 | 210[22] | 156 | 180 | 210 | CCD: (2x) 70 cIOD: 122 |
178 | 75 (+ 28 FCH) | 107 | ? | 125 | 149 | ~100 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Min TDP (W) | 35 | 17 | 12 | 10 | 15 | 65 | 35 | 4.5 | 4 | 3.95 | 10 | 6 | 12 | 8 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max APU TDP (W) | 100 | 95 | 65 | 45 | 170 | 54 | 18 | 25 | 6 | 54 | 15 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max stock APU base clock (GHz) | 3 | 3.8 | 4.1 | 4.1 | 3.7 | 3.8 | 3.6 | 3.7 | 3.8 | 4.0 | 3.3 | 4.7 | 4.3 | 1.75 | 2.2 | 2 | 2.2 | 3.2 | 2.6 | 1.2 | 3.35 | 2.8 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max APUs per node[b] | 1 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max core dies per CPU | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max CCX per core die | 1 | 2 | 1 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max cores per CCX | 4 | 8 | 2 | 4 | 2 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max CPU[c] cores per APU | 4 | 8 | 16 | 8 | 2 | 4 | 2 | 4 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max threads per CPU core | 1 | 2 | 1 | 2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Integer pipeline structure | 3+3 | 2+2 | 4+2 | 4+2+1 | 1+3+3+1+2 | 1+1+1+1 | 2+2 | 4+2 | 4+2+1 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| i386, i486, i586, CMOV, NOPL, i686, PAE, NX bit, CMPXCHG16B, AMD-V, RVI, ABM, and 64-bit LAHF/SAHF | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| IOMMU[d] | — | v2 | v1 | v2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| BMI1, AES-NI, CLMUL, and F16C | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| MOVBE | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| AVIC, BMI2, RDRAND, and MWAITX/MONITORX | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| SME[e], TSME[e], ADX, SHA, RDSEED, SMAP, SMEP, XSAVEC, XSAVES, XRSTORS, CLFLUSHOPT, CLZERO, and PTE Coalescing | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| GMET, WBNOINVD, CLWB, QOS, PQE-BW, RDPID, RDPRU, and MCOMMIT | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| MPK, VAES | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| SGX | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| FPUs per core | 1 | 0.5 | 1 | 1 | 0.5 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Pipes per FPU | 2 | 2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| FPU pipe width | 128-bit | 256-bit | 80-bit | 128-bit | 256-bit | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| CPU instruction set SIMD level | SSE4a[f] | AVX | AVX2 | AVX-512 | SSSE3 | AVX | AVX2 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 3DNow! | 3DNow!+ | — | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| PREFETCH/PREFETCHW | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| GFNI | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| AMX | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| FMA4, LWP, TBM, and XOP | — | — | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| FMA3 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| AMD XDNA | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L1 data cache per core (KiB) | 64 | 16 | 32 | 32 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L1 data cache associativity (ways) | 2 | 4 | 8 | 8 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L1 instruction caches per core | 1 | 0.5 | 1 | 1 | 0.5 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max APU total L1 instruction cache (KiB) | 256 | 128 | 192 | 256 | 512 | 256 | 64 | 128 | 96 | 128 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L1 instruction cache associativity (ways) | 2 | 3 | 4 | 8 | 2 | 3 | 4 | 8 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L2 caches per core | 1 | 0.5 | 1 | 1 | 0.5 | 1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max APU total L2 cache (MiB) | 4 | 2 | 4 | 16 | 1 | 2 | 1 | 2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L2 cache associativity (ways) | 16 | 8 | 16 | 8 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max on-die L3 cache per CCX (MiB) | — | 4 | 16 | 32 | — | 4 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max 3D V-Cache per CCD (MiB) | — | 64 | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max total in-CCD L3 cache per APU (MiB) | 4 | 8 | 16 | 64 | 4 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max. total 3D V-Cache per APU (MiB) | — | 64 | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max. board L3 cache per APU (MiB) | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max total L3 cache per APU (MiB) | 4 | 8 | 16 | 128 | 4 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| APU L3 cache associativity (ways) | 16 | 16 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| L3 cache scheme | Victim | Victim | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max. L4 cache | — | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max stock DRAM support | DDR3-1866 | DDR3-2133 | DDR3-2133, DDR4-2400 | DDR4-2400 | DDR4-2933 | DDR4-3200, LPDDR4-4266 | DDR5-4800, LPDDR5-6400 | DDR5-5200 | DDR5-5600, LPDDR5x-7500 | DDR3L-1333 | DDR3L-1600 | DDR3L-1866 | DDR3-1866, DDR4-2400 | DDR4-2400 | DDR4-1600 | DDR4-3200 | LPDDR5-5500 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max DRAM channels per APU | 2 | 1 | 2 | 1 | 2 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max stock DRAM bandwidth (GB/s) per APU | 29.866 | 34.132 | 38.400 | 46.932 | 68.256 | 102.400 | 83.200 | 120.000 | 10.666 | 12.800 | 14.933 | 19.200 | 38.400 | 12.800 | 51.200 | 88.000 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| GPU microarchitecture | TeraScale 2 (VLIW5) | TeraScale 3 (VLIW4) | GCN 2nd gen | GCN 3rd gen | GCN 5th gen[23] | RDNA 2 | RDNA 3 | TeraScale 2 (VLIW5) | GCN 2nd gen | GCN 3rd gen[23] | GCN 5th gen | RDNA 2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| GPU instruction set | TeraScale instruction set | GCN instruction set | RDNA instruction set | TeraScale instruction set | GCN instruction set | RDNA instruction set | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max stock GPU base clock (MHz) | 600 | 800 | 844 | 866 | 1108 | 1250 | 1400 | 2100 | 2400 | 400 | 538 | 600 | ? | 847 | 900 | 1200 | 600 | 1300 | 1900 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Max stock GPU base GFLOPS[g] | 480 | 614.4 | 648.1 | 886.7 | 1134.5 | 1760 | 1971.2 | 2150.4 | 3686.4 | 102.4 | 86 | ? | ? | ? | 345.6 | 460.8 | 230.4 | 1331.2 | 486.4 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| 3D engine[h] | Up to 400:20:8 | Up to 384:24:6 | Up to 512:32:8 | Up to 704:44:16[24] | Up to 512:32:8 | 768:48:8 | 128:8:4 | 80:8:4 | 128:8:4 | Up to 192:12:8 | Up to 192:12:4 | 192:12:4 | Up to 512:?:? | 128:?:? | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| IOMMUv1 | IOMMUv2 | IOMMUv1 | ? | IOMMUv2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Video decoder | UVD 3.0 | UVD 4.2 | UVD 6.0 | VCN 1.0[25] | VCN 2.1[26] | VCN 2.2[26] | VCN 3.1 | ? | UVD 3.0 | UVD 4.0 | UVD 4.2 | UVD 6.2 | VCN 1.0 | VCN 3.1 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Video encoder | — | VCE 1.0 | VCE 2.0 | VCE 3.1 | — | VCE 2.0 | VCE 3.4 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| AMD Fluid Motion | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| GPU power saving | PowerPlay | PowerTune | PowerPlay | PowerTune[27] | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| TrueAudio | — | ? | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| FreeSync | 1 2 |
1 2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HDCP[i] | ? | 1.4 | 2.2 | 2.3 | ? | 1.4 | 2.2 | 2.3 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| PlayReady[i] | — | 3.0 not yet | — | 3.0 not yet | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Supported displays[j] | 2–3 | 2–4 | 3 | 3 (desktop) 4 (mobile, embedded) |
4 | 2 | 3 | 4 | 4 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
/drm/radeon[k][30][31] |
— | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
/drm/amdgpu[k][32] |
— | — | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
- ^ For FM2+ Excavator models: A8-7680, A6-7480 & Athlon X4 845.
- ^ A PC would be one node.
- ^ An APU combines a CPU and a GPU. Both have cores.
- ^ Requires firmware support.
- ^ a b Requires firmware support.
- ^ No SSE4. No SSSE3.
- ^ Single-precision performance is calculated from the base (or boost) core clock speed based on a FMA operation.
- ^ Unified shaders : texture mapping units : render output units
- ^ a b To play protected video content, it also requires card, operating system, driver, and application support. A compatible HDCP display is also needed for this. HDCP is mandatory for the output of certain audio formats, placing additional constraints on the multimedia setup.
- ^ To feed more than two displays, the additional panels must have native DisplayPort support.[29] Alternatively active DisplayPort-to-DVI/HDMI/VGA adapters can be employed.
- ^ a b DRM (Direct Rendering Manager) is a component of the Linux kernel. Support in this table refers to the most current version.
APU or Radeon Graphics branded platforms
[edit]AMD APUs have CPU modules, cache, and a discrete-class graphics processor, all on the same die using the same bus. This architecture allows for the use of graphics accelerators, such as OpenCL, with the integrated graphics processor.[34] The goal is to create a "fully integrated" APU, which, according to AMD, will eventually feature 'heterogeneous cores' capable of processing both CPU and GPU work automatically, depending on the workload requirement.[35]
TeraScale-based GPU
[edit]K10 architecture (2011): Llano
[edit]
- "Stars" AMD K10-cores[36]
- Integrated Evergreen/VLIW5-based GPU (branded Radeon HD 6000 series)
- Northbridge[18][19]
- PCIe[18][19]
- DDR3[18][19] memory controller to arbitrate between coherent and non-coherent memory requests.[37] The physical memory is partitioned between the GPU (up to 512 MB) and the CPU (the remainder).[37]
- Unified Video Decoder[18][19]
- AMD Eyefinity multi-monitor-support
The first generation APU, released in June 2011, was used in both desktops and laptops. It was based on the K10 architecture and built on a 32 nm process featuring two to four CPU cores on a thermal design power (TDP) of 65-100 W, and integrated graphics based on the Radeon HD 6000 series with support for DirectX 11, OpenGL 4.2 and OpenCL 1.2. In performance comparisons against the similarly priced Intel Core i3-2105, the Llano APU was criticised for its poor CPU performance[38] and praised for its better GPU performance.[39][40] AMD was later criticised for abandoning Socket FM1 after one generation.[41]
Bobcat architecture (2011): Ontario, Zacate, Desna, Hondo
[edit]- Bobcat-based CPU
- Evergreen/VLIW5-based GPU (branded Radeon HD 6000 series and Radeon HD 7000 series)
- Northbridge[18][19]
- PCIe[18][19] support.
- DDR3 SDRAM[18][19] memory controller to arbitrate between coherent and non-coherent memory requests.[37] The physical memory is partitioned between the GPU (up to 512 MB) and the CPU (the remainder).[37]
- Unified Video Decoder (UVD)[18][19]
The AMD Brazos platform was introduced on 4 January 2011, targeting the subnotebook, netbook and low power small form factor markets.[5] It features the 9-watt AMD C-Series APU (codename: Ontario) for netbooks and low power devices as well as the 18-watt AMD E-Series APU (codename: Zacate) for mainstream and value notebooks, all-in-ones and small form factor desktops. Both APUs feature one or two Bobcat x86 cores and a Radeon Evergreen Series GPU with full DirectX11, DirectCompute and OpenCL support including UVD3 video acceleration for HD video including 1080p.[5]
AMD expanded the Brazos platform on 5 June 2011 with the announcement of the 5.9-watt AMD Z-Series APU (codename: Desna) designed for the Tablet market.[42] The Desna APU is based on the 9-watt Ontario APU. Energy savings were achieved by lowering the CPU, GPU and northbridge voltages, reducing the idle clocks of the CPU and GPU as well as introducing a hardware thermal control mode.[42] A bidirectional turbo core mode was also introduced.
AMD announced the Brazos-T platform on 9 October 2012. It comprised the 4.5-watt AMD Z-Series APU (codenamed Hondo) and the A55T Fusion Controller Hub (FCH), designed for the tablet computer market.[43][44] The Hondo APU is a redesign of the Desna APU. AMD lowered energy use by optimizing the APU and FCH for tablet computers.[45][46]
The Deccan platform including Krishna and Wichita APUs were cancelled in 2011. AMD had originally planned to release them in the second half 2012.[47]
Piledriver architecture (2012): Trinity and Richland
[edit]- Piledriver-based CPU
- Northern Islands/VLIW4-based GPU (branded Radeon HD 7000 and 8000 series)
- Unified Northbridge – includes AMD Turbo Core 3.0, which enables automatic bidirectional power management between CPU modules and GPU. Power to the CPU and GPU is controlled automatically by changing the clock rate depending on the load. For example, for a non-overclocked A10-5800K APU the CPU frequency can change from 1.4 GHz to 4.2 GHz, and the GPU frequency can change from 304 MHz to 800 MHz. In addition, CC6 mode is capable of powering down individual CPU cores, while PC6 mode is able to lower the power on the entire rail.[48]
- AMD HD Media Accelerator[49] – includes AMD Perfect Picture HD, AMD Quick Stream technology, and AMD Steady Video technology.
- Display controllers: AMD Eyefinity-support for multi-monitor set-ups, HDMI, DisplayPort 1.2, DVI
- Trinity
The first iteration of the second generation platform, released in October 2012, brought improvements to CPU and GPU performance to both desktops and laptops. The platform features 2 to 4 Piledriver CPU cores built on a 32 nm process with a TDP between 65 W and 100 W, and a GPU based on the Radeon HD7000 series with support for DirectX 11, OpenGL 4.2, and OpenCL 1.2. The Trinity APU was praised for the improvements to CPU performance compared to the Llano APU.[50]
- Richland
- "Enhanced Piledriver" CPU cores[51]
- Temperature Smart Turbo Core technology. An advancement of the existing Turbo Core technology, which allows internal software to adjust the CPU and GPU clock speed to maximise performance within the constraints of the Thermal design power of the APU.[52]
- New low-power consumption CPUs with only 45 W TDP[53]
The release of this second iteration of this generation was 12 March 2013 for mobile parts and 5 June 2013 for desktop parts.
Graphics Core Next-based GPU
[edit]Jaguar architecture (2013): Kabini and Temash
[edit]- Jaguar-based CPU
- Graphics Core Next 2nd Gen-based GPU
- Socket AM1 and Socket FT3 support
- Target segment desktop and mobile
In January 2013 the Jaguar-based Kabini and Temash APUs were unveiled as the successors of the Bobcat-based Ontario, Zacate and Hondo APUs.[54][55][56] The Kabini APU is aimed at the low-power, subnotebook, netbook, ultra-thin and small form factor markets, while the Temash APU is aimed at the tablet, ultra-low power and small form factor markets.[56] The two to four Jaguar cores of the Kabini and Temash APUs feature numerous architectural improvements regarding power requirement and performance, such as support for newer x86-instructions, a higher IPC count, a CC6 power state mode and clock gating.[57][58][59] Kabini and Temash are AMD's first, and also the first ever quad-core x86 based SoCs.[60] The integrated Fusion Controller Hubs (FCH) for Kabini and Temash are codenamed "Yangtze" and "Salton", respectively.[61] The Yangtze FCH features support for two USB 3.0 ports, two SATA 6 Gbit/s ports, as well as the xHCI 1.0 and SD/SDIO 3.0 protocols for SD-card support.[61] Both chips feature DirectX 11.1-compliant GCN-based graphics as well as numerous HSA improvements.[54][55] They were fabricated at a 28 nm process in an FT3 ball grid array package by Taiwan Semiconductor Manufacturing Company (TSMC), and were released on 23 May 2013.[57][62][63]
The PlayStation 4 and Xbox One were revealed to both be powered by 8-core semi-custom Jaguar-derived APUs.
Steamroller architecture (2014): Kaveri
[edit]
- Steamroller-based CPU with 2–4 cores[64]
- Graphics Core Next 2nd Gen-based GPU with 192–512 shader processors[65]
- 15–95 W thermal design power[64][65]
- Fastest mobile processor of this series: AMD FX-7600P (35 W)
- Fastest desktop processor of this series: AMD A10-7850K (95 W)
- Socket FM2+ and Socket FP3[64]
- Target segment desktop and mobile
- Heterogeneous System Architecture-enabled zero-copying through pointer passing
The third generation of the platform, codenamed Kaveri, was partly released on 14 January 2014.[66] Kaveri contains up to four Steamroller CPU cores clocked to 3.9 GHz with a turbo mode of 4.1 GHz, up to a 512-core Graphics Core Next GPU, two decode units per module instead of one (which allows each core to decode four instructions per cycle instead of two), AMD TrueAudio,[67] Mantle API,[68] an on-chip ARM Cortex-A5 MPCore,[69] and will release with a new socket, FM2+.[70] Ian Cutress and Rahul Garg of Anandtech asserted that Kaveri represented the unified system-on-a-chip realization of AMD's acquisition of ATI. The performance of the 45 W A8-7600 Kaveri APU was found to be similar to that of the 100 W Richland part, leading to the claim that AMD made significant improvements in on-die graphics performance per watt;[64] however, CPU performance was found to lag behind similarly specified Intel processors, a lag that was unlikely to be resolved in the Bulldozer family APUs.[64] The A8-7600 component was delayed from a Q1 launch to an H1 launch because the Steamroller architecture components allegedly did not scale well at higher clock speeds.[71]
AMD announced the release of the Kaveri APU for the mobile market on 4 June 2014 at Computex 2014,[65] shortly after the accidental announcement on the AMD website on 26 May 2014.[72] The announcement included components targeted at the standard voltage, low-voltage, and ultra-low voltage segments of the market. In early-access performance testing of a Kaveri prototype laptop, AnandTech found that the 35 W FX-7600P was competitive with the similarly priced 17 W Intel i7-4500U in synthetic CPU-focused benchmarks, and was significantly better than previous integrated GPU systems on GPU-focused benchmarks.[73] Tom's Hardware reported the performance of the Kaveri FX-7600P against the 35 W Intel i7-4702MQ, finding that the i7-4702MQ was significantly better than the FX-7600P in synthetic CPU-focused benchmarks, whereas the FX-7600P was significantly better than the i7-4702MQ's Intel HD 4600 iGPU in the four games that could be tested in the time available to the team.[65]
Puma architecture (2014): Beema and Mullins
[edit]- Puma-based CPU
- Graphics Core Next 2nd Gen-based GPU with 128 shader processors
- Socket FT3
- Target segment ultra-mobile
Puma+ architecture (2015): Carrizo-L
[edit]- Puma+-based CPU with 2–4 cores[74]
- Graphics Core Next 2nd Gen-based GPU with 128 shader processors[74]
- 12–25 W configurable TDP[74]
- Socket FP4 support; pin-compatible with Carrizo[74]
- Target segment mobile and ultra-mobile
Excavator architecture (2015): Carrizo
[edit]- Excavator-based CPU with 4 cores[75]
- Graphics Core Next 3rd Gen-based GPU
- Memory controller supports DDR3 SDRAM at 2133 MHz and DDR4 SDRAM at 1866 MHz[75]
- 15–35 W configurable TDP (with the 15 W cTDP unit having reduced performance)[75]
- Integrated southbridge[75]
- Socket FP4
- Target segment mobile
- Announced by AMD on YouTube (19 November 2014)[76]
Steamroller architecture (Q2–Q3 2015): Godavari
[edit]- Update of the desktop Kaveri series with higher clock frequencies or smaller power envelope
- Steamroller-based CPU with 4 cores[77]
- Graphics Core Next 2nd Gen-based GPU
- Memory controller supports DDR3 SDRAM at 2133 MHz
- 65/95 W TDP with support for configurable TDP
- Socket FM2+
- Target segment desktop
- Listed since Q2 2015
Excavator architecture (2016): Bristol Ridge and Stoney Ridge
[edit]
- Excavator-based CPU with 2–4 cores
- 1 MB L2 cache per module
- Graphics Core Next 3rd Gen-based GPU[78][79][80][81]
- Memory controller supports DDR4 SDRAM
- 15/35/45/65 W TDP with support for configurable TDP
- 28 nm
- Socket AM4 for desktop
- Target segment desktop, mobile and ultra-mobile
Zen architecture (2017): Raven Ridge
[edit]- Zen-based CPU cores[82] with simultaneous multithreading (SMT)
- 512 KB L2 cache per core
- 4 MB L3 cache
- Precision Boost 2[83]
- Graphics Core Next 5th Gen "Vega"-based GPU[84]
- Memory controller supports DDR4 SDRAM
- Video Core Next as successor of UVD+VCE
- 14 nm at GlobalFoundries
- Socket FP5 for mobile[85] and AM4 for desktop
- Target segment desktop and mobile
- Listed since Q4 2017
Zen+ architecture (2018): Picasso
[edit]- Zen+-based CPU microarchitecture[86]
- Refresh of Raven Ridge on 12 nm with improved latency and efficiency/clock frequency. Features similar to Raven Ridge
- Launched April 2018
Zen 2 architecture (2019): Renoir
[edit]- Zen 2-based CPU microarchitecture[85]
- Graphics Core Next 5th Gen "Vega"-based GPU[87]
- VCN 2.1[87]
- Memory controller supports DDR4 and LPDDR4X SDRAM up to 4266 MHz[87]
- 15 and 45 W TDP for mobile and 35 and 65 W TDP for desktop[85]
- 7 nm at TSMC[88]
- Socket FP6 for mobile and socket AM4 for desktop[85]
- Release July 2019[87][88]
Zen 3 architecture (2020): Cezanne
[edit]- Zen 3-based CPU microarchitecture[89]
- Graphics Core Next 5th Gen "Vega"-based GPU[90]
- Memory controller supports DDR4 and LPDDR4X SDRAM up to 4266 MHz[90][89]
- Up to 45 W TDP for mobile;[91] 35W to 65W TDP for desktop.[90]
- 7 nm at TSMC[89]
- Socket AM4 for desktop[90]
- Socket FP6 for mobile
- Released for mobile early 2021[89] with desktop counterparts released in November 2020.[90]
RDNA-based GPU
[edit]Zen 3+ architecture (2022): Rembrandt
[edit]- Zen 3+ based CPU microarchitecture[92]
- RDNA 2-based GPU[92]
- Memory controller supports DDR5-4800 and LPDDR5-6400[92]
- Up to 45 W TDP for mobile
- Node: TSMC N6[92]
- Socket FP7 for mobile
- Released for mobile early 2022[92]
Zen 4 architecture (2023): Phoenix Point
[edit]- Zen 4 based CPU microarchitecture[93]
- RDNA 3-based GPU with up to 12 CU[93]
- Memory controller supports DDR5-5600 and LPDDR5x-7500
- XDNA-powered NPU with up to 16 TOPS[94]
- Up to 54 W TDP for mobile
- Up to 65 W TDP for desktop[94]
- Node: TSMC N4[93]
- Sockets FP7, FP7r2 & FP8 for mobile
- Socket AM5 for desktop
- Released for mobiles early 2023[93]
- Released for desktop early 2024[94]
Zen 5 architecture (2024): Strix Point
[edit]- Zen 5 based CPU microarchitecture with a mix of Zen 5 and 5c cores[95]
- RDNA 3.5-based GPU[95] with up to 16 CU
- Memory controller supports DDR5-5600 and LPDDR5x-8000
- XDNA2-powered NPU with up to 55 TOPS[95]
- Up to 54 W TDP for mobile
- Node: TSMC N4[95]
- Socket FP8 for mobile
- Released for mobile early 2024
See also
[edit]References
[edit]- ^ "AMD A4-3300M". X86 CPUS' GUIDE. Retrieved 4 June 2025.
- ^ "AMD A6-3400M". X86 CPUS' GUIDE. Retrieved 4 June 2025.
- ^ a b "The rise and fall of AMD: A company on the ropes". 23 April 2013. Retrieved 20 December 2013.
- ^ William Van Winkle (13 August 2012). "AMD Fusion: How It Started, Where It's Going, And What It Means". Retrieved 20 December 2013.
- ^ a b c AMD (4 January 2011). "AMD Fusion APU Era Begins". Retrieved 24 August 2013.
- ^ Stokes, Jon (8 February 2010). "AMD reveals Fusion CPU+GPU, to challenge Intel in laptops". Ars Technica. Archived from the original on 10 February 2010. Retrieved 9 February 2010.
- ^ Kowaliski, Cyril (9 November 2010). "A closer look at AMD's Brazos platform". The Tech Report. Retrieved 15 June 2017.
- ^ "AMD ditches Fusion branding". Bit-tech. Retrieved 24 July 2013.
- ^ "AMD targeted by Arctic over Fusion brand". Bit-tech. Retrieved 24 July 2013.
- ^ Cyril Kowaliski (9 November 2010). "AMD begins shipping Brazos, announces Bulldozer-based APUs". The Tech Report. Retrieved 7 January 2014.
- ^ Rick Bergman (9 November 2010). "AMD 2010 Financial Analyst Day". Advanced Micro Devices, Inc. Archived from the original (PDF) on 18 January 2016. Retrieved 7 January 2014.
- ^ "AMD reveals its 2012-2013 roadmap, promises 28 nm chips across the board by 2013". Engadget. 2 February 2012. Archived from the original on 2 March 2019. Retrieved 22 August 2012.
- ^ Kingsley-Hughes, Adrian. "Building an AMD 'Trinity' desktop PC - ZDNet". ZDNet.
- ^ "AMD launches "Richland" A-Series APUs: slight speed bump, better power management 404". Archived from the original on 19 July 2013.
- ^ Taylor, John (21 February 2013). "AMD and The Sony PS4. Allow Me To Elaborate". Archived from the original on 26 May 2013. Retrieved 30 August 2013.
- ^ "XBox One Revealed". Wired. 21 May 2013. Retrieved 23 May 2013.
- ^ Darren Murph (8 January 2013). "AMD announces Temash, Kabini, Richland, and Kaveri APUs at CES 2013 (video)". Retrieved 20 December 2013.
- ^ a b c d e f g h i j "The programmer's guide to the APU galaxy" (PDF).
- ^ a b c d e f g h i Shimpi, Anand Lal. "AMD Outlines HSA Roadmap: Unified Memory for CPU/GPU in 2013, HSA GPUs in 2014". www.anandtech.com. Archived from the original on 4 February 2012.
- ^ "AMD Announces the 7th Generation APU: Excavator mk2 in Bristol Ridge and Stoney Ridge for Notebooks". 31 May 2016. Retrieved 3 January 2020.
- ^ "AMD Mobile "Carrizo" Family of APUs Designed to Deliver Significant Leap in Performance, Energy Efficiency in 2015" (Press release). 20 November 2014. Retrieved 16 February 2015.
- ^ "The Mobile CPU Comparison Guide Rev. 13.0 Page 5 : AMD Mobile CPU Full List". TechARP.com. Retrieved 13 December 2017.
- ^ a b "AMD VEGA10 and VEGA11 GPUs spotted in OpenCL driver". VideoCardz.com. Retrieved 6 June 2017.
- ^ Cutress, Ian (1 February 2018). "Zen Cores and Vega: Ryzen APUs for AM4 – AMD Tech Day at CES: 2018 Roadmap Revealed, with Ryzen APUs, Zen+ on 12nm, Vega on 7nm". Anandtech. Retrieved 7 February 2018.
- ^ Larabel, Michael (17 November 2017). "Radeon VCN Encode Support Lands in Mesa 17.4 Git". Phoronix. Retrieved 20 November 2017.
- ^ a b "AMD Ryzen 5000G 'Cezanne' APU Gets First High-Res Die Shots, 10.7 Billion Transistors In A 180mm2 Package". wccftech. 12 August 2021. Retrieved 25 August 2021.
- ^ Tony Chen; Jason Greaves, "AMD's Graphics Core Next (GCN) Architecture" (PDF), AMD, retrieved 13 August 2016
- ^ "A technical look at AMD's Kaveri architecture". Semi Accurate. Retrieved 6 July 2014.
- ^ "How do I connect three or More Monitors to an AMD Radeon™ HD 5000, HD 6000, and HD 7000 Series Graphics Card?". AMD. Retrieved 8 December 2014.
- ^ Airlie, David (26 November 2009). "DisplayPort supported by KMS driver mainlined into Linux kernel 2.6.33". Retrieved 16 January 2016.
- ^ "Radeon feature matrix". freedesktop.org. Retrieved 10 January 2016.
- ^ Deucher, Alexander (16 September 2015). "XDC2015: AMDGPU" (PDF). Retrieved 16 January 2016.
- ^ a b Michel Dänzer (17 November 2016). "[ANNOUNCE] xf86-video-amdgpu 1.2.0". lists.x.org.
- ^ APU101_Final_Jan 2011.pdf
- ^ Shimpi, Anand Lal. "AMD Outlines HSA Roadmap: Unified Memory for CPU/GPU in 2013, HSA GPUs in 2014". AnandTech. Archived from the original on 4 February 2012.
- ^ "AMD Llano core". Cpu-world.com. 17 March 2014. Retrieved 24 March 2014.
- ^ a b c d Kanter, David (27 June 2011). "AMD Fusion Architecture and Llano".
- ^ Anand Lal Shimpi (30 June 2011). "The AMD A8-3850 Review: Llano on the Desktop". Anandtech. Archived from the original on 16 June 2013. Retrieved 12 January 2014.
- ^ "Conclusion - AMD A8-3850 Review: Llano Rocks Entry-Level Desktops". 30 June 2011.
- ^ Shimpi, Anand Lal. "The AMD A8-3850 Review: Llano on the Desktop". AnandTech. Archived from the original on 3 July 2011.
- ^ Shimpi, Anand Lal. "AMD A10-5800K & A8-5600K Review: Trinity on the Desktop, Part 1". AnandTech. Archived from the original on 29 September 2012.
- ^ a b Nita, Sorin (1 June 2011). "AMD Releases More Details Regarding the Desna Tablet APU". Retrieved 20 March 2013.
- ^ AMD (9 October 2013). "New AMD Z-Series APU for Tablets Enables Immersive Experience for Upcoming Microsoft Windows 8 Platforms". Retrieved 20 March 2013.
- ^ Shvets, Anthony (10 October 2012). "AMD announces Z-60 APU for tablets".
- ^ Hruska, Joel (9 October 2012). "AMD's Hondo Z-Series APU To Challenge Intel's Atom In Windows 8 Tablet Market". Archived from the original on 23 September 2020. Retrieved 20 March 2013.
- ^ Shilov, Anton (9 October 2012). "AMD Introduces Its First Accelerated Processing Unit for Media Tablets". Archived from the original on 9 February 2013. Retrieved 20 March 2013.
- ^ Demerjian, Charlie (15 November 2011). "Exclusive: AMD kills Wichita and Krishna". SemiAccurate. Retrieved 22 August 2012.
- ^ "CPU + GPU = APU: East Meets West". 14 June 2011. Retrieved 1 September 2013.
- ^ "AMD's 2nd Generation APU, Codenamed "Trinity," Will Enable Superior Multimedia Experience for Our "Connected" Generation". Archived from the original on 7 April 2013.
- ^ Shimpi, Anand Lal. "The AMD A8-3850 Review: Llano on the Desktop". AnandTech. Archived from the original on 16 June 2013.
- ^ "AMD Officially Announces Third Generation Richland A-Series Mobile APUs - 50% Faster GPU Than Intel Core i7 Mobile". 12 March 2013.
- ^ "New Details Revealed on AMD's Upcoming Richland Chips". 12 March 2013.
- ^ "AMD A10-Series A10-6700T — AD670TYHA44HL / AD670TYHHLBOX". Cpu-world.com. Retrieved 10 November 2013.
- ^ a b SKYMTL (9 January 2013). "Richland, Kaveri, Kabini & Temash; AMD's 2013 APU Lineup Examined". Hardwarecanucks. Retrieved 23 March 2013.
- ^ a b Halfacree, Gareth (8 January 2013). "AMD unveils new APUs, SoCs and Radeon HD 8000 Series". Bit-Tech. Retrieved 23 March 2013.
- ^ a b Lal Shimpi, Anand (2 February 2012). "AMD's 2012 - 2013 Client CPU/GPU/APU Roadmap Revealed". AnandTech. Archived from the original on 1 January 2013. Retrieved 8 August 2012.
- ^ a b Shilov, Anton (2 January 2013). "AMD to Officially Roll-Out "Kabini" and "Temash" Low-Power APUs This Quarter". X-bit labs. Archived from the original on 17 January 2013. Retrieved 21 March 2013.
- ^ Shilov, Anton (24 July 2013). "AMD's New Low-Power Micro-Architecture to Support AVX, BMI Other New Instructions". X-bit labs. Archived from the original on 9 February 2013. Retrieved 21 March 2013.
- ^ Paul, Donald (21 October 2012). "Leaked details of the future some Kabini APU AMD". Technewspedia. Archived from the original on 31 August 2014. Retrieved 21 March 2013.
- ^ Paine, Steve Chippy (9 January 2013). "AMD Shares SoC Line-Up for 2013. Kabini is for Ultrathins". Ultrabooknews. Archived from the original on 2 July 2014. Retrieved 21 March 2013.
- ^ a b Abazovic, Fuad (24 January 2013). "Kabini chipset is Yangtze". Fudzilla. Retrieved 21 March 2013.
- ^ Hruska, Paul (14 January 2013). "AMD quietly confirms 28 nm Kabini, Temash chips are being built at TSMC". Extremetech. Retrieved 21 March 2013.
- ^ "AMDs sparsame Mobilprozessoren Kabini und Temash legen los". 23 May 2013. Retrieved 31 August 2013.
- ^ a b c d e "AMD Kaveri Review: A8-7600 and A10-7850K Tested". Anandtech. Archived from the original on 8 February 2014. Retrieved 20 May 2014.
- ^ a b c d "AMD FX-7600P Kaveri Review: FX Rides Again...In A Mobile APU?". Tom's Hardware. Archived from the original on 8 June 2014. Retrieved 8 June 2014.
- ^ "AnandTech Portal | AMD Kaveri APU Launch Details: Desktop, January 14th". Anandtech.com. Archived from the original on 12 November 2013. Retrieved 13 January 2014.
- ^ ChrisFiebelkorn on 3 Dec 2013 (2 December 2013). "AMD A10 Kaveri APU Details Leaked". HotHardware. Retrieved 13 January 2014.
{{cite web}}: CS1 maint: numeric names: authors list (link) - ^ Dave_HH on 14 Nov 2013 (13 November 2013). "How AMD's Mantle Will Redefine Gaming, AMD Hardware Not Required". HotHardware. Retrieved 13 January 2014.
{{cite web}}: CS1 maint: numeric names: authors list (link) - ^ "AMD and ARM Fusion redefine beyond x86". Archived from the original on 5 November 2013. Retrieved 20 July 2012.
- ^ "AMD's Next-Gen "Kaveri" APUs Will Require New Mainboards - X-bit labs". Archived from the original on 7 June 2013. Retrieved 31 May 2013.
- ^ "Perils of a paper launch: AMD's A8-7600 pushed back to late 2014". Extreme Tech. 9 April 2014. Retrieved 20 May 2014.
- ^ "AMD Publishes Mobile Kaveri Specifications". Anandtech. Archived from the original on 28 May 2014. Retrieved 29 May 2014.
- ^ "AMD Launches Mobile Kaveri APUs". AnandTech. Archived from the original on 4 June 2014. Retrieved 8 June 2014.
- ^ a b c d "AMD's Carrizo-L APUs Unveiled: 12-25W Quad Core Puma+". AnandTech. Archived from the original on 13 May 2015. Retrieved 1 September 2015.
- ^ a b c d "AMD Details Carrizo APUs Energy Efficient Design at Hot Chips 2015 – 28nm Bulk High Density Design With 3.1 Billion Transistors, 250mm2 Die". WCCFTech. 26 August 2015. Retrieved 1 September 2015.
- ^ "Preview AMD's next gen APU (Carrizo)". YouTube. Archived from the original on 20 November 2014. Retrieved 21 November 2014.
- ^ "PC gaming hardware | PC Gamer". Pcgamer.
- ^ Shilov, Anton. "AMD preps 'Bristol Ridge' APUs: 'Carrizo' for desktops". KitGuru. Retrieved 5 April 2016.
- ^ Cutress, Ian (5 April 2016). "AMD Pre-Announces Bristol Ridge in Notebooks: The 7th Generation APU". AnandTech.com. AnandTech.com. Archived from the original on 5 April 2016. Retrieved 5 April 2016.
- ^ Kampman, Jeff (5 April 2016). "AMD lifts the curtain a little bit on its Bristol Ridge APUs". TechReport.com. Retrieved 5 April 2016.
- ^ Cutress, Ian (1 June 2016). "AMD Announces 7th Generation APU". Anandtech.com. Archived from the original on 17 November 2016. Retrieved 1 June 2016.
- ^ Larabel, Michael (13 December 2016). "AMD Reveals More Zen CPU Details, Officially Known As Ryzen, No Linux Details Yet". Phoronix. Retrieved 13 December 2016.
- ^ Hallock, Robert (27 November 2017). "Understanding Precision Boost 2 in AMD SenseMI technology". AMD. Retrieved 19 December 2019.
- ^ Ferreira, Bruno (16 May 2017). "Ryzen Mobile APUs are coming to a laptop near you". Tech Report. Retrieved 16 May 2017.
- ^ a b c d Mujtaba, Hassan (18 December 2019). "AMD Ryzen 4000 APU Lineup For Desktop & Mobility Platforms Leaked". Wccftech. Retrieved 19 December 2019.
- ^ Cutress, Ian (6 January 2019). "AMD at CES 2019: Ryzen Mobile 3000-Series Launched, 2nd Gen Mobile at 15W and 35W, and Chromebooks". anandtech.com. AnandTech. Archived from the original on 6 January 2019. Retrieved 12 November 2019.
- ^ a b c d btarunr (3 September 2019). "AMD "Renoir" APU to Support LPDDR4X Memory and New Display Engine". TechPowerUp. Retrieved 19 December 2019.
- ^ a b Pirzada, Usman (11 November 2019). "AMD Renoir APU Launching CES 2020, Will Destroy The NVIDIA MX 250 And Iris Pro Graphics". Wccftech. Retrieved 19 December 2019.
- ^ a b c d Anandtech. "AMD Launches Ryzen 5000 Mobile: Zen 3 and Cezanne for Notebooks". Anandtech. Archived from the original on 12 January 2021. Retrieved 18 January 2021.
- ^ a b c d e Anandtech. "AMD Ryzen 5000G APUs: OEM Only For Now, Full Release Later This Year". Anandtech. Archived from the original on 13 April 2021. Retrieved 15 April 2021.
- ^ AMD. "AMD Ryzen Mobile Processors with Radeon Graphics". AMD. Retrieved 18 January 2021.
- ^ a b c d e Anandtech. "AMD Announces Ryzen 6000 Mobile CPUs for Laptops: Zen3+ on 6nm with RDNA2 Graphics". Archived from the original on 4 January 2022.
- ^ a b c d Discuss, btarunr (5 January 2023). "AMD Announces Ryzen 7040 "Phoenix Point" Mobile Processor: 4nm, Zen 4, RDNA3, XDNA". TechPowerUp. Retrieved 2 October 2025.
- ^ a b c Paul Alcorn (8 January 2024). "AMD launches Ryzen 8000G 'Phoenix' APUs, brings AI to the desktop PC — reveals Zen 4c clocks for the first time". Tom's Hardware. Retrieved 2 October 2025.
- ^ a b c d Hassam Nasir Contributions from Paul Alcorn (6 January 2025). "AMD launches Ryzen AI 300 and 200 series chips for laptops". Tom's Hardware. Retrieved 2 October 2025.
External links
[edit]- HSA Heterogeneous System Architecture Overview on YouTube by Vinod Tipparaju at SC13 in November 2013
- HSA and the software ecosystem
- HSA Archived 5 March 2016 at the Wayback Machine
AMD APU
View on GrokipediaIntroduction
Definition and Purpose
An Accelerated Processing Unit (APU) is a system on a chip (SoC) developed by AMD that integrates x86 CPU cores and a discrete-level GPU on a single die, enabling enhanced efficiency and performance for general computing tasks.[1] This design combines central processing capabilities with graphics acceleration in a unified package, distinct from traditional setups that pair separate CPU and discrete GPU components.[1] The primary purpose of an AMD APU is to facilitate seamless collaboration between the CPU and GPU through shared system memory, which allows unified memory access and minimizes data transfer latency between the processors.[6] This integration reduces overall power consumption and manufacturing costs compared to discrete CPU-GPU configurations, while simplifying system design by eliminating the need for high-bandwidth interconnects like PCIe for inter-processor communication.[1] Additionally, it promotes energy-efficient operation suitable for mobile and embedded applications without sacrificing computational versatility.[7] At its core, an AMD APU consists of AMD64-compatible CPU cores for general-purpose computing, an integrated Radeon-branded GPU for graphics and parallel processing, and shared resources such as a memory controller and I/O interfaces that enable cohesive operation.[1] The CPU handles sequential tasks and system management, while the GPU accelerates vectorized workloads, with both accessing the same memory pool to optimize data sharing.[6] AMD introduced the APU under the "Fusion" branding in 2011, later transitioning to the A-Series designation for consumer and embedded products, and subsequently incorporating them into the Ryzen lineup with integrated Radeon Graphics starting in 2018.[1][8][9]Significance and Applications
AMD Accelerated Processing Units (APUs) hold significant market importance in budget and mid-range laptops, all-in-one PCs, and embedded systems, where their integrated design facilitates compact, cost-effective solutions without the need for discrete graphics cards.[10] This enables the creation of thin-and-light laptops and space-constrained devices like all-in-one desktops, which prioritize portability and affordability over high-end performance.[11] In the embedded sector, APUs such as the AMD Embedded G-Series have been pivotal since their introduction, providing flexible platforms for industrial applications while adhering to stringent size and power requirements.[12] Key applications of APUs span entry-level gaming, content creation, AI acceleration, gaming consoles, and industrial or automotive embedded systems. In entry-level gaming, APUs like the Ryzen AI Max Series deliver capable integrated graphics for casual play without additional hardware.[13] For content creation tasks such as video editing, APUs support efficient processing through combined CPU and GPU resources. Recent models incorporate neural processing units (NPUs) for AI acceleration, enabling on-device inference in laptops and edge devices.[14] Iconic examples include the custom Jaguar-based APUs powering the PlayStation 4 and [Xbox One](/page/Xbox One) consoles, which integrated CPU and GPU capabilities to drive immersive gaming experiences.[15] In industrial and automotive domains, APUs like the Ryzen Embedded V2000A Series handle real-time sensor data processing for advanced driver-assistance systems (ADAS) and in-vehicle infotainment.[16] The primary advantages of APUs stem from their space-saving integration of CPU and GPU on a single die, which reduces overall system complexity and board real estate compared to discrete configurations.[17] This integration enhances power efficiency, with mobile APUs typically operating in a TDP range of 15W to 65W, contributing to longer battery life in laptops and lower thermal demands.[18] Such scalability allows original equipment manufacturers (OEMs) to tailor APUs for diverse form factors, from ultrabooks to rugged embedded units, while maintaining consistent performance profiles.[19] APUs address key challenges in traditional CPU-GPU setups by mitigating bandwidth bottlenecks through direct on-chip interconnects, enabling faster data sharing between processing elements. This is particularly beneficial for hybrid workloads like video encoding, where the CPU and GPU collaborate seamlessly—facilitated briefly by technologies such as Heterogeneous System Architecture (HSA)—to optimize throughput without excessive memory transfers.[20][21]History
Origins and Fusion Initiative
The origins of the AMD Accelerated Processing Unit (APU) trace back to AMD's acquisition of ATI Technologies in October 2006, which provided the graphics expertise necessary to pursue integrated CPU-GPU designs.[22] The $5.4 billion deal, completed on October 24, 2006, merged AMD's microprocessor capabilities with ATI's GPU technology, enabling the company to develop unified processor architectures.[22] Immediately following the acquisition, AMD announced its "Fusion" initiative on October 25, 2006, aiming to create processors that combined central processing units (CPUs) and graphics processing units (GPUs) on a single die or package.[22] The primary goals of the Fusion initiative were to deliver discrete graphics-level performance within an integrated solution, thereby improving efficiency and reducing power consumption for mainstream computing.[23] Targeted at laptops and desktops, Fusion sought to provide enhanced visual computing experiences without the need for separate discrete GPUs, positioning AMD to compete directly with Intel's evolving integrated graphics offerings in its Core i-series processors.[23][24] This integration was envisioned to enable modular designs that leveraged both CPU and GPU compute capabilities, with initial platforms planned for commercial clients, mobile devices, and gaming by 2007, and full Fusion processors by late 2008 or early 2009.[22] Development of early Fusion prototypes accelerated in the late 2000s, pairing AMD's K10 (Stars) CPU architecture with ATI's TeraScale GPU technology to create system-on-chip (SoC) solutions.[25] This shift marked AMD's transition from selling standalone CPUs and GPUs to producing cohesive SoCs, responding to industry demands for more efficient, all-in-one processors amid Intel's advancements in integrated graphics.[24] Internal codenames such as "Llano" were assigned to the first desktop-oriented APU prototype, which combined K10-based CPU cores with a TeraScale GPU on a single die, laying the groundwork for subsequent releases.[25]Launch and Early Adoption
The initial commercial launches of AMD's Accelerated Processing Units (APUs) occurred in 2011, beginning with the low-power Ontario and Zacate APUs of the Brazos platform in the first quarter, followed by the desktop Llano platform in June and its mobile variants later that year. The Llano APUs, built on a 32nm process and integrating K10-based CPU cores with Radeon HD 6000-series graphics, were introduced under the A-Series branding to target mainstream consumer desktops. These releases represented AMD's first widespread deployment of fused CPU-GPU architectures, branded as A-Series processors with embedded Radeon HD graphics to deliver enhanced multimedia and light gaming capabilities without discrete GPUs.[1] Early adoption was favorable, particularly for graphics-intensive tasks where Llano APUs outperformed Intel's Sandy Bridge integrated GPUs, achieving up to four times higher frame rates in benchmarks like 3DMark and select games. Positioned for budget systems, the APUs gained traction in entry-level PCs and netbooks, with models like the A8-3850 priced at $135 to undercut competitors and drive volume sales among cost-sensitive consumers and OEMs.[26] Positive reviewer feedback on integrated graphics performance spurred uptake in affordable all-in-one systems and slim laptops, helping AMD capture share in the sub-$500 PC segment. Despite these strengths, early APUs encountered hurdles such as elevated power consumption, exemplified by Llano's 100W TDP which constrained thermal designs in compact chassis. The 32nm fabrication process also imposed efficiency limitations relative to emerging competitors' nodes, impacting battery life in mobile variants. In the nascent tablet space, AMD's x86-based APUs faced rivalry from Nvidia's ARM-oriented Tegra 2 SoCs, which prioritized ultra-low power for longer runtime in portable devices.[27] A key milestone came with the 2012 launch of the Trinity APUs, which transitioned to the FM2 socket for desktops and improved integration, broadening ecosystem support and paving the way for further refinements.[28] By 2013, APU adoption in OEM laptops had grown substantially, evidenced by partnerships like AMD's collaboration with HP to deliver over one million APU-equipped units to the Chinese market.[29]Evolution to Zen and Beyond
In the mid-2010s, AMD advanced its APU lineup by integrating Graphics Core Next (GCN) GPUs with the Kaveri APUs launched in 2014, marking the first use of this architecture in integrated graphics for improved compute and gaming performance.[30][31] Concurrently, the company introduced the low-power Puma platform, featuring Mullins and Beema APUs targeted at tablets and ultrathin devices, which delivered up to twice the battery life and 25% better overall performance compared to prior generations like Temash.[32] By 2015, the Carrizo APUs with Excavator cores further enhanced efficiency, achieving up to 25% longer battery life and 20% faster graphics performance through optimizations like improved voltage scaling and a more integrated SoC design.[33] The transition to Zen architectures began in 2017 with Raven Ridge, AMD's first APU combining Zen CPU cores and Vega graphics, which extended support for the AM4 socket through at least 2020 to prolong platform longevity.[34] This shift enabled better multi-threaded performance and integrated graphics capable of 1080p gaming without discrete GPUs. In 2020, the Renoir APUs adopted the Zen 2 microarchitecture on a 7nm process, doubling transistor density over prior 14nm designs and boosting single-threaded performance by up to 25% while maintaining Vega graphics for balanced mobile and desktop use.[35] Recent developments have focused on graphics and AI enhancements, with the 2022 Rembrandt APUs introducing RDNA 2 integrated GPUs for ray tracing support and up to 50% more compute units than Vega predecessors, enabling solid 1080p gaming on 6nm processes.[36] The 2023 Phoenix APUs paired Zen 4 cores with RDNA 3 graphics, delivering improved efficiency and AI acceleration on a 4nm node.[37] In 2024, Strix Point APUs added a dedicated neural processing unit (NPU) delivering 50 TOPS for AI workloads, powering the Ryzen AI 300 series and advancing Copilot+ PC capabilities with Zen 5 cores and RDNA 3.5 graphics.[38] APUs have expanded into high-impact applications, including custom Zen 2 variants in the PlayStation 5 console for seamless CPU-GPU integration in gaming.[39] Their role in AI PCs has grown significantly, with Ryzen AI processors enabling on-device inference and over 150 AI PC models available by 2025, driving productivity and creative tools.[40] In laptops, APUs now form the core of AMD's client portfolio, supporting the surge in thin-and-light designs with integrated AI and graphics.Architectural Features
CPU Microarchitectures in APUs
The CPU microarchitectures in AMD APUs began with the K10-based "Stars" cores used in early desktop-oriented models like Llano. These cores featured up to four standard cores with dedicated floating-point units on a 32 nm process, emphasizing multi-threaded workloads to complement integrated graphics.[41] Concurrently, the Bobcat microarchitecture powered low-power APUs, delivering a dual-issue, out-of-order execution model on a 40 nm process tailored for netbook and ultrathin applications with single or dual cores optimized for efficiency over peak performance.[42] Subsequent iterations refined the Bulldozer lineage through the Piledriver microarchitecture, which introduced enhancements like improved branch prediction and floating-point scheduling, yielding approximately 15% higher instructions per clock (IPC) compared to Bulldozer while maintaining the modular structure. Steamroller followed, expanding execution units for wider integer and floating-point throughput, achieving up to 20% IPC gains over Piledriver through better resource sharing within modules and support for advanced instruction sets.[43] The Puma family, a derivative optimized for mobile devices, refined Jaguar cores with enhanced power gating and decode efficiency for ultra-low-power scenarios, prioritizing battery life in thin-and-light systems.[44] Excavator concluded this era, delivering a 14% IPC uplift over Steamroller via larger caches and optimized pipelines on a 28 nm process, marking a shift toward higher efficiency in mainstream mobile APUs.[45] The Zen series represented a complete redesign, debuting in APUs with Zen 1 cores on a 14 nm process, supporting 4 to 8 cores per chip with simultaneous multithreading (SMT) for improved single-threaded performance through a wider front-end and deeper execution resources.[46] Zen+ refined this on a 12 nm process, reducing latencies in cache and memory subsystems for up to 3% IPC gains while enabling higher boost clocks in APU configurations. Zen 2 advanced to a 7 nm process with a partial chiplet layout, incorporating a monolithic die for APUs but leveraging modular CCDs for scalability, alongside doubled L3 cache per core complex for better multi-core efficiency.[47] Zen 3 emphasized higher clock speeds through unified L3 cache designs and improved branch prediction, achieving 19% IPC uplift over Zen 2 in APU variants. Zen 4 introduced AVX-512 support with double-pumped 256-bit execution units on a 4 nm monolithic die in APU configurations, enhancing vector workloads in integrated systems.[48] Zen 5 further boosts IPC by ~16% via enhanced branch prediction with dual decode pipes and wider pipelines, targeting AI-accelerated APUs on advanced nodes.[49] APU-specific adaptations across these microarchitectures emphasize balanced core counts from 2 to 16 to optimize power efficiency, allowing seamless scaling for thermal constraints in laptops and desktops while pairing CPU performance with integrated GPUs. In some embedded variants, elements like programmable logic akin to FPGA fabrics are integrated alongside Ryzen cores via the Embedded+ architecture, enabling customizable acceleration for edge AI and real-time processing.[50]GPU Microarchitectures in APUs
The GPU microarchitectures in AMD Accelerated Processing Units (APUs) have evolved significantly since the introduction of integrated graphics, transitioning from the TeraScale architecture to the more advanced RDNA family, with each generation enhancing rendering efficiency, compute capabilities, and API support tailored for power-constrained environments. In the TeraScale era, AMD employed the TeraScale 2 microarchitecture, based on a Very Long Instruction Word (VLIW5) design, in early APUs such as Llano and Trinity. This architecture featured up to 80 shader processors organized into 5 shader engines, enabling DirectX 11 compatibility for improved tessellation and geometry processing in graphics workloads. Later Piledriver-based APUs incorporated refinements from TeraScale 3 and 4, which introduced enhanced tessellation units and minor efficiency gains in pixel and vertex processing pipelines, though still limited by the VLIW paradigm's scalability issues. The shift to the Graphics Core Next (GCN) family marked a pivotal advancement, starting with GCN 1.0 in APUs like Kabini and Kaveri, which unified scalar and vector processing through a single-instruction multiple-data (SIMD) approach across compute units (CUs). This generation supported OpenCL 1.2 for general-purpose GPU computing and delivered up to 6 CUs in mobile variants, focusing on balanced rasterization and basic compute tasks. GCN 2.0, featured in Carrizo, added asynchronous compute engines to allow concurrent graphics and compute operations, reducing pipeline stalls and improving throughput in multi-threaded applications. Subsequent iterations included GCN 3.0 in Bristol Ridge, which emphasized higher clock speeds—up to 1 GHz in some configurations—for better performance in legacy DirectX 12 workloads without major architectural overhauls. The GCN 5.0 variants, based on Vega, appeared in Zen-based APUs such as Raven Ridge, Picasso, Renoir, and Cezanne, scaling to 8–11 CUs with high-bandwidth cache controllers and early hardware support for ray tracing primitives like bounding volume hierarchy traversal acceleration. The transition to the RDNA architecture brought further optimizations for gaming and AI workloads in APUs. RDNA 2, integrated in Rembrandt APUs, utilized 12 CUs with a dual-issue wavefront architecture, enhancing rasterization efficiency by up to 50% over Vega through improved instruction scheduling and primitive shaders. RDNA 3, integrated in Phoenix APUs, incorporated dedicated AV1 encode hardware and dual compute engines per workgroup processor for superior video processing and ray tracing performance. The latest RDNA 3.5, found in Strix Point APUs, expands to up to 16 CUs with refined ray tracing accelerators and integrated AI upscaling engines, enabling efficient hardware-accelerated neural rendering in thin-and-light devices. APU-specific optimizations across these microarchitectures emphasize scalability and resource sharing, with configurable CU counts ranging from 2 to 16 to match thermal and power envelopes. Recent models achieve shared memory bandwidth exceeding 100 GB/s via unified L3 caches and Infinity Fabric interconnects, while supporting modern APIs like Vulkan 1.3 for cross-platform compute and DirectX 12 Ultimate for advanced mesh shading and variable rate shading.Integration Technologies
The Heterogeneous System Architecture (HSA), co-developed by AMD and introduced in 2013 through the HSA Foundation, enables cohesive CPU-GPU integration in APUs by allowing direct pointer sharing between the CPU and GPU, eliminating the need for data copying. This architecture provides unified virtual addressing across processors, permitting seamless memory access, and supports hOpenCL for heterogeneous computing tasks that leverage both CPU and GPU resources without kernel recompilation. HSA's design facilitates low-latency task dispatching to the GPU independently of the CPU, enhancing overall system efficiency for parallel workloads.[51] Full HSA implementation in AMD APUs began with the Carrizo series in 2015, building on partial support in earlier models like Kaveri, and has since become a cornerstone for compute-intensive applications in subsequent generations. By unifying the programming model, HSA reduces overhead in heterogeneous computing, allowing developers to treat the APU as a single coherent system for tasks such as image processing and machine learning acceleration.[52][53] The memory subsystem in AMD APUs emphasizes shared access to optimize CPU-GPU interactions, evolving from DDR3 support in early architectures to high-bandwidth LPDDR5X-7500 in modern designs like Strix Point. In Zen-based APUs, a shared L3 cache—typically 4-16 MB depending on configuration—serves both CPU cores and the integrated GPU, reducing latency for common data sets and improving cache coherency through hardware-managed protocols. Starting with Zen 2+ chiplet implementations, Infinity Fabric interconnect provides scalable, high-speed communication between dies, achieving bandwidths up to 40-60 GB/s for inter-component data transfer in multi-chiplet APUs.[54][55] Power and thermal management in APUs incorporate dynamic voltage and frequency scaling (DVFS) applied jointly to CPU and GPU domains, enabling real-time adjustments based on workload demands to balance performance and efficiency. Thermal design power (TDP) configurations span ultra-low 4W for embedded and handheld applications to 120W for high-end desktop and mobile variants, with configurable profiles allowing system designers to tailor power envelopes. An integrated northbridge on the APU die handles I/O coherency and memory controller functions, minimizing external dependencies and latency in data routing.[56][57][58] Further integrations enhance APU versatility, including an Input-Output Memory Management Unit (IOMMU) that supports secure virtualization by enabling direct memory access (DMA) remapping for GPU tasks in virtualized environments. PCIe support has advanced to version 4.0 in Zen 3+ APUs and 5.0 in select Zen 4+ models, providing up to 28 lanes in Phoenix APUs for expanded peripheral connectivity. Beginning with Zen 4+, APUs incorporate a dedicated Neural Processing Unit (NPU) based on the XDNA architecture, as seen in Strix Point, delivering up to 50 TOPS for AI inference while maintaining power efficiency through dedicated accelerators.[59]TeraScale-based APUs
Llano (2011)
Llano, released in June 2011, marked AMD's entry into desktop accelerated processing units (APUs) with the codename Llano and the introduction of the FM1 socket. The platform featured the A4, A6, and A8 series processors, offering dual- and quad-core configurations based on the Stars microarchitecture derived from the K10 family, providing 2 to 4 cores and corresponding threads without simultaneous multithreading.[60][61] The integrated graphics drew from the Radeon HD 6000 series using the TeraScale 2 architecture, with variants including the HD 6410D (160 shading units) for A4 models, HD 6530D (320 shading units) for A6 models, and HD 6550D (400 shading units) for A8 models, delivering up to 480 GFLOPS of peak performance.[62][63][64] These APUs supported Dual Graphics mode, allowing combination with compatible discrete AMD GPUs for enhanced rendering via crossfire technology. Fabricated on a 32 nm silicon-on-insulator (SOI) process by GlobalFoundries, Llano APUs contained 1.178 billion transistors across a 228 mm² die.[60] They supported dual-channel DDR3 memory up to 1866 MT/s and operated within a 65–100 W thermal design power (TDP) envelope, balancing performance for mainstream desktops. As the first desktop APU to integrate DirectX 11-capable graphics, Llano pioneered unified CPU-GPU designs for consumer systems, significantly outperforming Intel's HD Graphics 2000 in gaming workloads—delivering up to 3–4 times higher frame rates in titles like Dirt 2 at 1680x1050 resolution. Its efficient video decode capabilities and low-power integrated graphics made it popular for budget home theater PCs (HTPCs) and all-in-one systems, enabling 1080p playback and light gaming without discrete GPUs.[65][66]Bobcat-based APUs (2011)
The Bobcat-based APUs, released in early 2011 as part of AMD's Brazos platform, targeted ultra-portable devices such as netbooks, mainstream laptops, and tablets with low-power requirements. These APUs integrated one or two Bobcat CPU cores, operating at clock speeds ranging from 1.0 to 1.6 GHz, under the E-Series (Zacate codename for netbooks) and C-Series (Ontario codename for mainstream low-power laptops).[67] The Desna variant, aimed at tablets under the Z-Series branding, featured similar dual-core configurations at around 1.0 GHz but included enhanced display output support for touch-enabled devices.[68] Fabricated on a 40 nm process node by TSMC, these APUs supported single-channel DDR3-1066 memory and had thermal design power ratings of 9–18 W, enabling fanless designs in slim form factors.[69][2] The integrated graphics were based on the TeraScale 2 architecture, branded as Radeon HD 6xxxG series, with five compute units delivering approximately 30–50 GFLOPS of peak performance depending on the model and clock speeds up to 500 MHz. For instance, the Zacate E-350 featured the Radeon HD 6310 with 80 shader processors at 488 MHz, while Ontario models like the C-50 used the Radeon HD 6250 at lower clocks for reduced power draw.[70] These GPUs included the third-generation Unified Video Decoder (UVD3), supporting hardware-accelerated 1080p video playback for H.264 and other formats, which enhanced multimedia capabilities in battery-constrained systems. The overall design emphasized integration on a single die, with die sizes around 75 mm², prioritizing cost efficiency over high transistor density.[2] These APUs marked AMD's first sub-10 W offerings for the netbook and tablet markets, outperforming Intel's Atom processors in multimedia workloads and light productivity tasks by up to 80% in JavaScript execution while maintaining comparable power efficiency. The Brazos platform's reception was positive for reviving interest in x86-based ultra-portables, as the combined CPU-GPU setup enabled smooth 720p video playback and basic 3D graphics without discrete components. However, the in-order execution of Bobcat cores limited multithreaded performance, positioning these APUs as entry-level solutions rather than direct competitors to higher-end mobile chips. Desna's addition of multiple display outputs facilitated early Windows 7 tablet adoption, though overall adoption was tempered by the era's shift toward ARM-based alternatives.[71]Piledriver-based APUs (2012–2013)
The Piledriver-based APUs represented AMD's second-generation desktop and mobile accelerated processing units, succeeding the Llano architecture and bridging the TeraScale graphics era. Launched under the codenames Trinity and Richland, these APUs integrated Piledriver CPU cores with Radeon HD 7000 and 8000 series graphics, respectively, on the FM2 socket for desktops. They targeted mainstream consumer systems, emphasizing balanced performance for multimedia, light gaming, and everyday computing in budget-oriented PCs and laptops.[72] Trinity debuted in May 2012 for mobile platforms, with desktop variants following in October 2012, while Richland arrived as a refresh in March 2013 for mobiles and June 2013 for desktops. The lineup spanned the A4 to A10 series, featuring 1 to 2 Piledriver modules (2 to 4 cores and threads), with base clocks starting at 2.0 GHz for entry-level models and peaking at 4.2 GHz for Trinity's flagship A10-5800K, which could turbo up to 4.2 GHz. Richland models, such as the A10-6800K, pushed frequencies higher to a 4.1 GHz base and 4.4 GHz turbo, offering modest gains through architectural tweaks and higher binning. These APUs supported up to 4 MB of L2 cache and were designed for multi-threaded workloads, though single-thread performance remained competitive primarily in value segments.[73][28][74] Both platforms utilized a 32 nm SOI process node, with Trinity dies measuring 246 mm² and containing 1.303 billion transistors; Richland retained the same die size and transistor count as a silicon refresh, focusing on optimizations rather than a node shrink. Thermal design power (TDP) ranged from 65 W to 100 W for desktops and as low as 35 W for mobiles, enabling efficient operation in compact systems. Memory support included dual-channel DDR3-1866, an upgrade from prior generations, enhancing bandwidth for integrated graphics tasks. Piledriver delivered approximately 10-15% higher instructions per clock (IPC) compared to the original Bulldozer cores, primarily through improved branch prediction, floating-point execution, and reduced latency in the shared FPU, though multi-threaded scaling was limited by the module design.[75][72] Graphics integration featured TeraScale 3 (Radeon HD 7000 series) in Trinity and TeraScale 3 (Radeon HD 8000 series) in Richland, with up to 384 shader processors (6 compute units) configurable across models. Peak performance reached approximately 614 GFLOPS in top Trinity configurations at 800 MHz GPU clocks, scaling to around 650 GFLOPS in Richland's higher-binned 844 MHz variants, supporting DirectX 11 and hardware-accelerated video decode via UVD3.[76][77] These iGPUs excelled in budget gaming, delivering playable frame rates in titles like League of Legends at 1080p low settings when paired with dual-channel memory. PowerTune technology enabled dynamic GPU boosting within TDP limits, improving efficiency during bursty workloads like video playback or casual gaming.[73][72] Key innovations included native USB 3.0 support via the accompanying A85X and A75 chipsets, providing up to four USB 3.0 ports alongside 10 USB 2.0 ports for enhanced peripheral connectivity in mainstream builds. The FM2 platform also introduced overclocking via AMD OverDrive for unlocked "K" models, appealing to enthusiasts on a budget. Reception was positive in the value market, where these APUs powered affordable all-in-one PCs and HTPCs, capturing significant share in emerging markets and contributing to APUs comprising nearly 75% of AMD's processor unit shipments by late 2012. Their integrated design reduced system costs, making them popular for entry-level gaming rigs capable of 30+ FPS in older titles without discrete GPUs.[78][79]Graphics Core Next-based APUs
Jaguar-based APUs (2013)
The Jaguar-based APUs, released in the second quarter of 2013, marked AMD's shift to a new low-power x86 core architecture paired with Graphics Core Next (GCN) graphics for mainstream mobile and ultra-low-power applications. Codenamed Kabini for entry-level laptops and Temash for tablets and embedded devices, these APUs targeted thin-and-light systems with improved power efficiency over prior generations. The A4 and A6 series processors featured four Jaguar cores with out-of-order execution, operating at clock speeds ranging from 1.5 GHz for the A4-5000 to 2.0 GHz for the A6-5200 in Kabini variants, and lower 1.0 GHz base (up to 1.4 GHz turbo) in the Temash A6-1450.[80][81][82] These APUs were fabricated on a 28 nm process node, supporting DDR3-1600 memory in a single-channel configuration and delivering thermal design power (TDP) ratings from 15-25 W for Kabini to as low as 4-8 W for Temash, enabling fanless designs in ultrathin tablets. The packages used BGA mounting typical for mobile SoCs, with some variants compatible with FT3/FT4 interfaces in modular systems. Kabini and Temash found adoption in entry-level laptops like the Acer Aspire series and tablets, while custom variants powered the PlayStation 4 and Xbox One consoles, contributing to over 100 million indirect unit shipments through these gaming platforms by leveraging the same Jaguar and GCN foundations.[83][84][85][15] The integrated GPUs, branded under the Radeon HD 8000 series, utilized GCN 1.0 architecture with 2 or 4 compute units (CUs) delivering up to approximately 300 GFLOPS of single-precision compute performance, depending on configuration and clock speeds up to 600 MHz. For instance, the A4-5000 paired with Radeon HD 8330 (2 CUs at 497 MHz), while higher-end A6 models used variants like HD 8400 (up to 4 CUs). These GPUs supported OpenCL 1.2 for general-purpose computing, enabling basic heterogeneous workloads alongside DirectX 11.1 graphics.[86][87] Jaguar-based APUs delivered 20-25% better power efficiency compared to the preceding Piledriver architecture, primarily through smaller core design and optimized branch prediction, allowing sustained performance at lower TDPs. The Temash platform, in particular, achieved a configurable 3.95-8 W TDP, facilitating extended battery life in tablets without compromising quad-core x86 compatibility. Reception highlighted their role in reviving AMD's mobile presence, especially via console integrations that validated the architecture's scalability for real-time rendering and multitasking.[88][89][90][91]Steamroller and Puma-based APUs (2014–2015)
The Steamroller and Puma microarchitectures marked AMD's transition to more efficient APUs in 2014, targeting both mainstream desktop/mobile and low-power portable devices. The Kaveri family, based on Steamroller, debuted in the first quarter of 2014 for FM2+ socket desktop and mobile platforms, with models like the A10-7850K offering up to four cores clocked at 3.7 GHz.[92][93] A refresh under the Godavari codename followed in the second quarter of 2015, introducing minor clock boosts such as the A10-7870K at up to 3.9 GHz while retaining the core architecture.[94] Concurrently, the Puma-based Beema and Mullins platforms launched in early 2014, optimized for low-power applications with quad-core configurations in the A6 to A10 series reaching up to 2.4 GHz, emphasizing battery life in tablets and convertibles.[95][96] These APUs were fabricated on a 28 nm process, with Kaveri featuring 2.41 billion transistors across a 245 mm² die, while Puma variants like Beema and Mullins integrated similar densities for compact SoCs.[97] Thermal design power ranged from 65–95 W for desktop Kaveri models to 12–25 W for mobile Beema, with Mullins extending to ultra-low configurations as low as 2.5 W for fanless designs.[98][99] Memory support included DDR3-2133, and Mullins notably incorporated SATA 6 Gb/s for enhanced storage connectivity in embedded and portable systems.[100] The Steamroller cores in Kaveri improved instruction-level parallelism over prior architectures, while Puma refined Jaguar's efficiency for bursty workloads, as detailed in broader microarchitecture overviews.[92] GPU integration advanced with Graphics Core Next (GCN) 1.2/2.0 architectures, branded as Radeon R7 and R5 in Kaveri and Puma respectively, featuring up to 8 compute units (CUs) for Kaveri and up to 6 for Puma variants for scalable performance.[101] Kaveri's top configurations delivered up to approximately 0.86 TFLOPS of FP32 compute at base clocks, enabling smooth 1080p gaming and multimedia tasks without discrete graphics.[102] HSA preview functionality first appeared in Kaveri, allowing preliminary unified memory access between CPU and GPU to streamline heterogeneous computing, though full interoperability required software ecosystem maturity.[43] Innovations included Kaveri's support for 4K video decoding and display output via updated UVD 4.2 and VCE 2.0 engines, facilitating Ultra HD playback in compatible systems.[103] Puma platforms, particularly Mullins at configurable TDPs around 10.6 W, were tailored for 2-in-1 convertibles and tablets, prioritizing all-day battery life and seamless mode switching.[95] These APUs received mixed reception; Kaveri was praised for its integrated graphics leap and HSA potential but criticized for modest CPU gains over predecessors and premium pricing relative to Intel Haswell alternatives.[104] Puma variants fared better in low-power niches, offering competitive efficiency for media consumption devices despite limited high-end appeal.Excavator-based APUs (2015–2017)
The Excavator-based APUs represented AMD's sixth and seventh generation A-Series processors, integrating the Excavator CPU microarchitecture with Graphics Core Next (GCN) 3.0 graphics to target mobile and entry-level desktop markets. These platforms, including Carrizo, Carrizo-L, Bristol Ridge, and Stoney Ridge, emphasized energy efficiency improvements over prior Steamroller designs while supporting Heterogeneous System Architecture (HSA) for unified CPU-GPU computing.[105] Carrizo APUs launched in the second quarter of 2015 for mobile devices, featuring quad-core Excavator configurations under the A6 to A12 branding with clock speeds up to 3.7 GHz. The lower-end Carrizo-L variant followed in the fourth quarter of 2015, also mobile-focused but with dual- or quad-core Puma+ cores derived from Excavator for mainstream configurations at reduced power levels.[33] Bristol Ridge and Stoney Ridge arrived in the second quarter of 2016, with Bristol Ridge serving desktop systems on the FM2+ socket and Stoney Ridge targeting mobile ultrabooks via the FP4 package; both offered A6 to A12 models with up to four Excavator cores. Bristol Ridge is a family of accelerated processing units (APUs) released in 2016-2017, part of the 7th generation A-Series APUs for desktop and mobile, based on the Excavator CPU microarchitecture and Graphics Core Next (GCN) graphics. Key Bristol Ridge models include: Desktop APUs:- A12-9800 (4 cores, 3.8 GHz base, 4.2 GHz boost, 35W TDP)
- A12-9800E (4 cores, 3.1 GHz base, 3.8 GHz boost, 35W TDP)
- A10-9700 (4 cores, 3.5 GHz base, 3.8 GHz boost, 65W TDP)
- A10-9700E (4 cores, 3.0 GHz base, 3.5 GHz boost, 35W TDP)
- A6-9500 (2 cores, 3.5 GHz base, 3.8 GHz boost, 65W TDP)
- A6-9500E (2 cores, 3.0 GHz base, 3.4 GHz boost, 35W TDP)
- A12-9700P (4 cores, 2.5 GHz base, 3.6 GHz boost, 15W TDP)
- A10-9600P (4 cores, 2.4 GHz base, 3.3 GHz boost, 15W TDP)
- A9-9410 (2 cores, 2.9 GHz base, 3.5 GHz boost, 10-25W TDP)
- A6-9220 (2 cores, 2.5 GHz base, 3.0 GHz boost, 10-25W TDP)
- A4-9120 (2 cores, 2.5 GHz base, 2.6 GHz boost, 10-25W TDP)
Zen-based APUs with GCN Graphics (2017–2021)
The Zen-based APUs with GCN graphics marked AMD's transition to integrating its high-performance Zen CPU cores with the Graphics Core Next (GCN) 5th generation architecture, specifically Vega graphics, in a monolithic die design. The series began with the Raven Ridge APUs, which debuted in mobile form factor at the end of 2017 and expanded to desktop with the Ryzen 2000G series in February 2018, supporting the AM4 socket for desktops and FP5 for mobiles. This was followed by the Picasso refresh in January 2019 for mobile and July 2019 for desktop Ryzen 3000G models, leveraging Zen+ enhancements on a 12 nm process. The lineup progressed to Renoir in Q1 2020 for mobile Ryzen 4000 series and July 2020 for desktop, adopting Zen 2 on 7 nm, before culminating with Cezanne in April 2021 for OEM mobile/desktop PRO variants and August 2021 for consumer Ryzen 5000G desktop models on Zen 3. These APUs featured 4 to 8 cores with simultaneous multithreading, base clocks starting at 3.6 GHz and boosts up to 4.6 GHz, targeting 35-65 W TDP envelopes for efficient all-in-one computing. At the heart of these APUs was the integrated Vega graphics based on GCN 5.0, offering configurations like Vega 8 (512 shaders, 8 compute units) or Vega 11 (704 shaders, 11 compute units), with peak performance reaching up to approximately 1.8 TFLOPS at official boost clocks around 1250 MHz in higher-end models like the Ryzen 5 2400G. The Vega iGPU supported FP16 half-precision compute, enabling early machine learning workloads and accelerating tasks like video encoding, while providing DirectX 12 compatibility and hardware-accelerated video decode for H.264, H.265, and VP9. Key system specifications included DDR4-3200 memory support with dual-channel configuration for optimal graphics bandwidth, PCIe 3.0 lanes (up to 20) in Zen 1 models evolving to PCIe 4.0 (up to 24) in Zen 2 and 3 variants, and transistor counts ranging from 4.95 billion on the 14 nm/210 mm² Raven Ridge die to 10.7 billion on the denser 7 nm/180 mm² Cezanne die, with Renoir and Picasso at approximately 9.8 billion and 4.94 billion transistors respectively on 156 mm² and 210 mm² dies. From Raven Ridge onward, including Cezanne, these APUs used a monolithic design without chiplets while benefiting from Zen 3's unified L3 cache improvements.[113] These APUs represented a significant leap in integrated graphics performance, delivering discrete-level capabilities comparable to entry-level dedicated GPUs like the GeForce GTX 1050 in 1080p gaming at low to medium settings, making them popular for budget builds without a separate graphics card.[114] Innovations included the debut of Zen architecture in APUs with Raven Ridge, offering up to 2x the IPC over prior Bulldozer-era designs, and the 7 nm shift in Renoir, which achieved nearly double the power efficiency of Picasso through process shrinks and Zen 2 optimizations, enabling sustained performance at lower TDPs. Cezanne further extended AM4 socket longevity into 2022 with unlocked multipliers for overclocking and enhanced iGPU clocks up to 2 GHz, solidifying reception as a value-driven solution for light gaming, content creation, and OEM systems.[115]RDNA-based APUs
Rembrandt (2022)
The Rembrandt platform, codenamed for AMD's Ryzen 6000 series mobile processors, was announced on January 4, 2022, with laptops featuring these APUs becoming available starting in February 2022.[116] This laptop-focused lineup emphasizes the Ryzen 5 and Ryzen 7 6000U and HS variants, which incorporate 6 to 8 Zen 3+ CPU cores and 12 to 16 threads, with boost clocks reaching up to 4.9 GHz.[116] Built on TSMC's 6 nm process, Rembrandt APUs contain approximately 13.1 billion transistors across a 210 mm² die and support configurable TDPs from 15 W to 45 W, enabling efficient operation in thin-and-light designs. They utilize the FP7 socket and include four 32-bit memory channels compatible with LPDDR5-6400, alongside PCIe 4.0 and USB4 connectivity for enhanced data throughput.[117] A key advancement in Rembrandt is the integration of the Radeon 680M graphics processor, AMD's first APU to employ the RDNA 2 architecture with 12 compute units (768 shaders), operating at up to 2.4 GHz.[118] This iGPU delivers up to 3.7 TFLOPS of FP32 compute performance, roughly doubling the graphics capabilities of the Vega-based iGPUs in the prior Cezanne generation (Ryzen 5000 series).[116] It introduces hardware-accelerated ray tracing via 12 ray accelerators and AV1 video decode support, enabling smoother 1080p playback and improved efficiency for streaming and content creation.[119] These features position Rembrandt as a strong contender for integrated 1080p gaming in ultrabooks, achieving 40-60 FPS in select AAA titles at low to medium settings without discrete GPUs.[120] Rembrandt's refinements to the Zen 3 architecture, including optimized power delivery and higher average clocks, contribute to up to 28% better multi-threaded CPU performance over Cezanne while maintaining similar cache latencies.[121] As the inaugural RDNA-based APU, it marked a significant shift from prior GCN/Vega integrations, prioritizing graphics efficiency for mobile workloads. The platform saw widespread adoption, powering over 200 premium laptop models from OEMs like ASUS, Lenovo, and HP by mid-2022, and capturing substantial share in the high-end ultrabook segment for its balance of productivity, battery life (up to 29 hours in office tasks), and casual gaming.[116]Phoenix (2023–2024)
The Phoenix platform, codenamed Phoenix Point, represents AMD's Zen 4-based accelerated processing unit (APU) architecture introduced in the Ryzen 7040 series for mobile devices, with a refresh under the Hawk Point codename in the Ryzen 8040 series. Launched in the second quarter of 2023 following an announcement at CES, the initial lineup targeted thin-and-light laptops and handhelds, featuring 6 to 8 Zen 4 cores and 12 to 16 threads, with maximum boost clocks reaching up to 5.2 GHz on the flagship Ryzen 9 7940HS model.[122] The Hawk Point refresh, announced in December 2023 and available starting early 2024, maintained the core Zen 4 design while enhancing AI capabilities, extending the platform's lifecycle into 2024 for broader commercial adoption.[123] Central to the Phoenix architecture is the integration of RDNA 3 graphics via the Radeon 760M and 780M iGPUs, marking AMD's first mobile APU with this GPU generation. The Radeon 780M, found in Ryzen 7 and 9 models, employs 12 compute units (CUs) with 768 shaders, operating at up to 2.7–3.0 GHz for approximately 4.3 TFLOPS of FP32 performance, while the Radeon 760M in Ryzen 5 variants uses 8 CUs at similar clocks for around 2.9 TFLOPS. These iGPUs leverage RDNA 3's dual compute units per workgroup processor (WGP) design, improving efficiency over prior architectures, alongside dedicated AI accelerators for machine learning tasks. Built on TSMC's 4nm process node with an approximate die size of 178 mm² and approximately 25.4 billion transistors in the primary compute/graphics chiplet (plus a separate I/O die), the platform supports configurable thermal design power (TDP) from 15 W to 54 W, LPDDR5X-7500 memory, and the FP8 socket for mobile integration.[124][125] A key innovation in Phoenix is the introduction of the Ryzen AI neural processing unit (NPU) based on AMD's XDNA architecture, the first dedicated AI engine in an x86 processor, enabling native support for features like Microsoft Copilot+. The original 7040 series delivers up to 10 TOPS of INT8 performance via the NPU, while the Hawk Point refresh boosts this to 16 TOPS, qualifying for Copilot+ PC certification and accelerating AI workloads such as image generation and video upscaling. Graphics performance sees over 50% uplift compared to the prior Rembrandt platform's Radeon 680M, driven by RDNA 3 advancements, allowing efficient 1080p and 1440p gaming in titles like Cyberpunk 2077 at medium settings.[123][126][127] The platform gained positive reception for its balance of power efficiency and integrated graphics prowess, particularly in handheld gaming devices like the ASUS ROG Ally, which employs a custom Phoenix-derived Ryzen Z1 Extreme APU (8 Zen 4 cores, 12-CU Radeon 780M at 15–30 W TDP). Reviews highlighted its ability to deliver playable frame rates at 1440p with low power draw, outperforming competitors like Intel's Meteor Lake in gaming efficiency by up to 139% in select benchmarks, while maintaining strong battery life in ultrathin laptops. This efficiency stems from the 4nm process and optimized chiplet design, positioning Phoenix as a foundational step for AI-enhanced mobile computing.[122][128][129]Strix Point (2024)
The Strix Point platform, codenamed for AMD's Ryzen AI 300 series, was announced at Computex in June 2024 and became available in premium laptops starting in July 2024.[38] Targeted at high-end mobile computing, it features up to 12 Zen 5 CPU cores—comprising a mix of full Zen 5 and dense Zen 5c cores—delivering up to 24 threads and a maximum boost clock of 5.1 GHz in models like the Ryzen AI 9 HX 370.[130][38] These monolithic APUs emphasize AI-driven tasks in thin-and-light designs from manufacturers such as ASUS and Lenovo, with configurable power envelopes suited for creative professionals and mobile gamers.[131] Integrated graphics are powered by the Radeon 890M, based on the RDNA 3.5 architecture with 16 compute units (1024 shaders) clocked up to 2.9 GHz, providing up to approximately 5.9 TFLOPS of FP32 compute performance.[132][133] Enhancements include improved ray-tracing accelerators over prior RDNA generations and support for AMD Fluid Motion Frames, enabling frame generation for smoother gameplay in supported titles. The platform is fabricated on TSMC's 4 nm process node, with the core die measuring about 233 mm² and incorporating roughly 15 billion transistors across the CPU, GPU, and other components.[134] It supports LPDDR5X memory up to 8000 MT/s and operates within a 15-54 W TDP range, though some configurations extend to 80 W for higher performance.[135] The dedicated XDNA 2 neural processing unit delivers 50 TOPS of INT8 performance, optimized for generative AI workloads such as on-device large language models.[130][38] Architecturally, Strix Point achieves a 16% increase in instructions per clock over the Zen 4-based predecessors, contributing to strong efficiency in multi-threaded applications.[38] This marks the first AMD APU series capable of running advanced on-device LLMs fluidly without cloud dependency, excelling in creative workflows like video editing and 3D rendering due to the combined CPU, GPU, and NPU capabilities.[136] Early reviews highlight its prowess in AI-accelerated content creation, with the integrated NPU handling tasks like image generation and real-time upscaling more efficiently than CPU-only alternatives.[137]Strix Halo (2025)
Strix Halo employs a chiplet-based design, contrasting with the monolithic architecture of Strix Point. It supports up to 16 Zen 5 cores, exceeding Strix Point's maximum of 12 cores combining Zen 5 and Zen 5c variants. The integrated Radeon 8060S graphics, based on RDNA 3.5, feature up to 40 compute units—more than double the 16 in Strix Point's Radeon 890M—delivering performance comparable to discrete GPUs. Enhanced memory bandwidth, up to 256 GB/s via a 256-bit LPDDR5X interface, supports these capabilities. Targeting thicker high-performance laptops and handhelds with greater cooling demands, Strix Halo suits intensive gaming and compute tasks, while Strix Point prioritizes efficiency in premium thin-and-light devices.[3][138][139]References
- https://en.wikichip.org/wiki/amd/microarchitectures/zen
- https://en.wikichip.org/wiki/amd/microarchitectures/zen_2
- https://en.wikichip.org/wiki/amd/microarchitectures/zen_4

