Recent from talks
Nothing was collected or created yet.
Mali (processor)
View on WikipediaARM Cortex A57 A53 big.LITTLE SoC with a Mali-T624 GPU | |
| Release date | 2005 |
|---|---|
| Architecture |
|
| Models | See Variants |
| Cores | 1 to 32 cores |
| Fabrication process | 4 to 40 nm |
| API support | |
| Direct3D | 9 to 12 |
| OpenCL | 1.1 to 3.0 |
| OpenGL | 2.0 to 3.0 |
| Vulkan | 1.0 to 1.3 |
The Mali and Immortalis series of graphics processing units (GPUs) and multimedia processors are semiconductor intellectual property cores produced by Arm Holdings for licensing in various ASIC designs by Arm partners.
Mali GPUs were developed by Falanx Microsystems A/S, which was a spin-off of a research project from the Norwegian University of Science and Technology.[1] Arm Holdings acquired Falanx Microsystems A/S on June 23, 2006 and renamed the company to Arm Norway.[2]
It was originally named Malaik, but the team shortened the name to Mali, Serbo-Croatian for "small", which was thought to be fitting for a mobile GPU.[3]
On June 28, 2022, Arm announced their Immortalis series of GPUs with hardware-based Ray Tracing support.[4]
Graphics processors
[edit]Utgard
[edit]In 2005, Falanx announced their Utgard GPU Architecture, the Mali-200 GPU.[5] Arm followed up with the Mali-300, Mali-400, Mali-450, and Mali-470. Utgard was a non-unified GPU (discrete pixel and vertex shaders).[1]
| Model | Launch date | Type | EUs/Shader core count | Core clock rate
(MHz) |
L2 cache size | Fillrate | GFLOPS (per core) |
OpenGL ES | ||
|---|---|---|---|---|---|---|---|---|---|---|
| M△/s | GT/s | (GP/s) | ||||||||
| Mali-55/110 | 2005 | Fixed function pipeline[6] | 1 | 2.8 | 0.1 | ? | 1.1 | |||
| Mali-200 | 2007[7] | Programmable pipeline[6] | 1 | 5 | ? | 0.2 | 2.0 | |||
| Mali-300 | 2010[8] | 1 | 500 | 8 KiB | 55 | 0.5 | 5 | |||
| Mali-400 MP | 2008 | 1–4 | 200–600 | 8–256 KiB | 55 | 0.5 | 1.2–5.4 | |||
| Mali-450 MP | 2012 | 1–8 | 300–750 | 8–512 KiB | 142 | 2.6 | 4.5–11.9 | |||
| Mali-470 MP | 2015 | 1–4 | 250–650 | 8–256 KiB | 71 | 0.65 | 8–20.8 | |||
Midgard
[edit]1st generation
[edit]On November 10, 2010, Arm announced their Midgard 1st gen GPU Architecture, including the Mali-T604 and later the Mali-T658 GPU in 2011.[9][10][11][12] Midgard uses a Hierarchical Tiling system.[1]
2nd generation
[edit]On August 6, 2012, Arm announced their Midgard 2nd gen GPU Architecture, including the Mali-T678 GPU.[13] Midgard 2nd gen introduced Forward Pixel Kill.[1][14]
3rd generation
[edit]On October 29, 2013, Arm announced their Midgard 3rd gen GPU Architecture, including the Mali-T760 GPU.[15][1][16][17][18]
4th generation
[edit]On October 27, 2014, Arm announced their Midgard 4th gen GPU Architecture, including the Mali-T860, Mali-T830, Mali-T820. Their flagship Mali-T880 GPU was announced on February 3, 2015. New microarchitectural features include:[19]
- Up to 16 cores for the Mali-T880, with 256KB – 2MB L2 cache
Bifrost
[edit]1st generation
[edit]On May 27, 2016, Arm announced their Bifrost GPU Architecture, including the Mali-G71 GPU. New microarchitectural features include:[20][21]
- Unified shaders with quad vectorization
- Scalar ISA
- Clauses execution
- Full cache coherency
- Up to 32 cores for the Mali-G71, with 128KB – 2MB L2 cache
- Arm claims the Mali-G71 has 40% more performance density and 20% better energy efficiency than the Mali-T880
2nd generation
[edit]On May 29, 2017, Arm announced their Bifrost 2nd gen GPU Architecture, including the Mali-G72 GPU. New microarchitectural features include:[22][23]
- Arithmetic optimizations and increased caches
- Up to 32 cores for the Mali-G72, with 128KB – 2MB L2 cache
- Arm claims the Mali-G72 has 20% more performance density and 25% better energy efficiency than the Mali-G71
3rd generation
[edit]On May 31, 2018, Arm announced their Bifrost 3rd gen GPU Architecture, including the Mali-G76 GPU. New microarchitectural features include:[24][25]
- 8 execution lanes per engine (up from 4). Doubled pixel and texel throughput
- Up to 20 cores for the Mali-G76, with 512KB – 4MB L2 cache
- Arm claims the Mali-G76 has 30% more performance density and 30% better energy efficiency than the Mali-G72
Valhall
[edit]1st generation
[edit]On May 27, 2019, Arm announced their Valhall GPU Architecture, including the Mali-G77 GPU, and in October Mali-G57 GPUs. New microarchitectural features include:[26][27][28]
- New superscalar engine
- Simplified scalar ISA
- New dynamic scheduling
- Up to 16 cores for the Mali-G77, with 512KB – 2MB L2 cache
- Arm claims the Mali-G77 has 30% more performance density and 30% better energy efficiency than the Mali-G76
2nd generation
[edit]On May 26, 2020, Arm announced their Valhall 2nd Gen GPU Architecture, including the Mali-G78. New microarchitectural features include:[29][30][31]
- Asynchronous clock domains
- New FMA units and increase Tiler throughput
- Up to 24 cores for the Mali-G78, with 512KB – 2MB L2 cache
- Arm Frame Buffer Compression (AFBC)
- Arm claims the Mali-G78 has 15% more performance density and 10% better energy efficiency than the Mali-G77
3rd generation
[edit]On May 25, 2021, Arm announced their Valhall 3rd Gen GPU Architecture (as part of TCS21), including the Mali-G710, Mali-G510, and Mali-G310 GPUs. New microarchitectural features include:[32][33][34]
- Larger shader cores (2x compared to Valhall 2nd Gen)
- New GPU frontend, Command Stream Frontend (CSF) replaces the Job Manager
- Up to 16 cores for the Mali-G710, with 512KB – 2MB L2 cache
- Arm claims the Mali-G710 has 20% more performance density and 20% better energy efficiency than the Mali-G78
4th generation
[edit]On June 28, 2022, Arm announced their Valhall 4th Gen GPU Architecture (as part of TCS22), including the Immortalis-G715, Mali-G715, and Mali-G615 GPUs. New microarchitectural features include:[4][35]
- Ray Tracing support (hardware-based)
- Variable Rate Shading[36]
- New Execution Engine, with doubled the FMA block, Matrix Multiply instruction support, and PPA improvements
- Arm Fixed Rate Compression (AFRC)
- Arm claims the Immortalis-G715 has 15% more performance & 15% better energy efficiency than the Mali-G710[37]
5th generation
[edit]On May 29, 2023, Arm announced their 5th Gen Arm GPU Architecture (as part of TCS23), including the Immortalis-G720, Mali-G720 and Mali-G620 GPUs.[38][39][40] New microarchitectural features include:[41]
- Deferred vertex shading (DVS) pipeline
- Arm claims the Immortalis-G720 has 15% more performance and uses up to 40% less memory bandwidth than the Immortalis-G715
Technical details
[edit]Like other embedded IP cores for 3D rendering acceleration, the Mali GPU does not include display controllers driving monitors, in contrast to common desktop video cards. Instead, the Mali ARM core is a pure 3D engine that renders graphics into memory and passes the rendered image over to another core to handle display.
ARM does, however, license display controller SIP cores independently of the Mali 3D accelerator SIP block, e.g. Mali DP500, DP550 and DP650.[42]
ARM also supplies tools to help in authoring OpenGL ES shaders named Mali GPU Shader Development Studio and Mali GPU User Interface Engine.
Display controllers such as the ARM HDLCD display controller are available separately.[43]
Variants
[edit]The Mali core grew out of the cores previously produced by Falanx and currently constitute:[44]
| Model | Microarchi- tecture |
Type | Launch date | EUs/Shader core count | Shading Units (per core) | Total Shaders | Fab (nm) | Die size (mm2) |
Core clock rate (MHz) | L2 cache size (KiB) | Fillrate | GFLOPS (per core) |
GFLOPS (total) |
API (version) | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| M△/s | GT/s | (GP/s) | Vulkan | OpenGL ES | OpenCL | |||||||||||||
| Mali-T604[45] | Midgard 1st gen | Unified shader model + | Nov 2010[46] | 1–4 | 8 | 8–32 | 32 28 |
? | 533 | 32–256 | 133 | 0.6 @ 600 MHz | 9.6 @ 600 MHz | 9.6–38.4 @ 600 MHz | — | 3.1 | 1.1 Full Profile | |
| Mali-T658[45] | Nov 2011[47] | 1–8 | 16 | 16–128 | ? | ? | ? | 19.2 @ 600 MHz | 19.2–153.6 @ 600 MHz | |||||||||
| Mali-T622 | Midgard 2nd gen | Jun 2013[48] | 1–2 | 4 | 4–8 | 32 28 |
? | 533 | ? | ? | 4.8 @ 600 MHz | 4.8–9.6 @ 600 MHz | ||||||
| Mali-T624 | Aug 2012 | 1–4 | 8 | 8–32 | ? | 533–600 | ? | ? | 9.6 @ 600 MHz | 9.6–38.4 @ 600 MHz | ||||||||
| Mali-T628 | 1–8 | 16 | 16–128 | ? | 533–695 | ? | ? | 19.2 @ 600 MHz | 19.2–153.6 @ 600 MHz | |||||||||
| Mali-T678[49] | 1–8 | 28 | ? | ? | ? | ? | ||||||||||||
| Mali-T720 | Midgard 3rd gen | Oct 2013 | 1–8 | 10 | 10–80 | 28 14 10 |
? | 400–700 | 600 (MP8@ 600 MHz) |
0.6 @ 600 MHz | 12 @ 600 MHz | 12–96 @ 600 MHz | ||||||
| Mali-T760 | 1–16 | 14 | 14–224 | 28 20 14 |
1.75 mm2 per shader core at 14 nm[50] | 600–772 | 256–2048[51] | 1300 | 16.8 @ 600 MHz | 16.8–268.8 @ 600 MHz | 1.0[52] | 3.2[53] | 1.2 Full Profile | |||||
| Mali-T820 | Midgard 4th gen | Q4 2015 | 1–4 | 8 | 8–32 | 28 | ? | 600 | 32–256[51] | 400 | 9.6 @ 600 MHz | 9.6–38.4 @ 600 MHz | ||||||
| Mali-T830 | 16 | 16–64 | 28 16 14 |
? | 600–950 | 400 | 19.2 @ 600 MHz | 19.2–76.8 @ 600 MHz | ||||||||||
| Mali-T860 | 1–16 | 14 | 14–224 | ? | 350–700 | 256–2048[51] | 1300 | 16.8 @ 600 MHz | 16.8–268.8 @ 600 MHz | |||||||||
| Mali-T880 | Q2 2016 | 1–16 | 21 | 21–351 | 20 16 14 |
? | 650–1000 | 1700 | 25.2 @ 600 MHz | 25.2–403.2 @ 600 MHz | ||||||||
| Mali-G31 | Bifrost 1st gen | Unified shader model + Unified memory +
scalar, clause-based ISA |
Q1 2018 | 1–6[54] | 4 or 8 | 4–48 | 28 12 |
? | 650 | 32–512 | 0.5 @ 1000 MHz | 8–16 @ 1000 MHz | 48–576 @ 1000 MHz | 2.0 Full Profile | ||||
| Mali-G51[55] | Q4 2016 | 1–6[56] | 8 or 12 | 8–72 | 28 16 14 12 10 |
? | 1000 | 16–24 @ 1000 MHz | 16–144 @ 1000 MHz | |||||||||
| Mali-G71 | Q2 2016 | 1–32 | 12 | 12–384 | 16 14 10 |
? | 546–1037 | 128–2048 | 1850 | 1 @ 1000 MHz | 24 @ 1000 MHz | 24–768 @ 1000 MHz | ||||||
| Mali-G52 | Bifrost 2nd gen | Q1 2018 | 1–6 | 16 or 24 | 16–144 | 16 12 8 7 |
? | 850 | 32-512 | 2 @ 1000 MHz | 32–48 @ 1000 MHz | 32–288 @ 1000 MHz | 2.1 Full Profile | |||||
| Mali-G72 | Q2 2017 | 1–32 | 12 | 12–384 | 16 12 10 |
1.36 mm2 per shader core at 10 nm[57] | 572–1050 | 128–2048 | 1 @ 1000 MHz | 24 @ 1000 MHz | 24–768 @ 1000 MHz | 2.0 Full Profile | ||||||
| Mali-G76 | Bifrost 3rd gen | Q2 2018 | 4–20 | 24 | 96–480 | 12 8 7 |
? | 600–800 | 512–4096 | ? | 2 @ 1000 MHz | 2 @ 1000 MHz | 48 @ 1000 MHz | 192–960 @ 1000 MHz | 1.1 | 2.1 Full Profile | ||
| Mali-G57 | Valhall 1st gen | Superscalar engine + Unified memory +
simplified scalar ISA |
Q2 2019 | 1–6 | 32 | 32–192 | 12 7 6 |
? | 950[58] | 64–512 | ? | 4 @ 1000 MHz | 64 @ 1000 MHz | 64–384 @ 1000 MHz | ||||
| Mali-G77 | 7–16 | 224–512 | 7 6 |
? | 695–850 | 512–2048 | ? | 448–1024 @ 1000 MHz | ||||||||||
| Mali-G68 | Valhall 2nd gen | Q2 2020 | 1–6 | 32–192 | 6 5 3 |
64–384 @ 1000 MHz | 1.2 | 3.0 Full Profile | ||||||||||
| Mali-G78 | 7–24 | 224–768 | 5 | 759-848 | 448–1536 @ 1000 MHz | |||||||||||||
| Mali-G310 | Valhall 3rd gen | Q2 2021 | 1 | 16 or 32 or 64 | 16–64 | 6 5 4 |
256–1024 | 2, 4 or 8 @ 1000 MHz | 2 or 4 @ 1000 MHz | 32–128 @ 1000 MHz | ||||||||
| Mali-G510 | 2–6 | 48 or 64 | 96–384 | 4 or 8 @ 1000 MHz | 4 @ 1000 MHz | 96–128 @ 1000 MHz | 192–768 @ 1000 MHz | |||||||||||
| Mali-G610 | 1–6 | 64 | 64–384 | 512–2048 | 8 @ 1000 MHz | 128 @ 1000 MHz | 128–768 @ 1000 MHz | |||||||||||
| Mali-G710 | 7–16 | 448–1024 | 650,850 900 |
2648 | 896–2048 @ 1000 MHz | |||||||||||||
| Mali-G615 | Valhall 4th gen | Q2 2022 | 1–6 | 128 | 128–768 | 4 | 256 @ 1000 MHz | 256–1536 @ 1000 MHz | 1.3[59] | |||||||||
| Mali-G715 | 7–9 | 896–1152 | 1792–2304 @ 1000 MHz | |||||||||||||||
| Immortalis-G715 | 10–16 | 1280–2048 | 2560–4096 @ 1000 MHz | |||||||||||||||
| Mali-G620 | 5th Gen[60] | Deferred Vertex Shading (DVS) | Q2 2023 | 1–5 | 128–640 | 256–1024 | 256–1280 @ 1000 MHz | |||||||||||
| Mali-G720 | 6–9 | 768–1152 | 512–2048 | 1536–2304 @ 1000 MHz | ||||||||||||||
| Immortalis-G720 | Q4 2023 | 10–16 | 1280–2048 | 2560–4096 @ 1000 MHz | ||||||||||||||
| Mali-G625 | Q2 2024 | 1–5 | 128–640 | 4 3 |
256–1024 | 256–1280 @ 1000 MHz | ||||||||||||
| Mali-G725 | 6–9 | 768–1152 | 512–4096 | 1536–2304 @ 1000 MHz | ||||||||||||||
| Immortalis-G925 | 10–24 | 1280–3072 | 2560–6144 @ 1000 MHz | |||||||||||||||
| Mali G1-Pro | Q3 2025 | 1–5 | 128–640 | 3 | 512–2048 | 256–1280 @ 1000 MHz | 1.4 | |||||||||||
| Mali G1-Premium | 6–9 | 768–1152 | 512–4096 | 1536–2304 @ 1000 MHz | ||||||||||||||
| Mali G1-Ultra | 10–24 | 1280–3072 | 2560–6144 @ 1000 MHz | |||||||||||||||
| Model | Microarchi- tecture |
Type | Launch date | EUs/Shader core count | Shading Units (per core) | Total Shaders | Fab
(nm) |
Die size (mm2) |
Core clock rate (MHz) | Max L2 cache size (KiB) | Fillrate (per core) | FP32 GFLOPS (per core) |
GFLOPS (total) |
Vulkan | Open GL/ES |
Open CL | ||
Some microarchitectures (or just some chips?) support cache coherency for the L2 cache with the CPU.[61][62]
Adaptive Scalable Texture Compression (ASTC) is supported by Mali-T620, T720/T760, T820/T830/T860/T880[63] and Mali-G series.
Implementations
[edit]The Mali GPU variants can be found in the following systems on chips (SoCs):
| Vendor | SoC name | Mali version |
|---|---|---|
| Allwinner | Allwinner A1X (A10, A10s, A13) | Mali-400 MP[64][65][66] @ 300 MHz |
| A20, A23, A33, A64,[67] H2, H3, H64, R8, R16, R40, R18 | Mali-400 MP2[68] @ 350/350/350/600/600/?/?/?/?/?/? MHz | |
| H5 | Mali-450 MP4 | |
| H6 | Mali-T720 MP2 | |
| H313, H616, H618 | Mali-G31 MP2 | |
| Amlogic | 8726-M series (8726-M1, 8726-M3, 8726-M6, 8726-MX) | Mali-400 MP/MP2[69] @ 250/400 MHz |
| 8726-M8 series (M801, M802, S801, S802, S812) | Mali-450 MP6[69] @ 600 MHz | |
| 8726-M8B series (M805, S805) | Mali-450 MP2[69] @ 500 MHz | |
| S905, S905X/D/L | Mali-450 MP3 @ 750 MHz | |
| S905X2, S905X3, S905Y2, S905D2, S905X4[70] | Mali-G31 MP2 | |
| S905X5[71] | Mali-G310 @ 1 GHz[72] | |
| S912 | Mali-T820 MP3 @ 600 MHz | |
| S922X, A311D | Mali-G52 MP4 | |
| T966 | Mali-T830 MP2 @ 650 MHz | |
| ARM | Morello | Mali-G76 |
| Asus | Tinkerboard, Tinkerboard S | Mali-T760 |
| Baikal Electronics | Baikal-M | Mali-T628 MP8[73] |
| CSR | Quatro 5300 Series | Mali-400 MP |
| ELVEES Multicore | 1892VM14Ya | Mali-300 |
| InfoTM | iMAP×15 | Mali-400 |
| iMAP×820 | Mali-400 MP2 | |
| iMAP×912 | Mali-400 MP2 | |
| Tensor | Mali-G78 MP20 @ 848(996) MHz | |
| Tensor G2 | Mali-G710 MP7 @ 848(996) MHz | |
| Tensor G3 | Mali-G715 MP7 @ 890(900) MHz | |
| Tensor G4 | Mali-G715 MP7 @ 940 MHz | |
| HiSilicon | Kirin 620 | Mali-450 MP4 @ 533 MHz |
| Kirin 650/655/658/659 | Mali-T830 MP2 @ 900 MHz | |
| Kirin 710 | Mali-G51 MP4 @ 1000 MHz | |
| Kirin 810 | Mali-G52 MP6 @ 820 MHz | |
| Kirin 820 | Mali-G57 MP6 @??? MHz | |
| Kirin 910/910T | Mali-450 MP4 @ 533/700 MHz | |
| Kirin 920/925/928 | Mali-T628 MP4 @ 600/600/? MHz | |
| Kirin 930/935 | Mali-T628 MP4 @ 600/680 MHz | |
| Kirin 950/955 | Mali-T880 MP4 @ 900 MHz | |
| Kirin 960 | Mali-G71 MP8 @ 1037 MHz | |
| Kirin 970 | Mali-G72 MP12 @ 746 MHz | |
| Kirin 980 | Mali-G76 MP10 @ 720 MHz | |
| Kirin 985 | Mali-G77 MP8 @??? MHz | |
| Kirin 990/990 5G | Mali-G76 MP16 @ 600 MHz | |
| Kirin 9000 5G/Kirin 9000E 5G | Mali-G78 MP24/22 @ 759 MHz | |
| Hi3798cv200 | Mali-T720 @ 450/600 MHz | |
| Leadcore | LC1810, LC1811, LC1813, LC1913 | Mali-400[74][75][76][77] |
| LC1860, LC1860C, LC1960 | Mali-T628 MP2 @ 600 MHz | |
| MediaTek | MSD6683 | Mali-470 MP3 |
| MT5595, MT5890 | Mali-T624 MP3 | |
| MT5596, MT5891 | Mali-T860 MP2[78] | |
| MT6571, MT6572, MT6572M | Mali-400 MP1 @ ?/500/400 MHz | |
| MT6580 | Mali-400 MP1 @ 500 MHz | |
| MT6582/MT6582M | Mali-400 MP2 @ 500/416 MHz | |
| MT6588, MT6591, MT6592, MT6592M, MT8127 | Mali-450 MP4 @ 600/700/600/600 MHz[79] | |
| MT6735, MT6735M, MT6735P | Mali-T720 MP2 @ 600/500/400 MHz | |
| MT6737, MT6737T | Mali-T720 MP2 @ 550/600 MHz | |
| MT8735 | Mali-T720 MP2 @ 450 MHz | |
| MT6753 | Mali-T720 MP3 @ 700 MHz[80] | |
| MT6732, MT6732M, MT6752, MT6752M | Mali-T760 MP2 @ 500/500/700/700 MHz[81] | |
| MT6750 | Mali-T860 MP2 @ 520 MHz | |
| MT6755 (Helio P10/P15/P18) | Mali-T860 MP2 @ 700/650/800 MHz | |
| MT6757 (Helio P20, P25) | Mali-T880 MP2 @ 900 MHz/1.0 GHz[82] | |
| MT6797 (Helio X20/X23/X25/X27) | Mali-T880 MP4 @ 780/850/875 MHz | |
| MT6763T (Helio P23), MT6758 (Helio P30) | Mali-G71 MP2 @ 770/950 MHz[83][84] | |
| MT6771 (Helio P60, P70) | Mali-G72 MP3 @ 800/900 MHz[85][86] | |
| MT6768 (Helio P65), MT6769 (Helio G70/G80/G85/G88) | Mali-G52 MC2 @ 820/950/1000 MHz | |
| Helio G91 | Mali-G52 MC2 @ 1 GHz | |
| MT6785 (Helio G90/G90T/G95) | Mali-G76 MC4 @ 720/800/900 MHz | |
| MT6781 (Helio G96, G99) | Mali-G57 MC2 @ 950/1000 MHz | |
| MT6833 (Dimensity 700, 810, 6020) | Mali-G57 MC2 @ 950/1068/950 MHz | |
| MT6853 (Dimensity 720, 800U) | Mali-G57 MC3 @ 850 MHz | |
| MT6873 (Dimensity 800) | Mali-G57 MC4 @ 650 MHz | |
| MT6875 (Dimensity 820), MT6883Z (Dimensity 1000C) | Mali-G57 MC5 @ 900 MHz | |
| MT6877/MT6877T (Dimensity 900/920/1080/7050) | Mali-G68 MC4 @ 900 MHz | |
| MT6885Z (Dimensity 1000L) | Mali-G77 MC7 @ 695 MHz | |
| MT6889 (Dimensity 1000/1000+) | Mali-G77 MC9 @ 850 MHz | |
| MT6891/MT6893 (Dimensity 1100/1200/1300/8020/8050) | Mali-G77 MC9 @ 850 MHz | |
| MT8192 (Kompanio 820) | Mali-G57 MC5 GPU @ ??? MHz | |
| Kompanio 838 | Mali-G57 MC3 | |
| MT8195/MT8195T (Kompanio 1200/1380) | Mali-G57 MC5 GPU @ ??? MHz | |
| MT8791 (Kompanio 900T) | Mali-G68 MP4 GPU @ 900 MHz | |
| MT8797 (Kompanio 1300T) | Mali-G77 MP9 @ 850 MHz | |
| MT6886 (Dimensity 7200) | Mali-G610 MC4 @ 1.13 GHz | |
| MT6878 (Dimensity 7300/7300X) | Mali-G615 MC2 @ 1.05 GHz | |
| MT6895/MT6895Z/MT6896 (Dimensity 8000/8100/8200) | Mali-G610 MC6 @ 700/860/950 MHz | |
| Dimensity 8300 | Mali-G615 MC6 | |
| MT6983 (Dimensity 9000/9000+) | Mali-G710 MP10 @ 848/950 MHz | |
| MT6985 (Dimensity 9200/9200+) | Immortalis-G715 MP11 @ 981/1150 MHz | |
| MT6989 (Dimensity 9300) | Immortalis-G720 MP12 @ 1.3 GHz | |
| Dimensity 9400 | Immortalis-G925 MP12 | |
| NetLogic | Au1380, Au1350 | Mali-200[87][88] |
| Nufront | NS2816, NS2816M | Mali-400 MP |
| NS115, TL7688, TL7689 | Mali-400 MP2 | |
| NXP | i.MX95 | Mali-G310 MP1 |
| Realtek | RTD1294, RTD1295, RTD1296 | Mali-T820 MP3[89] |
| RTD1395 | Mali-470 | |
| Rockchip | RK2818 | Mali-200 |
| RK2926, RK2628, RK3036, RK3229 | Mali-400 MP @ 400/400/500/600 MHz[90] | |
| RK3026, RK3126, RK3128 | Mali-400 MP2 @ 500/600/600 MHz | |
| RK3066, RK3188, RK3188T | Mali-400 MP4 @ 266/533/~400 MHz[91][92] | |
| RK3288 | Mali-T760 MP4 @ 600 MHz | |
| RK3326 | Mali-G31 MP2[93][94] | |
| RK3328 | Mali-450 MP2 | |
| RK3399 | Mali-T860 MP4 @ 600 MHz | |
| RK3530, RK3566, RK3568 | Mali-G52[95] | |
| RK3588 | Mali-G610 MC4[96] | |
| Samsung | Exynos 3 Quad 3470 | Mali-400 MP4 @ 450 MHz[97] |
| Exynos 3 Quad 3475 | Mali-T720 @ 600 MHz | |
| Exynos 4 Dual 4210 | Mali-400 MP4 @ 266 MHz | |
| Exynos 4 Dual 4212 | Mali-400 MP4 @ 400 MHz | |
| Exynos 3 Quad 4412/4415 | Mali-400 MP4 @ 533 MHz | |
| Exynos 5 Dual 5250 | Mali-T604 MP4 @ 533 MHz[98] | |
| Exynos 5 Hexa 5260 | Mali-T624 MP4 @ 600 MHz | |
| Exynos 5 Octa 5420/5422 | Mali-T628 MP6 @ 533 MHz | |
| Exynos 5 Octa 5430/5800 | Mali-T628 MP6 @ 600 MHz | |
| Exynos 5 Hexa 7872 | Mali-G71 MP1 @ 1200 MHz | |
| Exynos 7 Octa 5433/7410 | Mali-T760 MP6 @ 700 MHz | |
| Exynos 7 Octa 7420 | Mali-T760 MP8 @ 772 MHz | |
| Exynos 7 Quad 7570 | Mali-T720 MP1 @ 830 MHz | |
| Exynos 7 Octa 7578/7580 | Mali-T720 MP2 @ 668 MHz | |
| Exynos 7 Octa 7870 | Mali-T830 MP1 @ 700 MHz | |
| Exynos 7 Octa 7880 | Mali-T830 MP3 @ 950 MHz | |
| Exynos 7 Series 7884A | Mali-G71 MP2 @ 450 MHz | |
| Exynos 7 Series 7884 | Mali-G71 MP2 @ 676/845 MHz | |
| Exynos 7 Series 7885 | Mali-G71 MP2 @ 1100 MHz | |
| Exynos 7 Series 7904 | Mali-G71 MP2 @ 770 MHz | |
| Exynos 850 | Mali-G52 MP1 @ 1001 MHz | |
| Exynos 880 | Mali-G76 MP5 @ 546 MHz | |
| Exynos 8 Octa 8890 | Mali-T880 MP10 (Lite) @650 MHz Mali-T880 MP12 @650 MHz | |
| Exynos 9 Octa 8895 | Mali-G71 MP20 @ 546 MHz[99][100] | |
| Exynos 7 Series 9609 | Mali-G72 MP3 | |
| Exynos 7 Series 9610 | Mali-G72 MP3 @ 1053 MHz[101] | |
| Exynos 7 Series 9611 | Mali-G72 MP3 @ 850 MHz | |
| Exynos 9 Series 9810 | Mali-G72 MP18 @ 572 MHz[102] | |
| Exynos 9 Series 9820 | Mali-G76 MP12 @ 702 MHz | |
| Exynos 9 Series 9825 | Mali-G76 MP12 @ 754 MHz | |
| Exynos 9 Series 980 | Mali-G76 MP5 @ 728 MHz | |
| Exynos 9 Series 990 | Mali-G77 MP11 @ 832 MHz | |
| Exynos 1080 | Mali-G78 MP10 @ 800 MHz | |
| Exynos 1280 | Mali-G68 MP4 @ 897 MHz | |
| Exynos 1330 | Mali G68 MP2 @ 949 MHz | |
| Exynos 1380 | Mali-G68 MP5 @ 949 MHz | |
| Exynos 2100 | Mali-G78 MP14 @ 854 MHz | |
| S5P6450 Vega | Mali-400 MP[103] | |
| Sigma Designs | SMP8750 Series | Mali-400 MP4 @ 350 MHz[104] |
| Socle-Tech | Leopard-6 | Mali-200[105] |
| Spreadtrum | SC68xx, SC57xx, SC77xx, SC8xxx, SC983x | Mali-400 MP Series[106] |
| SC9860, SC9860GV | Mali-T880 MP4 | |
| ST-Ericsson | NovaThor U9500, U8500, U5500 | Mali-400 MP[107] |
| STMicroelectronics | SPEAr1340 | Mali-200[108] |
| STi7108, STiH416 | Mali-400 MP[109][110] | |
| Telechips | TCC8803, TCC8902, TCC8900, TCC9201 | Mali-200[69][111] |
| WonderMedia | WM8750 | Mali-200 |
| WM8850, WM8950 | Mali-400 MP[112] | |
| WM8880, WM8980 | Mali-400 MP2 | |
| WM8860 | Mali-450 | |
| Xiaomi | Surge S1 | Mali-T860 MP4 @ 800 MHz[113][114] |
| Surge S2 | Mali-G71 MP12 @ 900 MHz (?)[115] |
Video processors
[edit]Mali Video is the name given to ARM Holdings' dedicated video decoding and video encoding ASIC. There are multiple versions implementing a number of video codecs, such as HEVC, VP9, H.264 and VP8. As with all ARM products, the Mali video processor is a semiconductor intellectual property core licensed to third parties for inclusion in their chips. Real time encode-decode capability is central to videotelephony. An interface to ARM's TrustZone technology is also built-in to enable digital rights management of copyrighted material.
Mali-V500
[edit]The first version of a Mali Video processor was the V500, released in 2013 with the Mali-T622 GPU.[116] The V500 is a multicore design, sporting 1–8 cores, with support for H.264 and a protected video path using ARM TrustZone. The 8 core version is sufficient for 4K video decode at 120 frames per second (fps). The V500 can encode VP8 and H.264, and decode H.264, H.263, MPEG4, MPEG2, VC-1/WMV, Real, VP8.
Mali-V550
[edit]Released with the Mali-T800 GPU, ARM V550 video processors added both encode and decode HEVC support, 10-bit color depth, and technologies to further reduced power consumption.[117] The V550 also included technology improvements to better handle latency and save bandwidth.[118] Again built around the idea of a scalable number of cores (1–8) the V550 could support between 1080p60 (1 core) to 4K120 (8 cores). The V550 supported HEVC Main, H.264, VP8, JPEG encode, and HEVC Main 10, HEVC Main, H.264, H.263, MPEG4, MPEG2, VC-1/WMV, Real, VP8, JPEG decode.
Mali-V61
[edit]The Mali V61 video processor (formerly named Egil) was released with the Mali Bifrost GPU in 2016.[119][120] V61 has been designed to improve video encoding, in particular HEVC and VP9, and to allow for encoding either a single or multiple streams simultaneously.[121] The design continues the 1–8 variable core number design, with a single core supporting 1080p60 while 8 cores can drive 4Kp120. It can decode and encode VP9 10-bit, VP9 8-bit, HEVC Main 10, HEVC Main, H.264, VP8, JPEG and decode only MPEG4, MPEG2, VC-1/WMV, Real, H.263.[122]
Mali-V52
[edit]The Mali V52 video processor was released with the Mali G52 and G31 GPUs in March 2018.[123] The processor is intended to support 4K (including HDR) video on mainstream devices.[124]
The platform is scalable from 1 to 4 cores and doubles the decode performance relative to V61. It also adds High 10 H.264 encode (Level 5.0) and decode (Level 5.1) capabilities, as well as AVS Part 2 (Jizhun) and Part 16 (AVS+, Guangdian) decode capability for YUV420.[125]
Mali-V76
[edit]The Mali V76 video processor was released with the Mali G76 GPU and Cortex-A76 CPU in 2018.[126] The V76 was designed to improve video encoding and decoding performance. The design continues the 2–8 variable core number design, with 8 cores capable of 8Kp60 decoding and 8Kp30 encoding. It claims improves HEVC encode quality by 25% relative to Mali-V61 at launch. The AV1 codec is not supported.
Mali-V77
[edit]The Mali V77 video processor was released with the Mali G77 GPU and Cortex-A77 CPU in 2019.
Comparison
[edit]| Mali Video | V500 | V550 | V61 | V52 | V76 | V77 |
|---|---|---|---|---|---|---|
| Announced | June 2, 2013[127] | October 27, 2014[128] | October 31, 2016[129] | March 6, 2018[130] | May 31, 2018[126] | |
| Recommended GPU | Mali-T800-series | Mali-G51 Mali-G72 |
Mali-G31 Mali-G52 |
Mali-G76 | Mali-G77 | |
| Recommended DPU | Mali-DP500 | Mali-DP550 Mali-DP650 |
Mali-DP650 Mali-D71 |
Mali-D52 | ||
| Memory system | MMU | |||||
| Bus interface | AMBA 3 AXI AMBA 4 ACE Lite |
AMBA AXI | AMBA4 AXI | |||
| Performance (enc) | 1080p60 (1 core) to 4K120 (8 core) | 1080p60 (1 core) to 4K60 (4 core) | 1080p60 (1 core) to 8K30 (8 core) | |||
| Performance (dec) | 1080p120 / 4K30 (1 core) to 4K120 (4 core) | 1080p120 / 4K30 (1 core) to 8K60 (8 core) | ||||
| Decode & encode | ||||||
| H.264 8-bit | D & E | D & E | D & E | D & E | D & E | D & E |
| H.264 10-bit | - | - | - | D & E | D & E | D & E |
| VP8 | D & E | D & E | D & E | D & E | D & E | D & E |
| JPEG | - | D & E | D & E | D & E | D & E | D & E |
| HEVC Main | - | D & E | D & E | D & E | D & E | D & E |
| HEVC Main 10 | - | D | D & E | D & E | D & E | D & E |
| VP9 8-bit | - | - | D & E | D & E | D & E | D & E |
| VP9 10-bit | - | - | D & E | D & E | D & E | D & E |
| AV1 | - | - | - | - | - | - |
Display processors
[edit]Mali-D71
[edit]The Mali-D71 added Arm Framebuffer Compression (AFBC) 1.2 encoder, support for ARM CoreLink MMU-600 and Assertive Display 5. Assertive Display 5 has support for HDR10 and hybrid log–gamma (HLG).
Mali-D77
[edit]The Mali-D77 added features including asynchronous timewarp (ATW), lens distortion correction (LDC), and chromatic aberration correction (CAC)[broken anchor]. The Mali-D77 is also capable of 3K (2880x1440) @ 120 Hz and 4K @ 90 Hz.[131]
| Mali Display | DP500[132][133] | DP550[134] | DP650[135][136] | D71[137][138][139] | D51 | D77[140][141] | D37[142] |
|---|---|---|---|---|---|---|---|
| Announced | May 8, 2010 | October 27, 2014 | January 20, 2016 | October 31, 2017 | March 6, 2018 | May 15, 2019 | October 23, 2019 |
| Optimized res | n/a | 720p (HD) to 1080p (FHD) | 1440p (QHD) | 1440p (QHD) to 2160p (UHD/4K) | 1080p (FHD) to 1440p (QHD) | 2880x1440 @ 120 Hz | 1080p (FHD) to 1440p (QHD) |
| Maximum res | 2160p (4K) | 2160p (4K) | 2160p (4K) | 2160p (4K) up to 120fps | 4096x2048 up to 60fps | 4320x2160 @ 120 Hz | |
| Launched alongside | Cortex-A17 core | Mali-T800 series GPU, V550 Video Processor | CoreLink MMU-600, Assertive Display 5 | Mali-G31, Mali-G52, Mali-V52 | Ethos-N77, Ethos-N57, Ethos-N37 Mali-G57 |
Image signal processors
[edit]Mali-C71
[edit]On April 25, 2017 the Mali-C71 was announced, ARM's first image signal processor (ISP).[143][144][145]
Mali-C52 and Mali-C32
[edit]On January 3, 2019 the Mali-C52 and C32 were announced, aimed at everyday devices including drones, smart home assistants and security, and internet protocol (IP) camera.[146]
Mali-C71AE
[edit]On September 29, 2020 the Mali-C71AE image signal processor was introduced, alongside the Cortex-A78AE CPU and Mali-G78AE GPU.[147] It supports up to 4 real-time cameras or up to 16 virtual cameras with a maximum resolution of 4096 x 4096 each.[148]
Mali-C55
[edit]On June 8, 2022 the Mali-C55 ISP was introduced as successor to the C52.[149][150] It is the smallest and most configurable image signal processor from Arm, and support up to 8 camera with a max resolution of 48 megapixel each. Arm claims improved tone mapping and spatial noise reduction compared to the C52. Multiple C55 ISPs can be combined to support higher than 48 megapixel resolutions.
Comparison
[edit]| Mali camera | C32 | C52 | C55 | C71 | C71AE |
|---|---|---|---|---|---|
| Announced | January 3, 2019 | June 8, 2022[149] | April 25, 2017 | September 29, 2020[151] | |
| Throughput | 600 MP/s | 1.2 GP/s | |||
| Support cameras | 4 | 8 | 4 real-time or 16 virtual | ||
| Max resolution | 4608×3456 (16 MP) | 8192×6144 (48 MP) | 4096×4096 (16MP) | ||
| Bit-depth (dynamic range) | 20-bit (20 stops) | 24-bit (24 stops) | |||
| Channel support | RGGB, RGBlr | RGGB | RGGB, RCCC, RGBIr, RCCB, RCCG | RGGB, RCCC, RCCB, RCCG, RGBIr | |
| up to 16 channels | |||||
| ASIL compliance | ASIL B / SIL 2
ASIL D / SIL 3 | ||||
The Lima, Panfrost and Panthor FOSS drivers
[edit]On January 21, 2012, Phoronix reported that Luc Verhaegen was driving a reverse-engineering attempt aimed at the Mali series of GPUs, specifically the Mali 200 and Mali 400 versions. The project was known as Lima and targeted support for OpenGL ES 2.0.[152] The reverse-engineering project was presented at FOSDEM, February 4, 2012,[153][154] followed by the opening of a website[155] demonstrating some renders. On February 2, 2013, Verhaegen demonstrated Quake III Arena in timedemo mode, running on top of the Lima driver.[156] In May 2018, a Lima developer posted the driver for inclusion in the Linux kernel.[157] In May 2019, the Lima driver became part of the mainline Linux kernel.[158] The Mesa userspace counterpart was merged at the same time. It currently supports OpenGL ES 1.1, 2.0 and parts of Desktop OpenGL 2.1, and the fallback emulation in MESA provides full support for graphical desktop environments.[159]
Panfrost is a reverse-engineered driver effort for Mali Txxx (Midgard) and Gxx (Bifrost) GPUs. A talk introducing Panfrost was presented at X.Org Developer's Conference 2018.[160] As of May 2019, the Panfrost driver is part of the mainline Linux kernel.[161] and MESA. Panfrost supports OpenGL ES 2.0, 3.0 and 3.1, as well as OpenGL 3.1.[162]
Later Collabora has developed[163] panthor driver for G310, G510, G710 GPUs.
See also
[edit]- Adreno – GPU developed by Qualcomm (formerly AMD, then Freescale)
- Atom family of SoCs – with Intel graphics core, not licensed to third parties
- AMD mobile APUs – with AMD graphics core, not licensed to third parties
- PowerVR – by Imagination Technologies
- Tegra – family of SoCs by Nvidia with the graphics core available as a SIP block to third parties
- VideoCore – family of SoCs by Broadcom with the graphics core available as a SIP block to third parties
- Vivante – available as SIP block to third parties
- Imageon – old AMD mobile GPU
- RDNA – by AMD, licensed to Samsung for use as GPUs in Exynos SoCs under Xclipse name
References
[edit]- ^ a b c d e Smith, Ryan (July 3, 2014). "ARM's Mali Midgard Architecture Explored". AnandTech. Archived from the original on July 3, 2014. Retrieved 2023-09-16.
- ^ "News". Arm.
- ^ Freddi Jeffries (2016-06-17). "Happy 10th Birthday Mali!". community.arm.com. Arm Limited. Retrieved 2021-12-19.
- ^ a b "Gaming Performance Unleashed with Arm's new GPUs - Announcements - Arm Community blogs - Arm Community". community.arm.com. 2022-06-28. Retrieved 2023-09-16.
- ^ "New Programmable Mali200 IP Core From Falanx Microsystems Delivers PC-level Graphics Quality for Mobile Devices".
- ^ a b https://static.docs.arm.com/dui0363/d/DUI0363D_opengl_es_app_dev_guide.pdf [dead link]
- ^ Smith, Ryan (July 3, 2014). "A Brief History of Mali - ARM's Mali Midgard Architecture Explored". AnandTech. Archived from the original on Oct 28, 2022.
- ^ "ARM Mali GPU Makes Advanced Graphics a Reality for All Consumers". ARM. 22 November 2010. Archived from the original on 2016-10-13.
- ^ "ARM intros new Mali-T604 GPU, touts 5x more performance". TechSpot. 2010-11-10. Retrieved 2023-09-16.
- ^ Shimpi, Anand Lal. "ARM's Mali-T658 GPU in 2013, Up to 10x Faster than Mali-400". AnandTech. Archived from the original on November 11, 2011. Retrieved 2023-09-16.
- ^ "ARM Mali-T604 Overview". community.arm.com. 2013-08-07. Retrieved 2023-09-16.
- ^ "ARM Mali-T604: New GPU & Architecture For Highest Performance & Flexibility". community.arm.com. 2013-09-11. Retrieved 2023-09-16.
- ^ Shimpi, Anand Lal. "ARM Announces 8-core 2nd Gen Mali-T600 GPUs". AnandTech. Archived from the original on August 8, 2012. Retrieved 2023-09-16.
- ^ "Killing Pixels - A New Optimization for Shading on ARM Mali GPUs". community.arm.com. 2013-09-11. Retrieved 2023-09-16.
- ^ "Introducing the ARM Mali-T700 GPU series: Innovated to (efficiently) power the next generation of devices". community.arm.com. 2013-10-29. Retrieved 2023-09-16.
- ^ "Mali GPU: An Abstract Machine - Frame Pipelining". community.arm.com. 2014-02-03. Retrieved 2023-09-16.
- ^ "Mali GPU: An Abstract Machine: Tile-based Rendering". community.arm.com. 2014-02-20. Retrieved 2023-09-16.
- ^ "Mali GPU: An abstract machine - the midgard shader core". community.arm.com. 2014-03-12. Retrieved 2023-09-16.
- ^ "Mali-T880 is set to Deliver the Premium Mobile Experience of 2016". community.arm.com. 2015-12-17. Retrieved 2023-09-16.
- ^ "Mali-G71: ARM's Most Powerful, Scalable, Efficient GPU". community.arm.com. 2016-05-27. Retrieved 2023-09-16.
- ^ Smith, Ryan. "ARM Unveils Next Generation Bifrost GPU Architecture & Mali-G71: The New High-End Mali". AnandTech. Archived from the original on May 31, 2016. Retrieved 2023-09-16.
- ^ "Mali-G72 new high performance GPU from Arm". community.arm.com. 2017-05-29. Retrieved 2023-09-16.
- ^ Smith, Ryan. "ARM Announces Mali-G72: Bifrost Refined for the High-End SoC". AnandTech. Archived from the original on May 29, 2017. Retrieved 2023-09-16.
- ^ "Mali-G76: Taking High-End Graphics To The Next Level". community.arm.com. 2018-05-31. Retrieved 2023-09-16.
- ^ Smith, Ryan; Frumusanu, Andrei (May 31, 2018). "Arm Announces Mali-G76 GPU: Scaling up Bifrost". AnandTech. Archived from the original on August 26, 2018. Retrieved 2023-09-16.
- ^ "Introducing Arm Mali-G77 GPU with Valhall architecture". community.arm.com. 2019-05-27. Retrieved 2023-09-16.
- ^ "Mali-G77". developer.arm.com. Retrieved 2023-09-16.
- ^ Frumusanu, Andrei. "Arm's New Mali-G77 & Valhall GPU Architecture: A Major Leap". AnandTech. Archived from the original on May 27, 2019. Retrieved 2023-09-16.
- ^ "True Immersive Entertainment through Arm Mali-G78 GPU". community.arm.com. 2020-05-26. Retrieved 2023-09-16.
- ^ Frumusanu, Andrei. "Arm Announces The Mali-G78 GPU: Evolution to 24 Cores". AnandTech. Archived from the original on May 26, 2020. Retrieved 2023-09-16.
- ^ "Mali-G78". developer.arm.com. Retrieved 2023-09-16.
- ^ "New Arm Mali GPUs for all consumer compute markets". community.arm.com. 2021-05-25. Retrieved 2023-09-16.
- ^ "Mali-G710 developer overview". community.arm.com. 2022-03-09. Retrieved 2023-09-16.
- ^ Frumusanu, Andrei. "Arm Announces New Mali-G710, G610, G510 & G310 Mobile GPU Families". AnandTech. Archived from the original on May 25, 2021. Retrieved 2023-09-16.
- ^ "Arm Immortalis-G715 Developer Overview". community.arm.com. 2023-03-20. Retrieved 2023-09-16.
- ^ "Documentation – Arm Developer". developer.arm.com. Retrieved 2023-09-16.
- ^ "Arm Immortalis-G715 deep dive: Ray tracing graphics for mobile". Android Authority. 2022-06-28. Retrieved 2023-09-16.
- ^ "Arm GPUs built on new 5th Gen GPU architecture - Announcements - Arm Community blogs - Arm Community". community.arm.com. 2023-05-29. Retrieved 2023-09-16.
- ^ "5th Gen Arm GPU Architecture". developer.arm.com. Retrieved 2023-09-16.
- ^ "TCS23: The complete platform for consumer computing - Announcements - Arm Community blogs - Arm Community". community.arm.com. 2023-05-29. Retrieved 2023-09-16.
- ^ "Immortalis-G720". developer.arm.com. Retrieved 2023-09-16.
- ^ "Initial support for ARM Mali Display Controller". Linux kernel mailing list. 2016-04-01.
- ^ "DRM: Add support for the ARM HDLCD display controller [LWN.net]".
- ^ "Documentation – Arm Developer". developer.arm.com. Retrieved 2023-09-16.
- ^ a b Tsai, Alan (2012). "OpenGL and OpenGL ES" (PDF). Khronos Group. Archived (PDF) from the original on Apr 27, 2023.
- ^ "ARM Heralds New Era In Embedded Graphics With Next-Generation Mali GPU". Archived from the original on 2016-08-11.
- ^ "Mali-T658 GPU Extends Graphics And GPU Compute Leadership For High Performance Devices". Archived from the original on 2016-08-11.
- ^ "ARM Targets 580 Million Mid-Range Mobile Devices with New Suite of IP". Archived from the original on 2016-08-12.
- ^ Shimpi, Anand Lal. "ARM Announces 8-core 2nd Gen Mali-T600 GPUs". AnandTech. Archived from the original on August 8, 2012. Retrieved 2023-01-15.
- ^ "The Samsung Exynos 7420 Deep Dive – Inside A Modern 14nm SoC". AnandTech. Archived from the original on June 30, 2015. Retrieved 2015-06-15.
- ^ a b c depending on number of shader core groups
- ^ "Vulkan – Mali Developer Center". ARM. Archived from the original on 2016-12-04. Retrieved 2016-06-16.
- ^ Supported as of June 2016 with r12p0 userspace driver
- ^ "Mali-G31 GPU". ARM Developer. Retrieved 2018-11-02.
- ^ Lynch, Doug (31 October 2016). "ARM announces their second Bifrost GPU – the Mali-G51". XDA Developers. Archived from the original on 5 December 2020. Retrieved 31 October 2016.
- ^ "Mali-G51". developer.arm.com.
- ^ "The Exynos 9810 – Introducing Meerkat". AnandTech. Archived from the original on March 28, 2018. Retrieved 2018-02-28.
- ^ "MediaTek Dimensity 700". MediaTek. Archived from the original on 2020-11-10. Retrieved 2021-07-24.
- ^ "The Khronos Group". The Khronos Group. 2022-06-28. Retrieved 2022-06-28.
- ^ "5th Gen Arm GPU Architecture".
- ^ "Resources and Information" (PDF). ww16.heterogeneouscompute.org. Retrieved 2023-01-15.
- ^ http://www.chipdesignmag.com/pallab/2011/06/30/arm-mali-gpu-unifying-graphics-across-platforms/ Archived 2017-02-14 at the Wayback Machine Heterogeneous Computing
- ^ "Graphics and Multimedia". Arm. Retrieved 2023-01-15.
- ^ "A10". Allwinner Technology. Archived from the original on 2012-12-01. Retrieved 2012-12-13.
- ^ "A10s". Allwinner Technology. Archived from the original on 2012-12-13. Retrieved 2012-12-13.
- ^ "A13". Allwinner Technology. Archived from the original on 2013-04-17. Retrieved 2012-12-13.
- ^ "A64". Linux-sunxi community. November 27, 2019. Retrieved 2022-05-26.
- ^ "AllWinner Publishes A31 and A20 Processors Details". CNXSoft. December 9, 2012. Retrieved 2012-12-09.
- ^ a b c d "Hardware". 2012-02-08. Archived from the original on 2012-02-08. Retrieved 2023-01-15.
- ^ Aufranc (CNXSoft), Jean-Luc (2020-12-28). "Amlogic S905X3 vs S905X4 - Features Comparisons - CNX Software". CNX Software - Embedded Systems News. Retrieved 2024-04-26.
- ^ "Upgraded 4K Chromecast with Google TV set to launch soon with improved hardware". TechSpot. 2024-04-25. Retrieved 2024-04-26.
- ^ Pandey, Rajesh (2023-09-19). "Next-gen Android TV boxes could pack all the right upgrades". Android Police. Retrieved 2024-04-26.
- ^ "Baikal-M". Archived from the original on 2017-09-18. Retrieved 2018-06-02.
- ^ "LC1810". Leadcore. Archived from the original on 2015-01-27. Retrieved 2014-08-16.
- ^ "LC1811". Leadcore. Archived from the original on 2015-01-26. Retrieved 2014-08-16.
- ^ "LC1813". Leadcore. Archived from the original on 2014-08-19. Retrieved 2014-08-16.
- ^ "LC1913". Leadcore. Archived from the original on 2014-08-19. Retrieved 2014-08-16.
- ^ "MediaTek MT5596 Digital TV". 2019-08-27. Archived from the original on 2019-04-25. Retrieved 2017-12-31.
- ^ "Mediatek MT8127". Archived from the original on November 15, 2014.
- ^ "MediaTek MT6753 datasheet | Processor Specs | PhoneDB".
- ^ Hinum, Klaus. "ARM Mali-T760 MP2". Notebookcheck.
- ^ "MediaTek Helio Smartphones, IoT, Automotive & Connectivity". MediaTek. 2018-03-26. Retrieved 2018-03-26.
- ^ "MediaTek Helio Smartphones, IoT, Automotive & Connectivity". MediaTek. 2017-12-06. Retrieved 2017-12-06.
- ^ "MediaTek Helio Smartphones, IoT, Automotive & Connectivity". MediaTek. 2017-12-06. Retrieved 2017-12-06.
- ^ "MediaTek Helio Smartphones, IoT, Automotive & Connectivity". MediaTek. 2018-03-26. Retrieved 2018-03-26.
- ^ "MediaTek Helio Smartphones, IoT, Automotive & Connectivity". MediaTek. 2018-10-25. Retrieved 2018-10-26.
- ^ "NetLogic Au1300". Archived from the original on 2012-02-21. Retrieved 2012-02-26.
- ^ "RMI Au1300 Press release".
- ^ cnxsoft (2017-06-07). "Realtek RTD1296 STB/Media NAS SoC Coming Soon with Multiple Ethernet Ports, Dual SATA, HDMI 2.0 Input and Output". CNXSoft – Embedded Systems News. Retrieved 2019-02-02.
- ^ "Filla Sapphire 7 Product Specification". Archived from the original on 2013-10-19. Retrieved 2022-03-01.
- ^ "News". Arm. Retrieved 2023-01-15.
- ^ http://sg.finance.yahoo.com/news/28nm-quad-core-era-rk3188-144500035.html [permanent dead link]
- ^ "Rockchip-瑞芯微电子股份有限公司". www.rock-chips.com. Retrieved 2018-06-14.
- ^ "Rockchip-瑞芯微电子股份有限公司". www.rock-chips.com. Retrieved 2018-06-14.
- ^ Aufranc, Jean-Luc (2019-04-24). "RK3588 8K Arm Cortex-A76/A55 SoC, Rockchip Roadmap to 2020". CNXSoft – Embedded Systems News. Retrieved 2019-05-01.
- ^ "Rockchip-瑞芯微电子股份有限公司".
- ^ (in English) ARM Powered Smartphone Sets New Graphics Benchmark Archived 2013-01-06 at the Wayback Machine on ARM blog
- ^ "Samsung Confirms Mali is in the Exynos 5250 Processor – ARM Community". Archived from the original on 2013-02-18. Retrieved 2012-03-10.
- ^ Smith, Ryan. "Samsung Announces Exynos 8895 SoC: 10nm, Mali G71MP20, & LPDDR4x". Archived from the original on February 23, 2017. Retrieved 2017-03-12.
- ^ Humrick, Matt. "Samsung Galaxy S8 Showdown: Exynos 8895 vs. Snapdragon 835, Performance & Battery Life Tested". Archived from the original on July 30, 2017. Retrieved 2017-12-06.
- ^ "Exynos 7 Series 9610 Processor: Specs, Features | Samsung Exynos". Samsung Semiconductor. Retrieved 2018-03-22.
- ^ "Samsung Honored for Outstanding Design and Engineering with 36 CES 2018 Innovation Awards". news.samsung.com. Retrieved 2017-12-06.
- ^ "Hardkernel ODROID-E7 development board". Archived from the original on March 15, 2012.
- ^ "Sigma Designs announces HEVC capable SMP8750 family". Archived from the original on 2017-09-20. Retrieved 2017-04-14.
- ^ "Socle Leopard-6 SoC". www.socle-tech.com.tw. Archived from the original on 2012-09-10. Retrieved 2012-02-26.
- ^ Spreadtrum SC8819 Archived 2014-12-04 at the Wayback Machine
- ^ ST-Ericsson NovaThor product page Archived 2012-02-13 at the Wayback Machine
- ^ STMicro SPEAr1340 Archived 2012-07-03 at the Wayback Machine
- ^ STMicro STi7108 Press release
- ^ STMicro STiH416 Press release
- ^ 2010, ARM Limited: TCC8900 development board Archived 2011-12-27 at the Wayback Machine
- ^ "WonderMedia Prizm WM8950 Press release". Archived from the original on 2012-01-10. Retrieved 2012-02-26.
- ^ "Hardware.Info Nederland". nl.hardware.info (in Dutch). Retrieved 2017-12-06.
- ^ "Surge S1 – Xiaomi". Retrieved 2017-12-06.
- ^ "Surge S2 – Xiaomi". Retrieved 2017-12-06.
- ^ Lal Shimpi, Anand (2 June 2013). "ARM MaliT622 V500 Video Block Complement Cortex A12". Anandtech.com. Archived from the original on June 8, 2013. Retrieved 16 June 2016.
- ^ Smith, Ryan (27 October 2014). "ARM Announces Mali V550 Video Processor and Mali P550 Display Processor". Anandtech.com. Archived from the original on October 28, 2014. Retrieved 16 June 2016.
- ^ Sims, Gary (27 October 2014). "Mali-T860 GPU headlines new range of integrated media designs from ARM". Android Authority. Retrieved 24 June 2016.
- ^ Smith, Ryan (16 June 2016). "ARM Announces Mali Egil Video Processor". Anandtech.com. Archived from the original on June 19, 2016. Retrieved 17 June 2016.
- ^ Smith, Ryan (31 October 2016). "ARM Announces Mali-G51 Mainstream GPU, Mali-V-61 Video Processing Block". Anandtech. Archived from the original on November 2, 2016. Retrieved 1 November 2016.
- ^ Walrath, Josh (16 June 2016). "ARM Introduces Egil Video Processor: 4K 120 Hz the Top!". TechReport.com. Retrieved 28 June 2016.
- ^ "Graphics and Multimedia Processors | Mali-V61 – Arm Developer". ARM Developer. Arm Ltd. Archived from the original on 2017-10-14. Retrieved 2017-10-14.
- ^ Wiggers, Kyle (6 March 2018). "ARM Introduces the Mali-G52/G31 GPUs and the Mali-D51 Display and Mali-V52 Video Processors". Xda Developers. Retrieved 2 June 2018.
- ^ Tyson, Mark (7 March 2018). "Arm launches Mali-G52 and Mali-G31 mainstream GPUs". Hexus. Retrieved 2 June 2018.
- ^ "Arm Introduces Mali-G52 & Mali-G31 GPUs, Mali-D51 Display Processor, and Mali-V52 Video Processor for Mainstream Devices". 6 March 2018. Retrieved 21 May 2020.
- ^ a b Smith, Ryan (31 May 2018). "ARM Announces Mail-V76 Video Processor". Anandtech. Archived from the original on August 26, 2018. Retrieved 2 June 2018.
- ^ Shimpi, Anand Lal. "ARM Mali-T622 & V500 Video Block Complement Cortex A12". Archived from the original on June 8, 2013. Retrieved 2017-11-27.
- ^ Smith, Ryan. "ARM Announces Mali-V550 Video Processor & Mali-DP550 Display Processor". Archived from the original on October 28, 2014. Retrieved 2017-11-27.
- ^ Smith, Ryan. "ARM Announces Mali-G51 Mainstream GPU, Mali-V-61 Video Processing Block". Archived from the original on November 2, 2016. Retrieved 2017-11-27.
- ^ "Mali-G52". developer.arm.com.
- ^ Frumusanu, Andrei. "Arm Announces Mali D77 Display Processor: Facilitating AR & VR". AnandTech. Archived from the original on May 15, 2019. Retrieved 2019-05-28.
- ^ "Is the future as good as it used to be?". Arm Community. Retrieved 2017-11-27.
- ^ "Graphics and Multimedia Processors | Mali Display Processors – Arm Developer". ARM Developer. Arm Ltd. Retrieved 2017-11-27.
- ^ "Graphics and Multimedia Processors | Mali-DP550 – Arm Developer". ARM Developer. Arm Ltd. Retrieved 2017-11-27.
- ^ "ARM enables power efficient, enhanced visual experiences on 2.5K and 4K mobile devices with the Mali-DP650 Display Processor". Arm Community. Retrieved 2017-11-27.
- ^ "Graphics and Multimedia Processors | Mali-DP650 – Arm Developer". ARM Developer. Arm Ltd. Retrieved 2017-11-27.
- ^ "Mali-D71 and the Next Generation Display Solution". Arm Community. Retrieved 2017-11-27.
- ^ Oh, Nate. "Arm Announces New Mali-D71 Display Processor and IP Blocks". Archived from the original on November 1, 2017. Retrieved 2017-11-27.
- ^ "Graphics and Multimedia Processors | Mali-D71 – Arm Developer". ARM Developer. Arm Ltd. Retrieved 2017-11-27.
- ^ "Introducing the Arm Mali-D77 Display Processor for VR – Graphics and Gaming blog – Graphics and Gaming". Arm Community. Retrieved 2019-05-15.
- ^ Frumusanu, Andrei. "Arm Announces Mali D77 Display Processor: Facilitating AR & VR". Anandtech. Archived from the original on May 15, 2019. Retrieved 2019-05-15.
- ^ Frumusanu, Andrei. "Arm Announces New Ethos-N57 and N37 NPUs, Mali-G57 Valhall GPU and Mali-D37 DPU". Anandtech. Archived from the original on October 23, 2019. Retrieved 2019-10-24.
- ^ Smith, Ryan. "ARM Announces Mali-C71: Their First Automotive-Grade Image Signal Processor". Archived from the original on April 26, 2017. Retrieved 2017-11-27.
- ^ "Mali-C71 driving image processing for automotive". Arm Community. Retrieved 2017-11-27.
- ^ "Graphics and Multimedia Processors | Mali Camera – Arm Developer". ARM Developer. Arm Ltd. Retrieved 2017-11-27.
- ^ "A sharper digital eye for intelligent devices". Arm Ltd. Retrieved 2019-05-23.
- ^ "New Arm technologies enable safety-capable computing solutions for an autonomous future". Arm Ltd. Retrieved 2022-06-28.
- ^ "Arm Mali-C71AE: High performance ISP with advanced safety". Arm Community. 29 September 2020. Retrieved 2022-06-28.
- ^ a b "Arm introduces new image signal processor to advance vision systems for IoT and embedded markets". Arm Ltd. Retrieved 2022-06-28.
- ^ "Arm Mali-C55: Image processing with smallest silicon area and highest performance - Internet of Things (IoT) blog". Arm Community. 8 June 2022. Retrieved 2022-06-28.
- ^ "New Arm technologies enable safety-capable computing solutions for an autonomous future". Arm Ltd. Retrieved 2022-06-28.
- ^ "Announcing The Lima Open-Source GPU Driver". www.phoronix.com. January 27, 2012. Retrieved 2023-01-15.
- ^ "An Open-Source, Reverse-Engineered Mali GPU Driver". www.phoronix.com. January 27, 2012. Retrieved 2023-01-15.
- ^ "Liberating ARM's Mali GPU". FOSDEM. January 21, 2012. Archived 2012-01-27 at the Wayback Machine
- ^ "limadriver". 2012-02-07. Archived from the original on 2012-02-07. Retrieved 2023-01-15.
- ^ "Quake 3 Arena timedemo on top of the lima driver!" Archived 2013-02-09 at the Wayback Machine
- ^ "Lima DRM driver". lwn.net. Retrieved 2023-01-15.
- ^ "Linux 5.2 DRM Makes Icelake Production-Ready, Adds Lima & Panfrost Drivers". www.phoronix.com. Retrieved 2024-01-27.
- ^ "Lima". The Mesa 3D Graphics Library. Retrieved 2022-07-05.
- ^ Rosenzweig, Alyssa; Abbott, Connor; Paul, Lyude; BiOpen. "Panfrost: A reverse engineered FOSS driver for Mali Midgard and Bifrost GPUs" (PDF).
- ^ "kernel/git/torvalds/linux.git - Linux kernel source tree". git.kernel.org. Retrieved 2023-01-15.
- ^ "Panfrost". The Mesa 3D Graphics Library. Retrieved 2022-07-05.
- ^ "Release the panthor!".
External links
[edit]- Graphics Processing from ARM website
- Mali Developer Center Archived 2017-01-07 at the Wayback Machine a developer focused site run by ARM
- V500
- V550
- Lima driver
Mali (processor)
View on GrokipediaOverview
History
ARM acquired Falanx Microsystems in June 2006, integrating its graphics technology to address the growing demand for advanced mobile graphics processing, with the initial Mali-55 GPU marking the start of the Mali family.[5] This move positioned ARM to provide dedicated GPU IP for power-constrained mobile devices, building on Falanx's research from the Norwegian University of Science and Technology. The Mali-55 was followed by announcements of the Mali-200 in early 2007, which became the first commercially licensed Mali IP in 2008, achieving OpenGL ES 2.0 conformance and enabling higher-resolution graphics in early smartphones and tablets.[6] Key milestones in the 2010s included a shift toward open-source driver development in 2012, with the community-driven Lima project releasing initial open-source code for Mali-200 and Mali-400 GPUs to foster broader ecosystem adoption.[7] Architectural evolution accelerated with the introduction of the Midgard architecture in 2010, transitioning from fixed-function pipelines to unified shaders for improved flexibility and efficiency in handling diverse workloads like OpenGL ES 3.0.[8] This period also saw ARM's IP licensing model gain traction, with major SoC vendors such as Samsung, MediaTek, and Allwinner integrating Mali GPUs into their platforms for cost-effective, high-performance graphics in consumer devices.[9][10][11] Subsequent advancements focused on enhancing realism and AI capabilities, with ray tracing first introduced in the Immortalis series in 2022[12] and further advanced as part of the fifth-generation GPU architecture announced in May 2023, exemplified by the Immortalis-G720 for flagship mobile gaming.[13] In September 2025, ARM released the Mali G1-Ultra, built on the fifth-generation architecture, which enhances AI processing and doubles ray-tracing performance for desktop-quality visuals in mobile SoCs, along with a new branding scheme that drops the Immortalis and Cortex names in favor of G1 for GPUs and C1 for CPUs.[14] The evolution toward fully open-source drivers gained momentum starting in 2017 with community-driven efforts like Panfrost, improving Linux compatibility for Mali hardware.[15]Key features and licensing
The Mali processors feature a highly scalable design, allowing configurations from single-core setups for low-power embedded applications to multi-core clusters with up to 24 cores, such as in the Mali-G78AE, to address diverse needs in mobile devices, automotive systems, and high-end computing.[1] This modularity enables licensees to tailor performance and area trade-offs without redesigning core architectures, supporting applications from IoT sensors to flagship smartphones.[16] Mali GPUs provide broad API compatibility, including OpenGL ES up to version 3.2, Vulkan from 1.0 to 1.3, and OpenCL 1.2 and 2.0 for compute tasks.[17][18] DirectX support is achieved through translation layers in environments like Windows on ARM, facilitating cross-platform development.[19] Power efficiency is a cornerstone of Mali's architecture, optimized for battery-constrained devices through tile-based deferred rendering, which divides the framebuffer into small tiles (typically 16x16 pixels) processed on-chip to minimize external memory bandwidth and reduce power draw by up to 50% compared to immediate-mode rendering.[20][21] Additional features include dynamic voltage and frequency scaling (DVFS) for adaptive power management based on workload demands, and clock gating to disable inactive circuit blocks, further enhancing energy efficiency in tiled architectures.[22][23] Licensing for Mali IP follows ARM's standard model, where the company provides synthesizable register-transfer level (RTL) designs for custom integration or pre-placed GDSII hard macros for faster implementation, accompanied by upfront fees and royalties calculated per shipped device.[24][25] This structure allows partners to incorporate Mali into their SoCs while ARM handles ongoing optimizations. ARM collaborates with leading foundries like TSMC to enable fabrication on advanced nodes, including 3nm and 2nm processes targeted for production in 2025, ensuring compatibility with cutting-edge manufacturing.[26] Mali processors are deeply integrated into major ecosystems, powering graphics and compute in billions of Android devices, Linux-based systems via open-source Mesa drivers, and Windows on ARM platforms for mixed-reality and productivity applications.[1] Later generations, such as the Immortalis series, incorporate hardware-accelerated ray tracing units (RTUv2) for realistic lighting effects and dedicated AI engines for on-device machine learning inference, delivering up to 2x improvements in ray tracing throughput and ML performance.[14][27]Graphics processors
Utgard architecture
The Utgard architecture represents the inaugural generation of Arm's Mali GPU family, introduced as the first programmable shader core design featuring dedicated fixed-function vertex and fragment processors without unified shaders. This pre-unified approach separated geometry processing in the vertex stage from pixel shading in the fragment stage, enabling efficient handling of 2D and 3D graphics pipelines. A core innovation was its tile-based deferred rendering technique, which divides the screen into small tiles (typically 16x16 pixels) processed in on-chip memory buffers, significantly reducing external DDR memory bandwidth demands and enhancing power efficiency for battery-constrained mobile devices. This architecture supported up to 4x multi-sample anti-aliasing (MSAA) directly in hardware, further optimizing rendering quality while minimizing overdraw.[28][29][30] The Utgard lineup began with the Mali-55 in 2007 as a proof-of-concept for low-cost, fixed-function graphics, featuring a pixel processor for rasterization compliant with OpenGL ES 1.1 and relying on CPU software for geometry tasks, with no programmable shaders. This was followed by the programmable Mali-200 and Mali-300 in 2007-2008, which introduced vertex shader support for OpenGL ES 2.0 alongside fragment processing, targeting enhanced mobile UIs and basic 3D games. The Mali-400 series, launched in 2008, expanded to multi-processor (MP) configurations with 1-4 fragment cores for scalable performance, while the Mali-450 in 2012 doubled scalability to 1-8 cores, all maintaining the fixed-function vertex setup shared across the family. These models were produced through 2012, with core counts configurable to balance area, power, and throughput in system-on-chip (SoC) integrations.[31][6][32] Performance scaled with core count and process node, with the Mali-400 MP4 achieving up to 55 million triangles per second (Mtri/s) and 2.0 gigapixels per second (Gpix/s) fill rate at 500 MHz on a 28nm high-performance mobile (HPM) process, sufficient for 1080p resolutions in early smartphones. Targeted at feature phones and entry-level smartphones, these GPUs operated at clock speeds from 210-500 MHz, prioritizing low power over raw compute, with implementations as small as 1.4 mm² die area on 90nm for the Mali-55. Key innovations included hardware acceleration for both 2D (via OpenVG 1.1) and 3D graphics, full-scene anti-aliasing up to 16x, high dynamic range (HDR) rendering, and transaction elimination to further cut memory traffic by up to 75% in tiled operations. The Mali-400, for instance, powered the Samsung Galaxy S smartphone in 2010, enabling OpenGL ES 2.0-compliant 3D games and UI effects in one of the first widely adopted Android devices.[30][28] Despite these advances, the Utgard architecture's graphics-only focus and absence of compute shader support limited its applicability to general-purpose parallel processing tasks like OpenCL, rendering it unsuitable for emerging workloads beyond rendering. This fixed-function design was eventually superseded by the Midgard architecture to address demands for unified shaders and broader API compatibility.[33][28]Midgard architecture
The Midgard architecture represents ARM's first-generation unified shader design for Mali GPUs, introduced in 2012 as a significant advancement over prior fixed-function approaches by enabling programmable shaders for vertex, fragment, and compute processing within a single core type.[34] This shift to scalar unified shaders allowed for flexible workload handling, with each shader core featuring multiple arithmetic pipelines—typically two, increasing to three in later models like the Mali-T880—that process instructions via SIMD vectorization on 128-bit quad-word registers, supporting 4 FP32 operations, 8 FP16, or 16 int8 per pipeline per clock.[35] Branch handling was improved through a massively multi-threaded execution model, where hundreds of independent scalar threads per core mask latency from divergent paths by rapidly switching contexts, avoiding the penalties of lockstep execution in more rigid SIMD designs.[35] Building briefly on Utgard's tile-based rendering foundation, Midgard retained deferred lighting and on-chip tile buffers for power efficiency while adding full programmability to support emerging APIs.[36] Midgard evolved across four generations, scaling in core count and efficiency to meet diverse device needs. The first generation, launched with the Mali-T604 in 2012, supported up to four cores and marked the architecture's debut, powering early high-end tablets like the Google Nexus 7 (2012). The second generation (2013) included models such as the Mali-T622, T624, and T628, offering up to eight cores with enhanced power management for mid-range devices.[37] Third-generation variants like the Mali-T720, T760, and T820 (announced in 2013 but shipping around 2015) pushed scalability to 16 cores, with the T760 delivering 400% better energy efficiency than the T604 through optimized pipelines and larger L2 caches (up to 2 MB).[38] The fourth generation (2016), comprising the Mali-T830, T860, and T880, further refined this with up to 16 cores and support for more complex rendering, as seen in smartphones like the Samsung Galaxy S6 (T760 MP8 variant).[38] Performance scaled with configuration, culminating in the Mali-T880 multi-processor variants, which could achieve over 100 GFLOPS of FP32 compute in 12-core setups at typical mobile clocks around 800 MHz, driven by three arithmetic pipelines per core enabling up to 12 FP32 operations per core per cycle.[35] All Midgard GPUs supported OpenGL ES 3.1 and Vulkan 1.0, alongside OpenCL 1.2 for compute tasks, enabling features like multi-sample anti-aliasing up to 16x and adaptive scalable texture compression (ASTC) for bandwidth reduction.[39] Key features emphasized adaptive scalability, allowing integrators to configure 1 to 16 cores per design to balance power and performance across low- to high-end SoCs.[38] A dedicated job manager handled task distribution, pipelining vertex, tiling, and fragment jobs to optimize throughput while minimizing host CPU overhead.[39] Additional efficiencies included transaction elimination (reducing memory writes by 16x in 16x16 pixel blocks) and Arm Frame Buffer Compression (AFBC) for on-chip storage, contributing to overall system energy savings in tile-based rendering.[39] Midgard laid the groundwork for GPU compute workloads by unifying shader types and exposing general-purpose memory access, but its scalar, thread-level execution model—lacking wave-level (SIMD lockstep) primitives—limited occupancy and efficiency in highly divergent or bandwidth-bound kernels compared to later architectures.[35]Bifrost architecture
The Bifrost architecture represents the second-generation unified shader design in the Mali GPU family, succeeding Midgard by introducing enhancements in execution efficiency and power management while maintaining a tile-based deferred rendering approach.[40] It features programmable shader cores capable of handling vertex, fragment, and compute workloads through a single unified pipeline, with each core comprising multiple execution engines (EEs) that support dual-issue capabilities for issuing two instructions per cycle to improve throughput.[41] The arithmetic pipelines employ warp-based vectorization, operating on 4-wide warps for scalar 32-bit operations but scaling to 16-wide SIMD execution for lower-precision formats like INT8, enabling efficient processing of diverse workloads including texture sampling and compute tasks.[42] Bifrost GPUs are organized into generations, with the first generation encompassing models like the Mali-G71 (announced in 2016) and entry-level variants such as the G52 (2018), followed by the second-generation Mali-G72 (2017) and third-generation Mali-G76 (2018), which support up to 20 cores in multi-processor configurations for scalable performance.[43][44][45] Key improvements include refined texture sampling units that deliver one bilinear texel per clock in small cores (scaling to two in larger cores) and optimized depth texture processing reduced to a single cycle, enhancing rendering efficiency for complex scenes.[41] The architecture also incorporates better power gating through support for mixed-precision operations (INT8, INT16, FP16), allowing dynamic scaling of compute resources to minimize energy use during varying workloads.[42] Performance in Bifrost scales with core count and clock speed, for instance the Mali-G52 MC2 achieves ~80–100 GFLOPS in theoretical FP32 performance, with the Mali-G76 offering up to 46% greater graphics processing power compared to its predecessor while achieving 178% improved energy efficiency, making it suitable for high-end mobile applications.[46][47] It provides full support for Vulkan 1.1 and OpenCL 2.0, enabling advanced graphics rendering and general-purpose computing on GPUs.[40] L2 cache enhancements feature a unified logical cache (configurable from 128 KB to 4 MB across implementations) that reduces partial line writes to memory, improving bandwidth efficiency particularly with LPDDR4 interfaces.[48] Bifrost's design advances prepare it for machine learning applications through efficient compute shaders and INT8 dot product support in later models like the G76, boosting ML inference performance by up to 17% over prior generations via optimizations in thread occupancy and register file size (up to 64 64-bit registers per thread).[49][50] Early implementations include the Mali-G71 in devices like the Samsung Galaxy S8, while the G76 powers the Huawei Kirin 980 SoC in smartphones such as the Huawei Mate 20.[43][51]Valhall architecture
The Valhall architecture is Arm's fourth-generation GPU microarchitecture for the Mali series, succeeding Bifrost and emphasizing enhanced efficiency through innovative compression techniques and scalable design. It builds on Bifrost's dual-issue execution model by incorporating shader core compression that achieves up to 2x density in performance per silicon area compared to prior generations, enabling more compute resources within the same die space. Larger register files support up to 64 32-bit registers per thread, with full occupancy at 32 registers to accommodate complex shaders without sacrificing parallelism. Valhall also introduces native support for mesh shaders via Vulkan extensions, allowing developers to generate and cull geometry more efficiently on the GPU. Valhall evolved across four generations, beginning with the first in 2019 featuring the entry-level Mali-G57 and premium Mali-G77 models, which prioritized power efficiency for mobile devices. The second generation arrived in 2020 with the Mali-G68 for mainstream applications and the high-end Mali-G78, scalable to 24 cores for demanding workloads. The third generation, launched in 2022, included the ultra-low-power Mali-G310 and mid-range Mali-G610, optimizing for broader deployment in wearables and IoT. The Mali-G610 MC4 provides approximately 4-5x higher GPU performance than the Mali-G52 MC2 in benchmarks like AnTuTu, attributed to the newer Valhall architecture, increased core count, and efficiency improvements, making it suitable for mid-range gaming and multitasking.[52] The fourth and final generation in 2023 delivered the Mali-G715 for general use and the Immortalis-G715 variant with dedicated ray tracing hardware, supporting up to 12 cores in premium configurations. Performance highlights include the Mali-G78 MP24 configuration reaching up to 1.3 TFLOPS of FP32 throughput, underscoring Valhall's suitability for console-quality mobile gaming. The architecture conforms to Vulkan 1.2 and 1.3, with Immortalis models adding hardware-accelerated ray tracing for realistic lighting and shadows in supported titles. Notable enhancements encompass improved asynchronous compute, enabling simultaneous graphics rendering and compute tasks to maximize utilization and reduce latency. Integrated AI tensor accelerators further elevate machine learning inference, delivering up to 60% higher ML performance density in initial implementations. Valhall powers key SoCs like the Google Tensor in Pixel 6 devices (Mali-G78) and MediaTek Dimensity 9200 (Mali-G715), driving immersive experiences in smartphones. As the concluding major iteration before Arm's shift to the fifth-generation architecture, Valhall solidified compression-driven scalability as a cornerstone for energy-constrained embedded graphics.Fifth-generation GPU architecture
The fifth-generation GPU architecture, introduced by Arm in May 2023, represents a new microarchitecture designed to enhance graphics rendering, AI workloads, and power efficiency in mobile devices. It features improved core scaling capabilities, supporting configurations from fewer than five cores in entry-level variants to over ten cores in flagship models, with later iterations extending up to 24 cores. This architecture delivers an average 15% peak performance increase and 15% better energy efficiency compared to the prior generation, alongside a 20% uplift in frame rates for complex scenes, while being optimized for advanced 3nm process nodes to accelerate system-on-chip integration.[53][54][55] Key models based on this architecture include the Immortalis-G720, a 2023 ray-tracing flagship scalable to ten or more cores for high-end smartphones; the Mali-G720 and Mali-G620, mid-range options from 2023 with six to nine cores and up to five cores, respectively, omitting mandatory ray tracing for cost efficiency; the Mali-G725, a 2024 premium scalable variant with six to nine cores emphasizing gaming and AI; and the Mali G1-Ultra, a 2025 flagship model with enhancements for AI processing and ray tracing, scaling from ten to 24 cores. As a successor to the Valhall architecture, it builds on prior compression techniques while introducing deferred vertex shading to handle increased scene complexity more effectively.[54][53][56][4] Performance highlights include up to approximately 2.6 TFLOPS in configurations like the Immortalis-G720 MC12, enabling sustained frame rates in demanding applications. The architecture provides full support for Vulkan 1.3 and enhanced OpenCL implementations, including versions 1.2, 2.1, and 3.0 full profile, to facilitate machine learning tasks such as 3D scene reconstruction with up to 25% better efficiency in select workloads.[57][58][59][53] Central features encompass expanded hardware ray tracing via a power-gatable ray-tracing unit, which doubles performance in the Mali G1-Ultra through the second-generation RTUv2 for more realistic lighting and reflections, and intelligent workload balancing that reduces CPU load by up to 40% while cutting memory bandwidth usage for lower power consumption. These GPUs power upcoming 2025 SoCs, such as those in Arm's Lumex Compute Subsystem platform, which integrates them with AI-optimized CPU clusters for on-device experiences in gaming and inference. Looking ahead, the architecture benefits from open-source kernel drivers, with the Panthor DRM driver providing upstream Linux support identical to Arm's commercial implementations for fifth-generation models.[54][4][60][61]Technical details
The Mali GPU architectures utilize tile-based deferred rendering (TBDR) as a core mechanism to minimize memory bandwidth consumption, particularly suited for power-constrained mobile devices. In TBDR, the rendering pipeline begins with a geometry processing phase that bins incoming primitives against the screen, dividing it into fixed-size tiles—typically 16×16 pixels in Mali implementations. This binning creates a compact tile list data structure, identifying only the relevant primitives for each tile and discarding those that do not overlap, thereby avoiding unnecessary fragment processing across the full framebuffer.[20][62] The subsequent fragment processing phase renders each tile entirely within on-chip tile memory, which buffers color, depth, and stencil data locally. Primitives are rasterized, shaded, and blended per tile, with visibility tests (such as early-Z rejection) and overdraw resolution handled without external memory accesses. Only the finalized tile buffer is written back to system memory once, leveraging techniques like Arm Frame Buffer Compression (AFBC) for further lossless reduction in transfer size. This off-screen rendering eliminates repeated reads and writes associated with overdraw in traditional immediate-mode rendering.[20][63] Bandwidth savings in TBDR arise primarily from localizing overdraw handling, which can be approximated conceptually as follows. In immediate-mode rendering, required bandwidth scales with total fragment processing, given by , where is the screen area in pixels, is the overdraw factor (average fragments per pixel), is bandwidth for reads (e.g., depth/color fetches), and is bandwidth for writes. In TBDR, processing occurs per tile, reducing to , where is tile area in pixels (e.g., 256 for 16×16), is the final tile write cost, and is binning overhead. Derivation of savings yields , assuming negligible binning cost relative to overdraw elimination; smaller tiles refine this by limiting intra-tile overdraw but increase binning granularity. Quantitative impact includes up to 4 GB/s savings for 1080p deferred shading at 60 FPS, establishing TBDR's role in bandwidth efficiency.[20][63] Shader core designs across Mali architectures have evolved from vector-oriented processing to hybrid scalar-vector models, enhancing flexibility for both graphics and compute workloads. Early Utgard and Midgard generations relied on a 4-wide vector (vec4) SIMD execution, where instructions processed four components in parallel, aligning well with graphics shaders but limiting divergence in general-purpose code. Midgard unified vertex and fragment shaders into scalable cores with dual-issue pipelines for improved throughput.[64][48] Bifrost shifted to a scalar ISA with quad-parallel execution, executing four independent scalar threads in lockstep per pipeline stage, which boosts utilization and eases compilation compared to Midgard's vector constraints. Valhall builds on this scalar foundation, incorporating vector processing capabilities for compute tasks while maintaining scalar efficiency for graphics; register files expanded significantly, reaching 128 KB per core to support higher thread counts (up to 1024 threads per core) and complex programs without spilling to memory. This evolution prioritizes balanced performance across diverse workloads.[48][65] The memory hierarchy in Mali GPUs balances low latency and bandwidth through tiered caching, integrated with system-level coherence. Shader cores include private L1 instruction and data caches (typically 16-32 KB combined) alongside texture caches for filtering operations, enabling fast local access during execution. A unified L2 cache, shared across cores and scalable to 64-128 KB per core in architectures like Bifrost and Valhall, aggregates traffic and applies compression for framebuffer data.[66][67] System coherence with ARM CPU cores is managed at the L2 boundary via protocols such as ACE (AXI Coherency Extensions), ensuring GPU writes are visible to CPUs and vice versa without involving per-core L1 caches in snoop traffic; this reduces overhead while maintaining data consistency in heterogeneous SoCs. Mali L1 caches operate non-coherently internally, relying on L2 for inter-core and system synchronization.[68] Power management features in Mali GPUs emphasize efficiency through dynamic voltage and frequency scaling (DVFS), which modulates core clocks and voltages based on workload demand, often via platform governors that profile utilization. Idle states power down unused shader cores or the entire GPU during quiescence, minimizing leakage. Efficiency metrics, such as GFLOPS/Watt, improve across generations—Bifrost achieves roughly 2× better power efficiency than Midgard in fragment-heavy workloads due to reduced overdraw and scalar optimizations—enabling sustained performance within thermal limits. DVFS curves typically scale frequency linearly with utilization while quadratically reducing power, prioritizing energy savings in bursty mobile scenarios.[69][48] Compute and AI capabilities in Mali GPUs leverage an OpenCL-based execution model, where kernels define parallel work-items grouped into work-groups, dispatched via NDRanges to shader cores for SIMT (Single Instruction, Multiple Threads) processing. Each work-item executes as an independent thread with its own program counter, scheduled in waves to maximize occupancy; barriers and atomics ensure synchronization within work-groups. Later generations, including Valhall and fifth-gen architectures, extend this with tensor operations like low-precision matrix multiply-accumulate (e.g., FP16/INT8) directly in shader pipelines, accelerating AI inference without dedicated tensor cores by fusing operations for neural network layers. This model supports scalable compute throughput, with examples like convolution kernels benefiting from vectorized tensor ops in AI workloads.[70]Implementations
Mali graphics processors are integrated into various system-on-chip (SoC) designs by major vendors, enabling graphics acceleration in mobile, embedded, and emerging computing platforms. Samsung's Exynos series frequently incorporates Mali GPUs, with the Exynos 9820 featuring a Mali-G76 MP12 configuration to deliver enhanced gaming performance in flagship devices.[71][72] MediaTek's Dimensity lineup also widely adopts Mali technology, as seen in the Dimensity 9200 SoC with an Immortalis-G715 MC11 GPU, supporting advanced ray tracing and high-frame-rate rendering for premium smartphones.[73][74] Google's Tensor SoC in the Pixel 6 series utilizes a Mali-G78 MP20 GPU, achieving strong graphics benchmarks that demonstrate reliable performance for everyday mobile tasks and light gaming.[75] Notable device integrations highlight Mali's versatility across consumer products. The Samsung Galaxy S10, powered by the Exynos 9820, leverages the Mali-G76 MP12 for immersive visuals in a 6.1-inch AMOLED display, contributing to its premium multimedia experience.[72] The OnePlus Nord 3 employs the MediaTek Dimensity 9000 with a Mali-G710 MC10 GPU, balancing efficiency and power for mid-range gaming and multitasking on its 6.74-inch Fluid AMOLED screen.[76] In the embedded space, Allwinner's A-series SoCs, such as the A33, integrate Mali-400 MP2 GPUs for cost-effective tablets, supporting basic OpenGL ES 2.0 acceleration in budget Android devices.[11] MediaTek's Helio G91 SoC incorporates the Mali-G52 MC2 GPU, delivering theoretical FP32 performance of ~80–100 GFLOPS for entry-level smartphones and tablets.[77] In automotive applications, earlier Renesas R-Car generations, like the R-Car E2, incorporated Mali GPUs such as the Mali-400 for infotainment systems, enabling smooth UI rendering and video playback in vehicle displays.[78] For emerging workloads, the 2025-introduced Mali G1-Ultra GPU appears in SoCs like the MediaTek Dimensity 9500, targeting AI-enhanced graphics in upcoming flagships such as the Vivo X300 Pro, with up to 33% improved GPU performance over prior generations.[79][80]| Vendor/SoC | Mali GPU Variant | Notable Devices/Use Cases | Key Performance Context |
|---|---|---|---|
| Samsung Exynos 9820 | Mali-G76 MP12 | Galaxy S10 | Up to 40% graphics performance uplift for gaming[81] |
| MediaTek Dimensity 9200 | Immortalis-G715 MC11 | Vivo X90 Pro, Oppo Find X6 | 32% boost in Manhattan 3.0 benchmark scores[73] |
| Google Tensor (Pixel 6) | Mali-G78 MP20 | Google Pixel 6 | Strong performance in graphics benchmarks[75] |
| MediaTek Dimensity 9000 (OnePlus Nord 3) | Mali-G710 MC10 | OnePlus Nord 3 | Efficient for mid-range emulation and multitasking[76] |
| Allwinner A33 | Mali-400 MP2 | Budget Android tablets | Basic 1080p UI and video support[11] |
| MediaTek Helio G91 | Mali-G52 MC2 | Entry-level smartphones | Theoretical FP32 performance ~80–100 GFLOPS[77] |
| MediaTek Dimensity 9500 | Mali G1-Ultra MP12 | Vivo X300 Pro (2025) | 119% ray tracing improvement for AI workloads[79][80] |
Video processors
Mali-V500
The Mali-V500 is Arm's inaugural dedicated video processor, announced in 2013 and made available for integration into system-on-chips (SoCs) starting in mid-2014. Designed for mainstream mobile and embedded devices, it supports key formats for both decoding and encoding, including H.264 (up to High Profile level 4.1) and VP8 for encode/decode, and H.263, MPEG-4 ASP, MPEG-2, VC-1/WMV, and RealVideo for decoding, enabling efficient processing of standard-definition and high-definition content. A single-core configuration delivers performance up to 1080p at 60 frames per second (fps) for both encode and decode, scaling to 4K@120fps with eight cores, with low latency under 10 ms at 1080p30.[82] The architecture employs a scalable fixed-function pipeline, configurable from one to eight cores to balance performance and power, with each core operating at up to 600 MHz via an AMBA AXI or ACE Lite bus interface. This design emphasizes energy efficiency, reducing overall system bandwidth by over 50% through integration with Arm Frame Buffer Compression (AFBC), which enables lossless frame storage and minimizes memory access during motion compensation. The processor includes a memory management unit (MMU) for virtual addressing and supports TrustZone for secure content handling, ensuring protected video paths in multi-tenant environments. It is optimized for low-cost dynamic random-access memory (DRAM) types, further lowering power draw in entry-level SoCs targeted at mid-range mobile devices.[82][83] Key specifications highlight its capability for 1 to 4 simultaneous streams on multi-core variants, facilitating multi-view or multi-party video use cases without excessive power overhead. The Mali-V500 integrates directly with Arm's Mali GPU lineup, such as the Midgard-based Mali-T622 and Mali-T720, allowing shared resources for compositing and post-processing in unified multimedia pipelines.[82] While effective for basic high-definition processing, the Mali-V500 is limited to legacy formats without support for emerging codecs like HEVC, positioning it as a foundational solution for cost-sensitive designs. It precedes more advanced V-series processors by establishing Arm's approach to dedicated video hardware acceleration.[82]Mali-V550
The Mali-V550 is a scalable video processor IP core developed by Arm, introduced in October 2014 as part of the company's Mali multimedia suite, with a primary focus on hardware-accelerated HEVC (H.265) encoding to enable efficient high-resolution video capture in mobile and embedded devices.[84] It represents the first Arm video processor to integrate both encoding and decoding in a single core, supporting up to 1080p60 HEVC encode/decode on one core and scaling to 4K@120fps with an eight-core configuration, making it suitable for premium smartphones and set-top boxes requiring 4K video output.[85] As an evolution from the Mali-V500, the V550 adds dedicated encoding hardware while maintaining backward compatibility for multi-standard video processing.[86] The architecture of the Mali-V550 centers on a multi-core hardware encode engine, configurable from one to eight cores, which handles motion estimation, transform coding, and rate control optimized for HEVC Main Profile at 8- and 10-bit depths. This design supports time-multiplexed multi-stream encoding, allowing up to eight simultaneous 720p streams or mixed resolutions with different codecs like H.264 and HEVC, reducing the need for multiple dedicated engines in system-on-chip (SoC) designs. Integrated features such as Arm Frame Buffer Compression (AFBC) minimize memory bandwidth by up to 60% during encoding, enhancing power efficiency without quality loss, particularly for external display scenarios like wireless streaming.[86][87] Key specifications include a low-latency mode that hides memory access delays to prevent frame drops, ideal for real-time applications such as video calls and live streaming at resolutions up to 1080p. The processor has been integrated into SoCs like the Amlogic S912, an octa-core Cortex-A53 design used in 4K Android TV boxes, where it enables hardware HEVC encoding for efficient media processing. Compared to software-based encoding on general-purpose CPUs, the Mali-V550 delivers significantly better compression efficiency—up to 50% lower power consumption for equivalent bitrates—by offloading compute-intensive tasks to dedicated silicon, thereby extending battery life in mobile devices while supporting higher quality outputs.[86][88][84]Mali-V61
The Mali-V61 is a versatile video processor developed by Arm, announced on October 31, 2016, and designed for integration into mainstream mobile and embedded systems starting in 2017. It combines hardware acceleration for H.265 (HEVC) Main10 Profile decoding and encoding with VP9 Profile 2 decoding, supporting both 8-bit and 10-bit color depths for multi-format video processing up to 4K UHD resolution at 60 frames per second. This unified approach enables efficient handling of high-definition content for applications like streaming and video conferencing, while maintaining backward compatibility with earlier formats such as H.264.[89][90] Building on the encode-focused Mali-V550, the V61 introduces robust decode capabilities to support emerging web video standards. Its architecture employs a unified pipeline that processes both decoding and encoding tasks, allowing for flexible resource allocation across up to 16 simultaneous decode streams or 8 encode streams. This design optimizes throughput for scenarios involving multiple video feeds, such as live broadcasting or multi-view playback, while leveraging Arm Frame Buffer Compression (AFBC) v1.2 for reduced memory bandwidth. The processor scales from 1 to 8 cores, enabling configurations tailored to performance needs, from single-core 1080p@60fps operation to multi-core 4K@120fps decoding.[89][91] Key specifications include native HDR10 support for enhanced dynamic range in 4K content, ensuring compatibility with high-fidelity displays without additional processing overhead. Its power-efficient architecture, optimized for 28nm and advanced nodes, minimizes energy consumption for battery-constrained environments, making it suitable for IoT applications requiring scalable video handling from 1080p to 4K resolutions.[90]Mali-V52
The Mali-V52 is a video processing unit (VPU) developed by Arm and announced in March 2018 as part of the company's Mali Multimedia Suite targeting mainstream devices. It serves as a decode-centric IP core with H.264 encoding support, optimized for efficient hardware decoding and encoding of high-resolution video streams, supporting key codecs including H.265/HEVC (up to 10-bit) and VP9 for decode, and H.264/AVC (High 10 Profile, Levels 5.0/5.1) for encode/decode.[92][93] This design enables smooth playback of 4K content at 60 frames per second in single-core configurations for decode, scaling to 4K at 120 fps decode or 4K at 60 fps encode with up to four cores, making it suitable for delivering premium video experiences in resource-constrained environments.[94] Architecturally, the Mali-V52 features a streamlined core emphasizing performance gains through architectural refinements that double decoding throughput compared to the prior Mali-V61 while reducing silicon area by 38%.[92][95] The core is scalable from one to four instances, allowing integration flexibility in system-on-chips (SoCs) for varying performance needs, and incorporates optimizations for YUV420 color format handling to minimize processing overhead. Similar to the Mali-V61, it prioritizes efficiency but introduces targeted improvements for mid-range scalability.[96] Key specifications highlight its efficiency, with the compact design enabling low bandwidth utilization and power consumption ideal for battery-powered devices.[97] For instance, a single core can handle 4K at 30 fps encode or 60 fps decode, or 1080p at 120 fps, supporting multi-stream scenarios in mainstream applications without excessive memory demands.[98] This focus on area and power efficiency—achieved through refined heuristics and reduced external memory accesses—positions the Mali-V52 as a cost-effective solution for SoC designers aiming to include advanced video capabilities in mid-tier hardware. In practical use cases, the Mali-V52 excels in streaming services on budget smartphones and entry-level tablets, where it facilitates high-quality 4K video playback for apps like YouTube or Netflix while conserving system resources for other tasks.[99] Its deployment in mainstream SoCs supports HDR content decoding, enhancing visual fidelity in affordable consumer electronics without compromising on thermal or energy budgets.[100]Mali-V76
The Mali-V76 is a video processing unit (VPU) from Arm, announced on May 31, 2018, as part of a premium IP suite targeting high-end mobile devices, set-top boxes, and consumer electronics requiring advanced multimedia processing. It builds on prior generations by doubling decode performance while reducing silicon area by up to 40% for equivalent tasks, enabling efficient handling of ultra-high-definition content in power-constrained environments. This processor supports key codecs including H.265 (HEVC) for both decoding and encoding, as well as VP9 decoding, with hardware acceleration for 10-bit color depth.[101][102] The architecture of the Mali-V76 employs a scalable multi-core design configurable from 2 to 8 cores, allowing SoC designers to optimize for varying performance needs and power budgets. Its next-generation decode and encode engines incorporate optimizations for high-resolution video, including support for high dynamic range formats such as HDR10 and hybrid log-gamma (HLG), which enhance color accuracy and contrast in displays. The unit also facilitates multi-view video applications through simultaneous stream processing, such as configuring for video walls or multi-screen setups. Operating at frequencies up to 800 MHz, it delivers significant efficiency gains over predecessors like the Mali-V61, with reduced power consumption for sustained high-frame-rate operations.[102][103] Key specifications highlight the Mali-V76's capability for 8K decoding at up to 60 frames per second in a single stream or four 4K streams at 60 fps, alongside support for up to 16 full HD (1080p) streams concurrently. Encoding performance includes H.265 up to 8K at 30 fps or 4K at 120 fps, suitable for premium video capture in smartphones and broadcasting applications. These features position the V76 for emerging 8K ecosystems, with implementations appearing in high-end SoCs from vendors like MediaTek and Rockchip for devices such as smart TVs and tablets. Overall, it advances video processor efficiency, enabling broader adoption of 8K content without compromising battery life or thermal limits.[101][102][103]Comparison of video processors
The Mali video processors demonstrate a clear progression in capabilities, beginning with the V500's focus on efficient H.264 and VP8 processing for HD content and advancing to the V76's support for high-resolution, multi-format decoding and encoding in premium mobile devices. This evolution reflects Arm's emphasis on scaling performance for diverse SoC requirements while maintaining low power consumption suitable for battery-powered systems. Key advancements include expanded codec support, higher resolutions, and optimized multi-stream handling to enable features like simultaneous video playback and recording.| Model | Decode Formats | Encode Formats |
|---|---|---|
| V500 | H.264, VP8, H.263, MPEG-4 ASP, MPEG-2, VC-1/WMV, RealVideo [104] | H.264, VP8 [104] |
| V550 | H.264, HEVC [105] | H.264, HEVC [105] |
| V61 | H.264, HEVC, VP9 [106] | H.264, HEVC [106] |
| V52 | H.264, HEVC, VP9 [107] | H.264 [107] |
| V76 | H.264, HEVC, VP9 [108] | H.264, HEVC [108] |
[105]: https://www.tomshardware.com/news/arm-mali-gpus-video-display,27961.html
[106]: https://www.cnx-software.com/2016/11/01/arm-introduces-bifrost-mali-g51-gpu-and-mali-v61-4k-h-265-vp9-video-processing-unit/
[107]: https://www.notebookcheck.net/ARM-announces-new-Mali-G52-31-GPUs-along-with-video-and-display-processors-for-mobile-devices.287474.0.html
[108]: https://www.cnx-software.com/2018/06/01/arm-cortex-a76-cpu-mali-g76-gpu-mali-v76-8k-vpu/
Display processors
Mali-D71
The Mali-D71 is a display processor developed by Arm, announced on November 1, 2017, as the first implementation of the company's Komeda architecture for advanced mobile and embedded display handling. Designed primarily for high-resolution outputs in power-constrained environments, it enables driving up to two independent displays simultaneously, with support for 4K (3840×2160) resolution at 60 frames per second per display in dual mode or a single 4K display at up to 120 Hz for latency-sensitive applications like virtual reality.[109][110] The core architecture revolves around a modular compositor with two configurable pipelines, allowing flexible allocation for either dual-display operation—where each pipeline drives a separate output—or single-display mode with combined resources for enhanced complexity, such as up to eight simultaneous Android composition layers. This setup incorporates stages for layer blending, scaling, rotation, and post-processing, integrated with Arm Framebuffer Compression (AFBC) 1.2 to optimize memory bandwidth and reduce power consumption. The processor pairs with the CoreLink MMU-600 for efficient 4KB-paged memory management, ensuring real-time performance in scenarios requiring low latency.[111][112] Key specifications emphasize compatibility with major display interfaces, including MIPI DSI for mobile panels and HDMI for external connections, making it suitable for smartphones, tablets, and VR headsets. Power efficiency is a hallmark, with the Mali-D71 offloading composition tasks from the GPU to achieve up to 30% overall system power savings in complex UI scenarios compared to GPU-based rendering. It supports HDR10 output natively through integration with Assertive Display 5, which handles tone mapping, color space conversion, and dynamic range enhancement even on standard dynamic range (SDR) displays. Additional features include gamma correction for accurate color reproduction and dithering to minimize banding artifacts in gradients.[113][110][96] The Mali-D71 complements Arm's Mali GPU families by managing final display pipeline stages, such as multi-layer blending and output formatting, thereby freeing GPU resources for rendering and improving overall system responsiveness in multi-window environments.[114]Mali-D51
The Mali-D51 is a mainstream display processor developed by Arm, announced on March 6, 2018, based on the Komeda architecture. It supports up to 4K resolution at 60 Hz, with up to eight composition layers, bringing premium features like HDR support via Assertive Display 5 to mid-range devices. Compared to the previous Mali-DP650, it offers 30% system power savings and 50% better memory latency, enabling efficient handling of complex UIs while maintaining low power consumption.[96][92]Mali-D77
The Arm Mali-D77 is a premium display processing unit (DPU) introduced in May 2019, designed primarily to enhance virtual reality (VR) experiences in head-mounted displays (HMDs) and premium mobile devices by handling high-resolution, low-latency composition and rendering offloads from the GPU.[115] It builds upon the Komeda architecture of prior models, enabling support for up to four stereo VR layers with optimizations for resolutions such as 3K at 120 frames per second (fps) or 4K at 90 fps, which helps reduce motion sickness through smoother frame delivery.[116] This represents an evolution from the Mali-D71's capability for dual 4K displays at 60 Hz or a single 4K display at 120 Hz, incorporating dedicated VR accelerations to improve overall system efficiency.[117] Architecturally, the Mali-D77 features fixed-function hardware blocks that perform VR-specific tasks, including Asynchronous Timewarp (ATW) for interpolating frames to maintain high refresh rates despite GPU bottlenecks, Lens Distortion Correction (LDC) to compensate for optical distortions in HMDs, and Chromatic Aberration Correction (CAC) for color fringing reduction.[115] These enhancements, integrated into the Komeda compositor, allow for multi-layer composition with high dynamic range (HDR) support on 4K displays, enabling pixel densities exceeding 1000 pixels per inch (ppi) in collaboration with display drivers like those from Synaptics. The design also achieves up to 40% savings in system bandwidth and 12% in power consumption for VR workloads by offloading compute-intensive operations from the GPU.[118] Key specifications emphasize scalability for untethered VR devices, supporting seamless transitions from HMDs to standard premium mobile screens while preserving image quality.[115] When paired with Arm's MMU-600 memory management unit and Assertive Display 5 engine, it facilitates efficient handling of high-frame-rate content without compromising battery life or thermal performance.[118] The Mali-D77's focus on VR acceleration positions it as a foundational IP for next-generation immersive applications, prioritizing low-latency rendering over general-purpose display tasks.Image signal processors
Mali-C71
The Mali-C71 is Arm's inaugural image signal processor (ISP), announced on April 25, 2017, and designed specifically for advanced driver-assistance systems (ADAS) in automotive applications. It addresses challenges in processing images from multiple cameras under varying lighting and weather conditions, enabling features like 360-degree surround views and object detection for both human display and computer vision pipelines. Built following Arm's acquisition of Apical, the processor integrates over 300 dedicated fault detection circuits to support high-reliability standards, marking a shift toward integrated imaging solutions for smart vehicles.[119] Architecturally, the Mali-C71 employs a multi-input pipeline capable of handling up to four real-time camera streams or sixteen additional streams from memory, allowing simultaneous processing from diverse sensor types including Bayer, monochrome, and flexible color filter arrays (CFAs). It features a modular block-based design that includes advanced noise reduction modules—such as 2D spatial filtering and per-exposure temporal profiling—along with chromatic aberration correction and high dynamic range (HDR) fusion to merge exposures from up to 24 stops of dynamic range. This enables ultra-wide dynamic range imaging, far exceeding typical smartphone ISPs, to capture details in extreme contrasts like direct sunlight and shadows. The processor outputs processed data in formats suitable for display or further analysis, with optimizations for low latency and reversible transforms to preserve raw data integrity for computer vision tasks.[120][121] Key specifications include a throughput of 1.2 gigapixels per second, supporting resolutions adequate for automotive cameras such as full HD at high frame rates, while prioritizing efficiency in power-constrained embedded systems. It processes raw sensor data through debayering, tone mapping, and sharpening stages, with built-in support for region-of-interest cropping and planar histograms to accelerate downstream algorithms. The Mali-C71 has been integrated into automotive system-on-chips (SoCs) for enhanced situational awareness, distinguishing it as a foundational technology for evolving autonomous driving capabilities.[122][123]Mali-C52 and Mali-C32
The Arm Mali-C52 and Mali-C32 image signal processors (ISPs) were announced on January 3, 2019, as mid-range and entry-level solutions for embedded vision applications such as security cameras, drones, and smart home devices.[124] The Mali-C52 targets balanced camera systems with support for up to four independent camera inputs at a maximum resolution of 4608 × 3456 pixels (approximately 16 megapixels per sensor), enabling real-time processing for 4K video at 60 frames per second.[125][126] In contrast, the Mali-C32 is area-optimized for low-power, cost-sensitive entry-level devices, maintaining similar input capabilities but in a more compact implementation suitable for basic 16-megapixel imaging.[127] Both provide a complete ecosystem including hardware IP, software drivers, 3A libraries for auto-exposure, auto-white balance, and auto-focus, along with calibration and tuning tools.[125][127] These ISPs employ a scalable, block-based architecture with multi-context processing that applies over 25 steps per pixel to raw sensor data from RGGB or RGBIr formats, supporting multi-channel outputs in RGB or YUV.[124][128] The Mali-C52 offers configurable modes optimized for either superior image quality or reduced silicon area, with a peak throughput of 600 megapixels per second to handle demanding real-time workloads.[125] The Mali-C32 prioritizes efficiency in the same pipeline, delivering comparable performance in a smaller footprint for resource-constrained systems.[127] Key features focus on essential image enhancement for both human and computer vision, including basic high dynamic range (HDR) processing via Arm's Iridix technology for contextual tone mapping and dynamic range management, which preserves details in shadows and highlights without overexposure.[124] Additional capabilities encompass advanced noise reduction to minimize artifacts in low-light conditions and color management for accurate reproduction, alongside lens correction through geometric distortion compensation integrated into the processing flow.[124][129] These elements enable high-quality outputs for applications requiring reliable imaging without the advanced sensor fusion of later models like the Mali-C71.[124]Mali-C71AE
The Mali-C71AE is an image signal processor (ISP) developed by Arm for automotive and industrial applications, particularly advanced driver-assistance systems (ADAS) and machine vision tasks. Announced in September 2020, it builds on the architecture of the consumer-oriented Mali-C71 but incorporates enhancements for functional safety and reliability in harsh environments.[120][130] It supports processing from multiple camera streams to enable features like surround-view systems, object detection, and night-vision enhancement, delivering up to 1.2 gigapixels per second throughput.[131][132] Designed with automotive-grade ruggedization, the Mali-C71AE operates reliably in extreme conditions typical of vehicle and industrial settings, emphasizing fault tolerance and diagnostic coverage. It meets ISO 26262 ASIL B for random hardware faults and ASIL D for systematic failures, alongside IEC 61508 SIL 3 standards, through over 400 built-in fault-detection circuits, cyclic redundancy checks (CRC), and built-in self-test (BIST) mechanisms.[120][131] The architecture includes dedicated pipelines for simultaneous human-visible output (for displays) and computer-vision processing (for ADAS), supporting up to four real-time camera inputs at resolutions up to 4096 x 2560 pixels or 16 virtual streams from memory.[132] This multi-camera capability handles diverse sensor types, such as RGGB, RCCC, and RGBIr, with 4:1 high dynamic range (HDR) exposure fusion for twice the dynamic range of a single-exposure sensor.[131] Key features focus on enhancing image quality and safety for ADAS applications, including advanced 2D noise reduction via sinter technology, chromatic aberration correction, and per-exposure noise profiling for low-light conditions.[120] It enables multi-camera stitching for 360-degree views and region-of-interest cropping, while tagging suspect pixels and providing reversible transforms to maintain data integrity for downstream AI processing.[131] The ISP integrates with Arm's Automotive Enhanced (AE) ecosystem, such as the Cortex-A78AE CPU and Mali-G78AE GPU, and has been adopted in automotive system-on-chips (SoCs) for production monitoring, quality control, and all-around vehicle awareness.[133][132]Mali-C55
The Mali-C55 is an image signal processor (ISP) developed by Arm and released on June 8, 2022, designed specifically for efficient image processing in IoT and embedded vision systems.[134] It supports up to eight simultaneous camera inputs, enabling multi-sensor setups for applications such as smart cameras and drones, and handles resolutions up to 8K with a maximum image size of 48 megapixels.[135] The processor emphasizes high dynamic range (HDR) capabilities for cameras, including 2:1 HDR stitching, digital overlay (DOL), and dual-pixel HDR to capture details across varying lighting conditions.[136] Architecturally, the Mali-C55 features a compact, configurable design optimized for low power consumption and minimal silicon area—achieving half the footprint of its predecessor, the Mali-C52—making it suitable for battery-powered embedded devices.[134] It delivers a throughput of up to 1.2 gigapixels per second while supporting input formats including 14-bit RAW data for high-fidelity processing.[135][137] Key enhancements include multi-exposure fusion via HDR sensor support, advanced noise reduction with Temper temporal and Sinter 2.6 spatial algorithms (reducing memory bandwidth by up to 50% compared to prior generations), and improved Iridix local tone mapping for natural image rendering in challenging environments.[136] The Mali-C55 is widely adopted in smart home devices, such as security cameras and hubs, where it enables real-time image enhancement for endpoint vision tasks.[134] It integrates edge AI processing through a dedicated output pipe to machine learning accelerators, facilitating on-device inference for features like object detection without cloud dependency.[135] This combination of efficiency and configurability positions the Mali-C55 as a mid-range complement to the Mali-C52, targeting cost-sensitive IoT deployments.[136]Comparison of image signal processors
The Arm Mali image signal processors (ISPs) have evolved to address diverse applications, with throughput ranging from 0.6 gigapixels per second (GP/s) in entry-level models to 1.2 GP/s in advanced configurations, enabling efficient processing for embedded vision systems.[127][126][138][131] Early models like the Mali-C32 prioritize low-power operation for cost-sensitive IoT devices, while later variants such as the Mali-C55 and Mali-C71AE incorporate multi-camera support and enhanced dynamic range handling for more demanding consumer and automotive scenarios. This progression reflects a shift toward higher efficiency and integration with machine learning pipelines, particularly after 2020, where ISPs began facilitating direct feeds to AI accelerators for real-time computer vision tasks.[139][135][120][140]| Model | Throughput (GP/s) | Max Inputs/Streams | Primary Use Cases |
|---|---|---|---|
| Mali-C32 | 0.6 | Up to 4 independent camera sources | Low-power IoT, entry-level embedded vision (e.g., access control) |
| Mali-C52 | 0.6 | Up to 4 independent camera sources, dual outputs | Consumer cameras, drones, action cams with HDR needs |
| Mali-C55 | 1.2 | 8 separate inputs | Battery-powered IoT, smart cameras, edge ML integration |
| Mali-C71AE | 1.2 | 4 real-time inputs or 16 streams | Automotive ADAS, industrial multi-camera systems |
Open-source drivers
Lima
Lima is an open-source, reverse-engineered graphics driver for ARM's Mali Utgard architecture GPUs, including the Mali-400 and Mali-450 series. Developed as a community effort within the Mesa 3D graphics library, it utilizes the Gallium3D driver framework to provide free software support for these embedded GPUs. The project was initiated by Luc Verhaegen in 2012 and later upstreamed into Mesa 19.1 in 2019, marking a significant milestone for open-source Mali compatibility.[149][150][151] The driver focuses on enabling 3D acceleration through reverse engineering of the proprietary hardware, replacing binary blobs with verifiable source code. It supports OpenGL ES 2.0 with a 97% pass rate on Khronos conformance tests, alongside partial implementations of OpenGL 2.1 and OpenGL ES 1.1. These features target basic 2D and 3D rendering workloads suitable for the fixed-function shader model of Utgard GPUs.[149] In Linux environments, Lima has reached a mature state for 2D and 3D operations, integrated with display drivers like sun4i-drm for Allwinner SoCs and rockchip for Rockchip platforms. It is commonly deployed on single-board computers such as Olimex boards and Armbian-supported devices with Allwinner A10/A20 or H3 processors, offering an open alternative to proprietary drivers in Raspberry Pi-like ecosystems. Development now emphasizes bug fixes and broader application compatibility rather than major new features.[149][152][153] Due to the hardware constraints of the Utgard architecture, Lima does not support compute shaders, OpenGL 3.x or higher, OpenGL ES 3.x, OpenCL, or Vulkan, limiting it to legacy graphics APIs. Fragment shaders are restricted to FP16 precision, aligning with the GPU's original design for mobile and embedded use cases.[149]Panfrost
Panfrost is an open-source graphics driver developed for Arm Mali GPUs featuring the Midgard and Bifrost microarchitectures, including the T600 series and G30 through G76 models.[154] Initiated in 2018 as a reverse-engineered implementation built on the Gallium3D framework within the Mesa 3D graphics library, it aims to deliver conformant support for modern graphics APIs without relying on proprietary binaries.[155] By 2022, Panfrost provided full support for OpenGL ES 3.1 and Vulkan 1.1 on these architectures, enabling robust 3D rendering and compatibility with applications targeting embedded systems.[156] Key features of Panfrost include support for unified shaders, which allow flexible execution of vertex, fragment, and compute workloads on the same hardware units, along with compute shader capabilities for general-purpose GPU computing tasks.[154] These elements enable efficient handling of complex shaders and parallel processing, essential for games and graphical applications. The driver has been integrated into Mesa versions 20 and later, facilitating widespread adoption in open-source Linux distributions and facilitating hardware acceleration for desktop environments like GNOME on compatible devices.[157] Panfrost is considered production-ready for both Android and Linux environments, powering smooth graphics performance in real-world scenarios such as video playback, UI compositing, and light gaming. For instance, it delivers reliable OpenGL ES acceleration on the Rockchip RK3399 system-on-chip, which integrates a Mali-T860 GPU, enabling Wayland compositing and application rendering without proprietary drivers.[158] Development of Panfrost began as a community-led effort hosted on freedesktop.org, with initial focus on reverse-engineering shader binaries and kernel interfaces.[159] Following Arm's official endorsement in 2020, the company began contributing code and documentation, accelerating progress toward API conformance and performance optimizations while maintaining the project's open-source ethos.[160] As the successor to the Lima driver, Panfrost extends open-source support to architectures with unified shaders.[161]Panthor
Panthor is an open-source kernel driver developed for Arm Mali GPUs utilizing the Command Stream Frontend (CSF) architecture, beginning with third-generation Valhall models such as the Mali-G610, and extending to other third-generation Valhall GPUs like the Mali-G310, G510, and G710, as well as fifth-generation architectures including the Immortalis series like the G720 and the Mali-G1 series.[162][163] Development on Panthor was publicly announced in late 2023 by engineers at Collabora, with initial patches focusing on upstream integration into the Linux kernel's Direct Rendering Manager (DRM) subsystem. It builds upon the userspace components of the Panfrost driver to provide a unified model for modern Mali hardware.[162] Key features of Panthor include support for advanced graphics capabilities such as ray tracing on Immortalis GPUs and asynchronous compute operations, enabling efficient parallel workload execution on supported hardware.[164] The driver is designed to be identical in functionality to Arm's own open-sourced kernel components for CSF-based GPUs, ensuring compatibility with upstream firmware blobs while promoting full open-source stack adoption.[165] In conjunction with the Mesa userspace libraries, particularly the PanVK Vulkan driver, Panthor achieves conformance to Vulkan 1.3, allowing developers to leverage modern API features like dynamic rendering and enhanced synchronization. As of 2025, with the PanVK Vulkan driver, Panthor achieves conformance to Vulkan 1.2 on Mali-G610, with support for Vulkan 1.3 and 1.4 implemented, nearing full conformance for higher versions.[166][167] Panthor was merged into the Linux kernel as part of version 6.10, released in July 2024, initially targeting third-generation Valhall GPUs and select devices with compatible hardware, such as those featuring the Mali-G715 in later Google Pixel series beyond the Pixel 6.[168][169] Subsequent enhancements in Linux 6.18 expand support to additional Valhall GPUs such as the Mali-G310, G510, and G710. Further support for fifth-generation and Immortalis GPUs, including the Mali-G1 series, has been added in late 2025 kernel versions.[61][170][171] Advancements in Panthor emphasize enhanced power management, with future iterations incorporating standalone Dynamic Voltage and Frequency Scaling (DVFS) for CSF-based GPUs to optimize energy efficiency during varying workloads. It also facilitates AI workload support via compute shaders and integration with Arm's shader cores, enabling machine learning inference and other parallel processing tasks on Mali hardware without proprietary dependencies.[164]References
- Arm Immortalis and Mali GPUs deliver immersive graphics and compute performance for everything from high-end smartphones to smart TVs.
- Mali-G77 is the highest performing mobile GPU for complex use cases, such as graphics and on-device machine learning, and delivers consistent battery life ...
- Jun 23, 2006 · ARM Holdings plc, a licensor of processor and physical IP technology, has acquired Falanx Microsystems AS, a Norwegian developer of graphics ...
- Feb 12, 2007 · ARM Builds Graphics Stack And Broadens Portfolio With Mali200 And Mali55 Processors. ARM Mali processors enable visually stunning 2D and 3D ...
- Feb 10, 2012 · The aim of this driver is to finally bring all the advantages of open source software to ARM SoC graphics drivers.