Hubbry Logo
ARM Cortex-A73ARM Cortex-A73Main
Open search
ARM Cortex-A73
Community hub
ARM Cortex-A73
logo
7 pages, 0 posts
0 subscribers
Be the first to start a discussion here.
Be the first to start a discussion here.
ARM Cortex-A73
ARM Cortex-A73
from Wikipedia
ARM Cortex-A73
General information
Launched2016
Designed byARM Holdings
Max. CPU clock rateto 2.8 GHz 
Physical specifications
Cores
  • 1–4 per cluster, multiple clusters
Cache
L1 cache96–128 KiB (64 KiB I-cache with parity, 32–64 KiB D-cache) per core
L2 cache1–8 MiB
L3 cacheNone
Architecture and classification
ApplicationMobile
Instruction setARMv8-A
Products, models, variants
Product code name
  • Artemis
History
PredecessorsARM Cortex-A72
ARM Cortex-A17
SuccessorARM Cortex-A75

The ARM Cortex-A73 is a central processing unit implementing the ARMv8-A 64-bit instruction set designed by ARM Holdings' Sophia design centre. The Cortex-A73 is a 2-wide decode out-of-order superscalar pipeline.[1] The Cortex-A73 serves as the successor of the Cortex-A72, designed to offer 30% greater performance or 30% increased power efficiency.[2]

Design

[edit]

The design of the Cortex-A73 is based on the 32-bit ARMv7-A Cortex-A17, emphasizing power efficiency and sustained peak performance.[3] The Cortex-A73 is primarily targeted at mobile computing.[4] In reviews, the Cortex-A73 showed improved integer instructions per clock (IPC), though lower floating point IPC, relative to the Cortex-A72.[5]

Licensing

[edit]

The Cortex-A73 is available as SIP core to licensees, and its design makes it suitable for integration with other SIP cores (e.g. GPU, display controller, DSP, image processor, etc.) into one die constituting a system on a chip (SoC).

The Cortex-A73 is also the first ARM core to be modified through ARM's semi-custom 'Built on ARM' license.[6][7] The Kryo 280 was the first released semi-custom product, though the modifications made relative to the stock Cortex-A73 were not announced.[5]

Products

[edit]

The HiSilicon Kirin 960, released in 2016, utilizes 4 Cortex-A73 cores (clocked at 2.36 GHz) as the 'big' cores in a big.LITTLE arrangement with 4 'little' ARM Cortex-A53 cores.[8]

The MediaTek Helio X30 utilizes 2 Cortex-A73 cores (at 2.56 GHz) as the 'big' cores in deca-core big.LITTLE arrangement with 4 Cortex-A53 and 4 Cortex-A35 'little' cores.[9]

The Kryo 280, released in March 2017 by Qualcomm in the Snapdragon 835, uses a modified Cortex-A73 core.[5][10] The SoC utilizes 8 Kryo 280 cores in a big.LITTLE arrangement as two 4-core blocks, clocked at 2.456 GHz and 1.906 GHz. The modifications made by Qualcomm relative to the stock Cortex-A73 core are unknown, and the resulting Kryo 280 core demonstrated increased integer IPC.[5] The Kryo 260 also used Cortex-A73 cores, though at lower clock speeds than the Kryo 280 and in combination with Cortex-A53 cores.[11]

The Cortex-A73 is also found in a wide range of mid-range chipsets such as the Samsung Exynos 7885, MediaTek Helio P series, and other HiSilicon Kirin models. Like the Snapdragon 636/660, most of these chipsets implement 4 A73 cores and 4 A53 cores in a big.LITTLE configuration, although some lower end models of Samsung chips implement only 2 A73 cores with 6 A53 cores.

See also

[edit]

References

[edit]
Revisions and contributorsEdit on WikipediaRead on Wikipedia
from Grokipedia
The ARM Cortex-A73 is a high-performance, power-efficient CPU core developed by , implementing the Armv8-A 64-bit architecture and designed primarily for premium mobile and embedded applications such as smartphones and automotive systems. Announced in May 2016 as part of Arm's premium mobile processor suite, it supports configurations of 1 to 4 symmetrical (SMP) cores per cluster, with multiple coherent clusters enabled via AMBA 4 ACE interconnect technology, enabling scalable big.LITTLE heterogeneous processing when paired with efficiency cores like the Cortex-A53 or Cortex-A35. Key architectural features include per-core L1 instruction and data caches, a shared L2 unified cache per cluster, and support for advanced instruction sets such as (64-bit), AArch32 for with Armv7, TrustZone security, advanced SIMD and DSP extensions, VFPv4 floating-point unit, and . The core achieves clock speeds up to 2.8 GHz in mobile process nodes, delivering the highest peak and sustained performance in its low-power class while offering up to 30% improved power efficiency over predecessors like the Cortex-A72, making it suitable for battery-constrained devices with demanding workloads in , , and computing. In terms of integration and , the Cortex-A73 features a compact footprint as the smallest Armv8-A premium processor at the time of release, with low-power L2 wait-for-interrupt (WFI) states, dynamic retention modes for L2 RAMs, and compatibility with Arm's GPUs, TrustZone security IP, and CoreSight SoC-400 debug/trace components via standard AMBA interfaces. Targeted at system-on-chip (SoC) designs in slim form factors, it has been widely adopted in flagship mobile SoCs for enhanced user experiences in immersive applications, though later Cortex-A series cores have since succeeded it in Arm's portfolio for even greater efficiency and AI capabilities.

Overview and Specifications

Introduction

The ARM Cortex-A73 is a high-performance (CPU) core implementing the ARMv8-A 64-bit . Announced by on May 30, , at in , it was positioned as a key component of the company's premium mobile processor portfolio for 2017 devices. Designed from scratch by ARM's team at the Sophia Antipolis design center in , the core emphasized mobile-focused optimizations for efficiency and sustained operation. Serving as the successor to the Cortex-A72 in ARM's high-performance mobile lineup, the A73 was later followed by the Cortex-A75 as its direct replacement. This progression marked an evolution toward more capable processors for power-constrained environments, building on the big.LITTLE approach by pairing with efficiency cores like the Cortex-A53 or A35. Targeted primarily at premium smartphones and tablets, the Cortex-A73 also supported emerging applications such as (AR), (VR), and advanced . Its design prioritized sustained peak performance without thermal throttling, enabling operation at frequencies up to 2.8 GHz on 10 nm processes while delivering 30% better efficiency than its predecessor.

Technical Specifications

The ARM Cortex-A73 implements the ARMv8-A , supporting both 64-bit execution and 32-bit AArch32 legacy mode for with ARMv7 software. Its microarchitecture features an out-of-order, superscalar design with a 2-wide decode stage, enabling a sustained throughput of up to two instructions per cycle. The core supports configurations of 1 to 4 cores per cluster, with shared L2 cache and compatibility for multiple clusters in heterogeneous big.LITTLE systems, often paired with efficiency cores like the Cortex-A53. It is single-threaded per core, lacking hardware multithreading capabilities. Maximum clock frequencies reach up to 2.8 GHz when implemented on advanced process nodes. Optimized for 16 nm FinFET processes and scalable to smaller nodes such as 10 nm, the Cortex-A73 delivers efficient performance in mobile and embedded applications. It includes support for key extensions, such as from the ARMv8-A base, along with optional cryptography accelerations for AES and SHA algorithms, and CRC32 via ARMv8.1 compatibility in select implementations.

Design and Microarchitecture

Core Design

The ARM Cortex-A73 core was engineered with a primary focus on achieving a balance between high sustained performance and low power consumption, particularly for mobile applications, while delivering up to a 25% reduction in area compared to the Cortex-A72 on the same process node. This design philosophy prioritized efficiency in premium smartphones and other battery-constrained devices, enabling higher clock speeds up to 2.8 GHz without excessive thermal throttling. The core implements the ARMv8-A architecture, emphasizing sustained workloads over peak bursts to support immersive experiences like gaming and . The Cortex-A73 employs a cluster-based configuration supporting up to four cores per cluster, initially designed for big.LITTLE heterogeneous systems. This allows for scalable multi-core setups, such as pairing A73 cores with efficiency cores like the Cortex-A53, using a shared L2 cache and AMBA 4 interconnect for coherent memory access. The core's framework underpins its ability to handle complex instruction streams efficiently. Branch prediction in the Cortex-A73 features advanced algorithms with a two-level global history buffer, contributing to up to 10% better at iso-frequency compared to the Cortex-A72. The load/store unit supports up to three loads and two stores per cycle, enhanced by store forwarding optimizations to reduce latency in data-dependent operations. For operations, the core features dual pipelines equipped with arithmetic logic units (ALUs), barrel shifters, and multipliers, enabling parallel execution of scalar instructions to maintain high throughput. The floating-point and SIMD capabilities are handled by a unit providing 128-bit vector processing, though it exhibits lower (IPC) compared to integer units due to simplified scheduling mechanisms. This design choice trades some vector performance for overall power savings, aligning with the core's mobile optimization goals while still supporting advanced media and workloads.

Pipeline and Execution Units

The ARM Cortex-A73 implements a superscalar, out-of-order optimized for energy efficiency in , featuring an 11-stage integer that supports clock speeds up to 2.8 GHz. The front end includes a four-stage fetch unit delivering instructions to a 2-wide decode stage, which processes variable-length ARM instructions by splitting them into two parallel streams. Subsequent rename and dispatch stages follow, enabling up to two instructions to be dispatched per cycle to the execution units, with the overall design sustaining a maximum throughput of two . The execution units consist of two integer arithmetic logic units (ALUs) for basic operations, one complex integer unit dedicated to multiply and divide operations, a dedicated branch unit, and shared pipelines for floating-point and SIMD processing, with the latter featuring dual pipes for improved vector throughput. The branch unit incorporates an predictor that tracks up to 256 targets overall, supporting up to 16 possible targets per to enhance accuracy in . Reordering is managed through a slot-based execution model with theoretically unlimited capacity, eschewing a traditional fixed-size reorder buffer to facilitate deep without hard limits on instruction window size. In practice, this enables extensive out-of-order execution, though constrained by downstream resources such as scheduler queues. The pipeline supports resource limits including 11 in-flight stores after an unresolved branch—a reduction from the prior Cortex-A72's 15—and up to 4 outstanding L1 instruction cache misses.

Memory Hierarchy

The ARM Cortex-A73 implements a hierarchical consisting of private L1 caches per core and a shared L2 cache within the processor cluster, optimized for low-latency access in power-constrained environments. This prioritizes hit rates and bandwidth for typical mobile workloads while maintaining compatibility with the ARMv8-A architecture's model. The caches use 64-byte line sizes throughout, enabling efficient burst transfers from lower levels. The Level 1 (L1) instruction cache is fixed at 64 KiB and organized as 4-way set-associative with Virtually Indexed Physically Tagged (VIPT) indexing. It incorporates parity bits for single-error detection, allowing the to invalidate corrupted lines and refetch from lower levels without halting execution. The L1 data cache is configurable to either 32 KiB (8-way set-associative) or 64 KiB (16-way set-associative), also VIPT, and operates as write-back to minimize bus traffic. It supports non-temporal stores through a dedicated store buffer that bypasses the cache for full-line writes not present in L1, directing them straight to the L2 cache to avoid pollution in streaming scenarios. The Level 2 (L2) cache is unified, serving both instruction and data requests, and is configurable from 256 KiB to 8 MiB per cluster in powers-of-two increments. It employs 16-way set-associativity and maintains inclusivity with respect to the L1 caches, ensuring that all L1 content is also present in L2 for simplified coherency management across cores. Optional (ECC) protection is available for both tags and data arrays. The Cortex-A73 does not feature an on-core L3 cache; higher-level caching is handled by external system caches in the SoC, such as those integrated in multi-cluster configurations. Memory management is facilitated by an integrated (MMU) compliant with the ARMv8 architecture, supporting hierarchical page tables with a base granule size of 4 KB. It includes Large (LPAE) for up to 40-bit physical addressing, enabling efficient virtual-to-physical translation in systems with large memory footprints. The L1 data cache delivers up to 32 bytes per cycle in load bandwidth, complemented by a hardware that identifies and prefetches streams—up to eight concurrent streams—to reduce latency for linear data patterns common in multimedia and graphics workloads.

Performance and Efficiency

Integer and Floating-Point Performance

The ARM Cortex-A73 features a superscalar out-of-order capable of sustaining approximately 2.0 (IPC) for operations, with potential uplifts to 2.5 IPC in optimized workloads. This represents a 30% over the Cortex-A72 in integer-intensive tasks akin to SPECint benchmarks, driven by enhancements in branch prediction and execution . Floating-point performance is comparatively lower, achieving 1.5-2.0 IPC due to narrower pipelines in the (FPU), which prioritize efficiency for mobile over raw throughput. The core's dual-pipe FPU supports fused multiply-add operations with a latency of 7 cycles, balancing latency-sensitive tasks in graphics and . Integer throughput reaches up to 2 operations per cycle for adds and multiplies across two ALU ports, while divides incur approximately 12 cycles of latency, reflecting a for power-sensitive environments. The SIMD extension delivers 4 single-precision (32-bit) floating-point operations per cycle via its 128-bit vector pathways, enabling effective acceleration for and video workloads but limiting scalability for applications. Under constraints, the Cortex-A73 sustains over 90% of peak , minimizing frequency throttling observed in prior cores through optimized power delivery and microarchitectural efficiencies. Its reordering capacity further enables high IPC by tolerating dependencies in integer streams.

Power Consumption and Efficiency

The ARM Cortex-A73 core is engineered for high efficiency within the constrained mobile power envelope, delivering the highest while maintaining low usage suitable for battery-powered devices. Compared to its predecessor, the Cortex-A72, it achieves up to 30% better power efficiency, enabling either 30% higher at the same power level or 30% reduced power consumption at equivalent levels. This efficiency stems from architectural optimizations such as aggressive and power-optimized RAM designs, which minimize dynamic power dissipation during operation. In terms of power envelope, the Cortex-A73 operates effectively at around 0.5-1.0 W per core when clocked at 2.5 GHz on a , representing over 20% lower power draw than the A72 for integer workloads and even greater savings for floating-point and memory-intensive tasks at iso-frequency. Its thermal design supports sustained high-performance operation without frequent throttling, facilitated by dynamic voltage and (DVFS) that adjusts supply voltage and clock speed in real-time to balance performance and heat generation. Leakage power is controlled through advanced techniques, allowing inactive core sections to enter low-power retention states, which further enhances during idle or lightly loaded scenarios. The core's area efficiency contributes significantly to overall SoC power optimization, occupying up to 46% less die area than the A72 when implemented on the same process node, with a footprint of about 0.65 mm² on 10 nm technology—making it the smallest premium Armv8-A core at the time. This compact design enables denser integration of multiple cores in big.LITTLE configurations, promoting better power balancing across performance and efficiency clusters. Process scaling is optimized for advanced nodes like 7-10 nm, where it achieves peak efficiency through reduced leakage and improved , though it scales reliably to 16 nm and even 28 nm for cost-sensitive applications without substantial efficiency loss. The ARM Cortex-A73 provides approximately 30% higher sustained performance than its predecessor, the Cortex-A72, primarily through improvements in integer instruction throughput while maintaining similar floating-point capabilities. This uplift stems from enhanced branch prediction and reordering mechanisms that allow better handling of sustained workloads, though the A72 may achieve a slightly higher peak IPC in short bursts due to its design focus on bursty performance. In real-world benchmarks, devices with the A73, such as those using the 835, demonstrate 10-20% gains over A72-based systems in web loading and multi-threaded tasks, reflecting the A73's emphasis on thermal stability over raw peak speed. Compared to its successor, the Cortex-A75, the A73 delivers 20-25% lower overall performance at equivalent frequencies and power envelopes, as the A75 introduces a wider 3-wide pipeline for better parallelism. The A75 achieves this with roughly the same energy efficiency, offering up to 25% higher SPECint scores at 1W per core and 30% at 2W, making it more suitable for demanding mobile applications. However, the A73's narrower 2-wide and large reordering capacity provide advantages in power-constrained scenarios where the A75's added complexity can lead to higher leakage. In the context of 2025 contemporaries like the Cortex-A78, the A73 lags significantly, scoring around 415 in 5 single-core tests compared to the A78's 934, underscoring its legacy status in modern . This gap highlights generational advances in the A78, including improved vector processing and larger caches, which double the effective throughput in integer-heavy workloads. For multi-core scaling, a 4-core A73 cluster at 1.844 GHz achieves a PassMark CPU Mark of about 1,368, positioning it as adequate for devices from 2017-2020 but insufficient for current demands. This score reflects efficient scaling within big.LITTLE configurations, where the A73 pairs with efficiency cores to balance loads without excessive throttling. Recent 2024-2025 microbenchmark analyses reveal that the A73's reordering limits, despite being theoretically large, cap its IPC in complex workloads due to small ALU schedulers (6 entries) and narrow fetch bandwidth, often resulting in sustained IPC below 2.0 in SPECint-like tasks. These constraints, analyzed in implementations like the S922X, emphasize the core's efficiency focus over peak throughput, limiting its relevance in today's wider architectures.

Licensing and Implementations

Licensing Model

The ARM Cortex-A73 has been available as a synthesizable (SIP) core since its announcement in May 2016, licensed on a royalty-based model from . This traditional licensing approach involves upfront fees for access to the IP, followed by royalties calculated on the number of chips shipped by the licensee. Exact pricing details are protected under non-disclosure agreements, but estimates for initial mobile licensee fees range from $1 million to $2 million, with royalties typically at 1-2% of the chip's selling price. Arm provides multiple customization levels for the Cortex-A73: a standard core license for direct implementation of the reference design, semi-custom options through the "Built on Cortex" program that allow limited modifications such as tweaks while maintaining architectural compatibility, and full custom designs via architectural licenses for extensive alterations. The core integrates seamlessly with Arm's CoreLink interconnects, such as the CCI-550, to enable cache coherency in multi-cluster system-on-chip (SoC) configurations. The Cortex-A73 is backward-compatible with the broader Armv8 ecosystem, implementing the Armv8-A architecture and supporting TrustZone for secure execution environments.

Custom and Standard Implementations

The ARM Cortex-A73 core was employed in standard configurations by several licensees, enabling direct integration without significant architectural alterations. HiSilicon's Kirin 960, released in 2016, featured four Cortex-A73 performance cores clocked at 2.4 GHz alongside four Cortex-A53 efficiency cores in a big.LITTLE arrangement, fabricated on 's 16 nm process. Similarly, MediaTek's Helio X30, launched in early 2017, incorporated two Cortex-A73 cores at up to 2.6 GHz, paired with four Cortex-A53 cores at 2.2 GHz and four Cortex-A35 cores at 1.8 GHz, all on a to enhance power efficiency. Licensees also pursued semi-custom implementations under ARM's "Built on ARM Cortex Technology" model, which permits targeted modifications such as adjustments to dispatch widths, branch predictors, or cache configurations while retaining the core . A prominent example is Qualcomm's Kryo 280 in the Snapdragon 835 SoC, released in 2017, which utilized eight customized Cortex-A73 cores—all configured as performance cores without smaller variants—optimized for Samsung's 10 nm FinFET process to achieve higher sustained performance through tweaks like expanded execution resources. This approach leveraged the Cortex-A73's baseline design but allowed process-specific tuning, such as improved and , to balance efficiency and throughput on advanced nodes. These implementations commonly integrated the Cortex-A73 in heterogeneous big.LITTLE clusters with efficiency cores like the Cortex-A53 or A35, often paired with ARM's GPUs for graphics processing; for instance, the Kirin 960 used a -G71 MP8, while the Helio X30 opted for an PowerVR 7XT. Post-2017, evolutions included minor silicon optimizations for finer process nodes, as seen in HiSilicon's Kirin 970 of , which retained four Cortex-A73 cores at 2.4 GHz with four Cortex-A53 cores at 1.8 GHz on TSMC's 10 nm node, incorporating refinements like enhanced power management for better thermal stability. Such adaptations extended the core's viability in premium mobile SoCs through the late , aligning with the licensing model's flexibility for incremental enhancements.

Applications and Legacy

Usage in Mobile SoCs

The ARM Cortex-A73 core found widespread adoption in mobile system-on-chips (SoCs) from major vendors during the late , particularly in configurations pairing 2 to 4 high-performance A73 cores with efficiency-oriented cores to balance power and performance in smartphones. These setups typically operated the A73 cores at clock speeds between 2.0 and 2.5 GHz, enabling sustained operation in big.LITTLE architectures for demanding tasks like multitasking and gaming while conserving battery life. HiSilicon integrated the Cortex-A73 into its flagship Kirin series, starting with the Kirin 960 in 2016, which featured four A73 cores at 2.36 GHz alongside four A53 cores at 1.84 GHz and powered devices such as the Huawei Mate 9. The follow-up Kirin 970, launched in 2017, retained a similar octa-core configuration with four A73 cores at 2.36 GHz and four A53 cores at 1.84 GHz, but added a dedicated neural processing unit (NPU) for AI acceleration, appearing in models like the Huawei Mate 10 series. Qualcomm employed A73-based Kryo 280 cores in the Snapdragon 835 SoC of 2017, configuring four such cores at up to 2.45 GHz with four A53 cores at 1.9 GHz, which drove premium devices including the S8. In the mid-range segment, the Snapdragon 660 (2017) used four A73 cores at 2.2 GHz paired with four A53 cores at 1.84 GHz, while the Snapdragon 636 (2018) featured four A73 cores at 1.8 GHz paired with four A53 cores at 1.6 GHz, appearing in various budget Android handsets. MediaTek incorporated the Cortex-A73 into both premium and mid-range offerings, with the Helio X30 in 2018 adopting a unique deca-core design of two A73 cores at 2.56 GHz, four A53 cores at 2.2 GHz, and four ultra-efficient Cortex-A35 cores at 1.9 GHz, though it saw limited uptake in major flagships like select models. The Helio P series variants, such as the (2018), shifted to four A73 cores at 2.0 GHz with four A53 cores at 2.0 GHz, targeting affordable devices with AI capabilities via an integrated processing unit. Samsung utilized the Cortex-A73 in mid-range Exynos SoCs, notably the Exynos 7885 from 2017, which combined two A73 cores at 2.2 GHz with six A53 cores at 1.6 GHz and powered Galaxy A and J series devices like the Galaxy A8 (2018). Overall, the Cortex-A73 achieved peak integration in Android smartphones from 2017 to 2019, forming the performance backbone of numerous flagships and mid-tier models across these vendors.

Current Status and Legacy in 2025

By 2020, the Cortex-A73 had been phased out of flagship mobile SoCs in favor of newer architectures like the Cortex-A77 and custom designs, as manufacturers shifted toward higher-performance cores for premium devices. As of 2025, it persists primarily in low-end devices and embedded systems, where its mature design supports basic tasks like web browsing and light multimedia without demanding advanced power budgets. Ongoing adoption in 2025 centers on budget IoT applications, such as smart appliances and gateways via processors like the SL1680, which integrates quad-core Cortex-A73 at 2.1 GHz for efficient . In automotive contexts, it serves as a foundational element for in-vehicle (IVI) and digital systems, enabling reliable performance in cost-sensitive setups like the RK3572 for single-board computers. Legacy maintenance continues for older handsets, with re-spins on 28 nm processes sustaining availability in entry-level markets. The Cortex-A73's impact lies in enabling efficient throughout the by prioritizing sustained performance under thermal constraints, which influenced ARM's transition to the DynamIQ big.LITTLE framework starting with the Cortex-A75 in 2017. Its efficiency techniques, including out-of-order retirement and compact resource structures, were inherited by successors like the A75 and A76, which built on its two-wide to achieve 20-30% performance gains at similar power levels. In 2024 studies, the core's design—featuring a verification queue limited to 11 stores after branches—provided historical insights into trade-offs between reordering capacity and power efficiency in ARM's evolution. Implementations remain vulnerable to certain security issues, such as CVE-2024-10929, requiring updates to Trusted Firmware-A and for protection. With limited representation in new ARM-based chips shipped in 2025, the Cortex-A73's reflects its niche in cost-optimized, low-volume segments rather than broad .

References

  1. https://en.wikichip.org/wiki/mediatek/helio/mt6799
  2. https://en.wikichip.org/wiki/hisilicon/kirin/970
  3. https://en.wikichip.org/wiki/qualcomm/snapdragon_6
  4. https://en.wikichip.org/wiki/samsung/exynos/7885
Add your contribution
Related Hubs
User Avatar
No comments yet.