Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

ARM architecture family

From Wikipedia, the free encyclopedia
(Redirected fromArmhf)
Family of computer architectures
"ARM architecture" redirects here. For the Australian architectural firm, seeARM Architecture (company).

ARM
Designer
Bits32-bit,64-bit
Introduced1985; 40 years ago (1985)
DesignRISC
TypeLoad–store
BranchingCondition code, compare and branch
OpenProprietary
ARMAArch64 (64/32-bit)
Introduced2011; 14 years ago (2011)
VersionARMv8-R, ARMv8-A, ARMv8.1-A, ARMv8.2-A, ARMv8.3-A, ARMv8.4-A, ARMv8.5-A, ARMv8.6-A, ARMv8.7-A, ARMv8.8-A, ARMv8.9-A, ARMv9.0-A, ARMv9.1-A, ARMv9.2-A, ARMv9.3-A, ARMv9.4-A, ARMv9.5-A, ARMv9.6-A
EncodingAArch64/A64 andAArch32/A32 use 32-bit instructions, AArch32/T32 (Thumb-2) uses mixed 16- and 32-bit instructions[1]
EndiannessBi (little as default)
ExtensionsSVE, SVE2, SME, AES, SM3, SM4, SHA, CRC32, RNDR, TME; All mandatory:Thumb-2,Neon, VFPv4-D16, VFPv4; obsolete:Jazelle
Registers
General-purpose31 × 64-bit integer registers[1]
Floating point32 ×128-bit registers[1] for scalar 32- and 64-bitFP orSIMD FP or integer; or cryptography
ARMAArch32 (32-bit)
VersionARMv9-R, ARMv9-M, ARMv8-R, ARMv8-M, ARMv7-A, ARMv7-R, ARMv7E-M, ARMv7-M, ARMv6-M
Encoding32-bit, except Thumb-2 extensions use mixed 16- and 32-bit instructions.
EndiannessBi (little as default)
ExtensionsThumb-2,Neon,Jazelle, AES, SM3, SM4, SHA, CRC32, RNDR, DSP, Saturated, FPv4-SP, FPv5, Helium
Registers
General-purpose15 × 32-bit integer registers, including R14 (link register), but not R15 (PC)
Floating pointUp to 32 × 64-bit registers,[2] SIMD/floating-point (optional)
ARM 32-bit (legacy)
VersionARMv6, ARMv5, ARMv4T, ARMv3, ARMv2
Encoding32-bit, except Thumb extension uses mixed 16- and 32-bit instructions.
EndiannessBi (little as default) in ARMv3 and above
ExtensionsThumb,Jazelle
Registers
General-purpose15 × 32-bit integer registers, including R14 (link register), but not R15 (PC, 26-bit addressing in older)
Floating pointNone

ARM (stylised in lowercase asarm, formerly an acronym forAdvanced RISC Machines and originallyAcorn RISC Machine) is a family ofRISCinstruction set architectures (ISAs) forcomputer processors.Arm Holdings develops the ISAs and licenses them to other companies, who build the physical devices that use the instruction set. It also designs and licensescores that implement these ISAs.

Due to their low costs, low power consumption, and low heat generation, ARM processors are useful for light, portable, battery-powered devices, includingsmartphones,laptops, andtablet computers, as well asembedded systems.[3][4][5] However, ARM processors are also used fordesktops andservers, includingFugaku, the world's fastestsupercomputer from 2020[6] to 2022. With over 230 billion ARM chips produced,[7][8] since at least 2003, and with its dominance increasing every year[update], ARM is the most widely used family of instruction set architectures.[9][4][10][11][12]

There have been several generations of the ARM design. The original ARM1 used a32-bit internal structure but had a 26-bitaddress space that limited it to 64 MB ofmain memory. This limitation was removed in the ARMv3 series, which has a 32-bit address space, and several additional generations up to ARMv7 remained 32-bit. Released in 2011, the ARMv8-A architecture added support for a64-bit address space and 64-bit arithmetic with its new 32-bit fixed-length instruction set.[13] Arm Holdings has also released a series of additional instruction sets for different rules; the "Thumb" extension adds both 32- and 16-bit instructions for improvedcode density, whileJazelle added instructions for directly handlingJava bytecode. More recent changes include the addition ofsimultaneous multithreading (SMT) for improved performance orfault tolerance.[14]

History

[edit]

BBC Micro

[edit]
Main article:BBC Micro

Acorn Computers' first widely successful design was theBBC Micro, introduced in December 1981. This was a relatively conventional machine based on theMOS Technology 6502 CPU but ran at roughly double the performance of competing designs like theApple II due to its use of fasterdynamic random-access memory (DRAM). Typical DRAM of the era ran at about 2 MHz; Acorn arranged a deal withHitachi for a supply of faster 4 MHz parts.[15]

Machines of the era generally shared memory between the processor and theframebuffer, which allowed the processor to quickly update the contents of the screen without having to perform separateinput/output (I/O). As the timing of the video display is exacting, the video hardware had to have priority access to that memory. Due to a quirk of the 6502's design, the CPU left the memory untouched for half of the time. Thus by running the CPU at 1 MHz, the video system could read data during those down times, taking up the total 2 MHz bandwidth of the RAM. In the BBC Micro, the use of 4 MHz RAM allowed the same technique to be used, but running at twice the speed. This allowed it to outperform any similar machine on the market.[16]

Acorn Business Computer

[edit]
Main article:Acorn Business Computer

1981 was also the year that theIBM Personal Computer was introduced. Using the recently introducedIntel 8088, a16-bit CPU compared to the 6502's8-bit design, it offered higher overall performance. Its introduction changed the desktop computer market radically: what had been largely a hobby and gaming market emerging over the prior five years began to change to a must-have business tool where the earlier 8-bit designs simply could not compete. Even newer32-bit designs were also coming to market, such as theMotorola 68000[17] andNational Semiconductor NS32016.[18]

Acorn began considering how to compete in this market and produced a new paper design named theAcorn Business Computer. They set themselves the goal of producing a machine with ten times the performance of the BBC Micro, but at the same price.[19] This would outperform and underprice the PC. At the same time, the recent introduction of theApple Lisa brought thegraphical user interface (GUI) concept to a wider audience and suggested the future belonged to machines with a GUI.[20] The Lisa, however, cost $9,995, as it was packed with support chips, large amounts of memory, and ahard disk drive, all very expensive then.[21]

The engineers then began studying all of the CPU designs available. Their conclusion about the existing 16-bit designs was that they were a lot more expensive and were still "a bit crap",[22] offering only slightly higher performance than their BBC Micro design. They also almost always demanded a large number of support chips to operate even at that level, which drove up the cost of the computer as a whole. These systems would simply not hit the design goal.[22] They also considered the new 32-bit designs, but these cost even more and had the same issues with support chips.[23] According toSophie Wilson, all the processors tested at that time performed about the same, with about a 4 Mbit/s bandwidth.[24][a]

Two key events led Acorn down the path to ARM. One was the publication of a series of reports from theUniversity of California, Berkeley, which suggested that a simple chip design could nevertheless have extremely high performance, much higher than the latest 32-bit designs on the market.[25] The second was a visit bySteve Furber and Sophie Wilson to theWestern Design Center, a company run byBill Mensch and his sister, which had become the logical successor to the MOS team and was offering new versions like theWDC 65C02. The Acorn team saw high school students producing chip layouts on Apple II machines, which suggested that anyone could do it.[26][27] In contrast, a visit to another design firm working on modern 32-bit CPU revealed a team with over a dozen members who were already on revision H of their design and yet it still contained bugs.[b] This cemented their late 1983 decision to begin their own CPU design, the Acorn RISC Machine.[28]

Design concepts

[edit]

The originalBerkeley RISC designs were in some sense teaching systems, not designed specifically for outright performance. To the RISC's basic register-heavy and load/store concepts, ARM added a number of the well-received design notes of the 6502. Primary among them was the ability to quickly serveinterrupts, which allowed the machines to offer reasonableinput/output performance with no added external hardware. To offer interrupts with similar performance as the 6502, the ARM design limited its physicaladdress space to 64 MB of total addressable space, requiring 26 bits of address. As instructions were 4 bytes (32 bits) long, and required to be aligned on 4-byte boundaries, the lower 2 bits of an instruction address were always zero. This meant theprogram counter (PC) only needed to be 24 bits, allowing it to be stored along with the eight bitprocessor flags in a single 32-bit register. That meant that upon receiving an interrupt, the entire machine state could be saved in a single operation, whereas had the PC been a full 32-bit value, it would require separate operations to store the PC and the status flags. This decision halved the interrupt overhead.[29]

Another change, and among the most important in terms of practical real-world performance, was the modification of theinstruction set to take advantage ofpage mode DRAM. Recently introduced, page mode allowed subsequent accesses of memory to run twice as fast if they were roughly in the same location, or "page", in the DRAM chip. Berkeley's design did not consider page mode and treated all memory equally. The ARM design added special vector-like memory access instructions, the "S-cycles", that could be used to fill or save multiple registers in a single page using page mode. This doubled memory performance when they could be used, and was especially important for graphics performance.[30]

The Berkeley RISC designs usedregister windows to reduce the number of register saves and restores performed inprocedure calls; the ARM design did not adopt this.

Wilson developed the instruction set, writing a simulation of the processor inBBC BASIC that ran on a BBC Micro with asecond 6502 processor.[31][32] This convinced Acorn engineers they were on the right track. Wilson approached Acorn's CEO,Hermann Hauser, and requested more resources. Hauser gave his approval and assembled a small team to design the actual processor based on Wilson's ISA.[33] The official Acorn RISC Machine project started in October 1983.

ARM1

[edit]
ARM1 2nd processor for the BBC Micro

Acorn choseVLSI Technology as the "silicon partner", as they were a source of ROMs and custom chips for Acorn. Acorn provided the design and VLSI provided the layout and production. The first samples of ARM silicon worked properly when first received and tested on 26 April 1985.[3] Known as ARM1, these versions ran at 6 MHz.[34]

The first ARM application was as a second processor for the BBC Micro, where it helped in developing simulation software to finish development of the support chips (VIDC, IOC, MEMC), and sped up theCAD software used in ARM2 development. Wilson subsequently rewroteBBC BASIC in ARMassembly language. The in-depth knowledge gained from designing the instruction set enabled the code to be very dense, making ARM BBC BASIC an extremely good test for any ARM emulator.

ARM2

[edit]

The result of the simulations on the ARM1 boards led to the late 1986 introduction of the ARM2 design running at 8 MHz, and the early 1987 speed-bumped version at 10 to 12 MHz.[c] A significant change in the underlying architecture was the addition of aBooth multiplier, whereas formerly multiplication had to be carried out in software.[36] Further, a new Fast Interrupt reQuest mode, FIQ for short, allowed registers 8 through 14 to be replaced as part of the interrupt itself. This meant FIQ requests did not have to save out their registers, further speeding interrupts.[37]

The first use of the ARM2 was theAcorn Archimedes personal computer models A305, A310, and A440 launched in 1987.

According to theDhrystone benchmark, the ARM2 was roughly seven times the performance of a typical 7 MHz 68000-based system like theAmiga orMacintosh SE. It was twice as fast as anIntel 80386 running at 16 MHz, and about the same speed as a multi-processorVAX-11/784superminicomputer. The only systems that beat it were theSun SPARC andMIPS R2000 RISC-basedworkstations.[38] Further, as the CPU was designed for high-speed I/O, it dispensed with many of the support chips seen in these machines; notably, it lacked any dedicateddirect memory access (DMA) controller which was often found on workstations. The graphics system was also simplified based on the same set of underlying assumptions about memory and timing. The result was a dramatically simplified design, offering performance on par with expensive workstations but at a price point similar to contemporary desktops.[38]

The ARM2 featured a32-bitdata bus,26-bit address space and 27 32-bitregisters, of which 16 are accessible at any one time (including thePC).[39] The ARM2 had atransistor count of just 30,000,[40] compared to Motorola's six-year-older 68000 model with around 68,000. Much of this simplicity came from the lack ofmicrocode, which represents about one-quarter to one-third of the 68000's transistors, and the lack of (like most CPUs of the day) acache. This simplicity enabled the ARM2 to have a low power consumption and simpler thermal packaging by having fewer powered transistors. Nevertheless, ARM2 offered better performance than the contemporary 1987IBM PS/2 Model 50, which initially utilised anIntel 80286, offering 1.8 MIPS @ 10 MHz, and later in 1987, the 2 MIPS of the PS/2 70, with itsIntel 386 DX @ 16 MHz.[41][42]

A successor, ARM3, was produced with a 4 KB cache, which further improved performance.[43] The address bus was extended to 32 bits in the ARM6, but program code still had to lie within the first 64 MB of memory in 26-bit compatibility mode, due to the reserved bits for the status flags.[44]

Advanced RISC Machines Ltd. – ARM6

[edit]
Microprocessor-based system on a chip
Die of an ARM610 microprocessor

In the late 1980s,Apple Computer andVLSI Technology started working with Acorn on newer versions of the ARM core. In 1990, Acorn spun off the design team into a new company named Advanced RISC Machines Ltd.,[45][46][47] which became ARM Ltd. when its parent company,Arm Holdings plc, floated on theLondon Stock Exchange andNasdaq in 1998.[48] The new Apple–ARM work would eventually evolve into the ARM6, first released in early 1992. Apple used the ARM6-based ARM610 as the basis for theirApple Newton PDA.

Early licensees

[edit]

In 1994, Acorn used the ARM610 as the maincentral processing unit (CPU) in theirRiscPC computers.DEC licensed the ARMv4 architecture and produced theStrongARM.[49] At 233 MHz, this CPU drew only one watt (newer versions draw far less). This work was later passed to Intel as part of a lawsuit settlement, and Intel took the opportunity to supplement theiri960 line with the StrongARM. Intel later developed its own high performance implementation namedXScale, which it has since sold toMarvell. Transistor count of the ARM core remained essentially the same throughout these changes; ARM2 had 30,000 transistors,[50] while ARM6 grew only to 35,000.[51]

Market share

[edit]

In 2005, about 98% of all mobile phones sold used at least one ARM processor.[52] In 2010, producers of chips based on ARM architectures reported shipments of 6.1 billionARM-based processors, representing 95% ofsmartphones, 35% ofdigital televisions andset-top boxes, and 10% ofmobile computers. In 2011, the 32-bit ARM architecture was the most widely used architecture in mobile devices and the most popular 32-bit one in embedded systems.[53] In 2013, 10 billion were produced[54] and "ARM-based chips are found in nearly 60 percent of the world's mobile devices".[55]

Licensing

[edit]
See also:Arm Holdings § Licensees
Die of aSTM32­F103VGT6ARM Cortex-M3 microcontroller with 1 MBflash memory bySTMicroelectronics

Core licence

[edit]

Arm Holdings's primary business is sellingIP cores, which licensees use to createmicrocontrollers (MCUs),CPUs, andsystems-on-chips based on those cores. Theoriginal design manufacturer combines the ARM core with other parts to produce a complete device, typically one that can be built in existingsemiconductor fabrication plants (fabs) at low cost and still deliver substantial performance. The most successful implementation has been theARM7TDMI with hundreds of millions sold.Atmel has been a precursor design center in the ARM7TDMI-based embedded system.

The ARM architectures used in smartphones, PDAs and othermobile devices range from ARMv5 toARMv8-A.

In 2009, some manufacturers introduced netbooks based on ARM architecture CPUs, in direct competition with netbooks based onIntel Atom.[56]

Arm Holdings offers a variety of licensing terms, varying in cost and deliverables. Arm Holdings provides to all licensees an integratable hardware description of the ARM core as well as complete software development toolset (compiler,debugger,software development kit), and the right to sell manufacturedsilicon containing the ARM CPU.

SoC packages integrating ARM's core designs include Nvidia Tegra's first three generations, CSR plc's Quatro family, ST-Ericsson's Nova and NovaThor, Silicon Labs's Precision32 MCU, Texas Instruments'sOMAP products, Samsung's Hummingbird andExynos products, Apple'sA4,A5, andA5X, andNXP'si.MX.

Fabless licensees, who wish to integrate an ARM core into their own chip design, are usually only interested in acquiring a ready-to-manufacture verifiedsemiconductor intellectual property core. For these customers, Arm Holdings delivers agate netlist description of the chosen ARM core, along with an abstracted simulation model and test programs to aid design integration and verification. More ambitious customers, including integrated device manufacturers (IDM) and foundry operators, choose to acquire the processor IP insynthesizableRTL (Verilog) form. With the synthesizable RTL, the customer has the ability to perform architectural level optimisations and extensions. This allows the designer to achieve exotic design goals not otherwise possible with an unmodified netlist (high clock speed, very low power consumption, instruction set extensions, etc.). While Arm Holdings does not grant the licensee the right to resell the ARM architecture itself, licensees may freely sell manufactured products such as chip devices, evaluation boards and complete systems.Merchant foundries can be a special case; not only are they allowed to sell finished silicon containing ARM cores, they generally hold the right to re-manufacture ARM cores for other customers.

Arm Holdings prices its IP based on perceived value. Lower performing ARM cores typically have lower licence costs than higher performing cores. In implementation terms, a synthesisable core costs more than a hard macro (blackbox) core. Complicating price matters, a merchant foundry that holds an ARM licence, such as Samsung or Fujitsu, can offer fab customers reduced licensing costs. In exchange for acquiring the ARM core through the foundry's in-house design services, the customer can reduce or eliminate payment of ARM's upfront licence fee.

Compared to dedicated semiconductor foundries (such asTSMC andUMC) without in-house design services, Fujitsu/Samsung charge two- to three-times more per manufacturedwafer.[citation needed] For low to mid volume applications, a design service foundry offers lower overall pricing (through subsidisation of the licence fee). For high volume mass-produced parts, the long term cost reduction achievable through lower wafer pricing reduces the impact of ARM's NRE (non-recurring engineering) costs, making the dedicated foundry a better choice.

Companies that have developed chips with cores designed by Arm includeAmazon.com'sAnnapurna Labs subsidiary,[57]Analog Devices,Apple,AppliedMicro (now:MACOM Technology Solutions[58]),Atmel,Broadcom,Cavium,Cypress Semiconductor,Freescale Semiconductor (nowNXP Semiconductors),Huawei,Intel,[dubiousdiscuss]Maxim Integrated,Nvidia,NXP,Qualcomm,Renesas,Samsung Electronics,ST Microelectronics,Texas Instruments, andXilinx.

Built on ARM Cortex Technology licence

[edit]

In February 2016, ARM announced the Built on ARM Cortex Technology licence, often shortened to Built on Cortex (BoC) licence. This licence allows companies to partner with ARM and make modifications to ARM Cortex designs. These design modifications will not be shared with other companies. These semi-custom core designs also have brand freedom, for exampleKryo 280.

Companies that are current licensees of Built on ARM Cortex Technology includeQualcomm.[59]

Architectural licence

[edit]

Companies can also obtain an ARMarchitectural licence for designing their own CPU cores using the ARM instruction sets. These cores must comply fully with the ARM architecture. Companies that have designed cores that implement an ARM architecture include Apple, AppliedMicro (now:Ampere Computing), Broadcom,Cavium (now: Marvell),Digital Equipment Corporation, Intel, Nvidia, Qualcomm, Samsung Electronics,Fujitsu, and NUVIA Inc. (acquired by Qualcomm in 2021).

ARM Flexible Access

[edit]

On 16 July 2019, ARM announced ARM Flexible Access. ARM Flexible Access provides unlimited access to included ARMintellectual property (IP) for development. Per product licence fees are required once a customer reaches foundry tapeout or prototyping.[60][61]

75% of ARM's most recent IP over the last two years are included in ARM Flexible Access. As of October 2019:

  • CPUs:Cortex-A5,Cortex-A7,Cortex-A32,Cortex-A34,Cortex-A35,Cortex-A53,Cortex-R5,Cortex-R8,Cortex-R52,Cortex-M0,Cortex-M0+,Cortex-M3,Cortex-M4,Cortex-M7,Cortex-M23,Cortex-M33
  • GPUs:Mali-G52,Mali-G31. Includes Mali Driver Development Kits (DDK).
  • Interconnect: CoreLink NIC-400, CoreLink NIC-450, CoreLink CCI-400, CoreLink CCI-500, CoreLink CCI-550, ADB-400 AMBA, XHB-400 AXI-AHB
  • System Controllers: CoreLink GIC-400, CoreLink GIC-500, PL192 VIC, BP141 TrustZone Memory Wrapper, CoreLink TZC-400, CoreLink L2C-310, CoreLink MMU-500, BP140 Memory Interface
  • Security IP: CryptoCell-312, CryptoCell-712, TrustZone True Random Number Generator
  • Peripheral Controllers: PL011 UART, PL022 SPI, PL031 RTC
  • Debug & Trace: CoreSight SoC-400, CoreSight SDC-600, CoreSight STM-500, CoreSight System Trace Macrocell, CoreSight Trace Memory Controller
  • Design Kits: Corstone-101, Corstone-201
  • Physical IP: Artisan PIK for Cortex-M33 TSMC 22ULL including memory compilers, logic libraries, GPIOs and documentation
  • Tools & Materials: Socrates IP ToolingARM Design Studio, Virtual System Models
  • Support: Standard ARM Technical support, ARM online training, maintenance updates, credits toward onsite training and design reviews

Cores

[edit]
Main article:List of ARM processors
ArchitectureCore
bit-width
CoresProfileRefe-
rences
Arm Ltd.Third-party
ARMv1
ARM1
Classic
ARMv2
32
ARM2, ARM250,ARM3Amber, STORM Open Soft Core[62]
Classic
ARMv3
32
ARM6,ARM7
Classic
ARMv4
32
ARM8StrongARM, FA526, ZAP Open Source Processor Core
Classic
ARMv4T
32
ARM7TDMI,ARM9TDMI,SecurCore SC100
Classic
ARMv5TE
32
ARM7EJ,ARM9E,ARM10EXScale, FA626TE, Feroceon, PJ1/Mohawk
Classic
ARMv6
32
ARM11
Classic
ARMv6-M
32
ARM Cortex-M0,ARM Cortex-M0+,ARM Cortex-M1,SecurCore SC000
ARMv7-M
32
ARM Cortex-M3,SecurCore SC300Apple M7 motion coprocessor
Microcontroller
ARMv7E-M
32
ARM Cortex-M4,ARM Cortex-M7
Microcontroller
ARMv8-M
32
ARM Cortex-M23,[64]ARM Cortex-M33[65]
Microcontroller
ARMv8.1-M
32
ARM Cortex-M55,ARM Cortex-M85
Microcontroller
ARMv7-R
32
ARM Cortex-R4,ARM Cortex-R5,ARM Cortex-R7,ARM Cortex-R8
ARMv8-R
32
ARM Cortex-R52
Real-time
64
ARM Cortex-R82
Real-time
ARMv7-A
32
ARM Cortex-A5,ARM Cortex-A7,ARM Cortex-A8,ARM Cortex-A9,ARM Cortex-A12,ARM Cortex-A15,ARM Cortex-A17QualcommScorpion/Krait, PJ4/Sheeva, Apple Swift (A6,A6X)
ARMv8-A
32
ARM Cortex-A32[71]
Application
64/32
ARM Cortex-A35,[72]ARM Cortex-A53,ARM Cortex-A57,[73]ARM Cortex-A72,[74]ARM Cortex-A73[75]X-Gene,Nvidia Denver 1/2,Cavium ThunderX,AMD K12, Apple Cyclone (A7)/Typhoon (A8,A8X)/Twister (A9,A9X)/Hurricane+Zephyr (A10,A10X),QualcommKryo, Samsung M1/M2 ("Mongoose") /M3 ("Meerkat")
Application
ARM Cortex-A34[81]
Application
ARMv8.1-A
64/32
TBACavium ThunderX2
Application
ARMv8.2-A
64/32
ARM Cortex-A55,[83]ARM Cortex-A75,[84]ARM Cortex-A76,[85]ARM Cortex-A77,ARM Cortex-A78,ARM Cortex-X1,ARM Neoverse N1Nvidia Carmel, Samsung M4 ("Cheetah"),Fujitsu A64FX (ARMv8 SVE 512-bit)
Application
64
ARM Cortex-A65,ARM Neoverse E1 withsimultaneous multithreading (SMT),ARM Cortex-A65AE[89] (also having e.g. ARMv8.4 Dot Product; made for safety critical tasks such asadvanced driver-assistance systems (ADAS))Apple Monsoon+Mistral (A11) (September 2017)
Application
ARMv8.3-A
64/32
TBA
Application
64
TBAApple Vortex+Tempest (A12,A12X,A12Z), Marvell ThunderX3 (v8.3+)[90]
Application
ARMv8.4-A
64/32
TBA
Application
64
ARM Neoverse V1Apple Lightning+Thunder (A13), Apple Firestorm+Icestorm (A14,M1)
Application
ARMv8.5-A
64/32
TBA
Application
64
TBA
Application
ARMv8.6-A
64
TBAApple Avalanche+Blizzard (A15,M2), Apple Everest+Sawtooth (A16),[91]A17
Application
ARMv8.7-A
64
TBA
Application
ARMv8.8-A
64
TBA
Application
ARMv8.9-A
64
TBA
Application
ARMv9.0-A
64
ARM Cortex-A510,ARM Cortex-A710,ARM Cortex-A715,ARM Cortex-X2,ARM Cortex-X3,ARM Neoverse E2,ARM Neoverse N2,ARM Neoverse V2
Application
ARMv9.1-A
64
TBA
Application
ARMv9.2-A
64
ARM Cortex-A520,ARM Cortex-A720,ARM Cortex-X4,ARM Neoverse V3,[95]ARM Cortex-X925[96]Apple M4,[97]A18
Application
ARMv9.3-A
64
TBA
Application
ARMv9.4-A
64
TBA
Application
ARMv9.5-A
64
TBA
Application
ARMv9.6-A
64
TBA
Application
  1. ^abAlthough mostdatapaths andCPU registers in the early ARM processors were 32-bit,addressable memory was limited to 26 bits; with upper bits, then, used for status flags in the program counter register.
  2. ^abcARMv3 included a compatibility mode to support the26-bit addresses of earlier versions of the architecture. This compatibility modeoptional in ARMv4, and removed entirely in ARMv5.

Arm provides a list of vendors who implement ARM cores in their design (application specific standard products (ASSP), microprocessor and microcontrollers).[102]

Example applications of ARM cores

[edit]
Tronsmart MK908, aRockchip-based quad-core Android "mini PC", with a microSD card next to it for a size comparison
Main article:List of products using ARM processors

ARM cores are used in a number of products, particularlyPDAs andsmartphones. Somecomputing examples areMicrosoft'sfirst generation Surface,Surface 2 andPocket PC devices (following2002),Apple'siPads, andAsus'sEee Pad Transformertablet computers, and severalChromebook laptops. Others include Apple'siPhonesmartphones andiPodportable media players,Canon PowerShotdigital cameras,Nintendo Switch hybrid, theWii security processor and3DShandheld game consoles, andTomTom turn-by-turnnavigation systems.

In 2005, Arm took part in the development ofManchester University's computerSpiNNaker, which used ARM cores to simulate thehuman brain.[103]

ARM chips are also used inRaspberry Pi,BeagleBoard,BeagleBone,PandaBoard, and othersingle-board computers, because they are very small, inexpensive, and consume very little power.

32-bit architecture

[edit]
An ARMv7 was used to power older versions of the popularRaspberry Pi single-board computers like this Raspberry Pi 2 from 2015.
An ARMv7 is also used to power theCuBox family of single-board computers.
See also:Comparison of ARMv7-A processors

The 32-bit ARM architecture (ARM32), such asARMv7-A (implementing AArch32; seesection on Armv8-A for more on it), was the most widely used architecture in mobile devices as of 2011[update].[53]

Since 1995, various versions of theARM Architecture Reference Manual (see§ External links) have been the primary source of documentation on the ARM processor architecture and instruction set, distinguishing interfaces that all ARM processors are required to support (such as instruction semantics) from implementation details that may vary. The architecture has evolved over time, and version seven of the architecture, ARMv7, defines three architecture "profiles":

  • A-profile, the "Application" profile, implemented by 32-bit cores in theCortex-A series and by some non-ARM cores
  • R-profile, the "Real-time" profile, implemented by cores in theCortex-R series
  • M-profile, the "Microcontroller" profile, implemented by most cores in theCortex-M series

Although the architecture profiles were first defined for ARMv7, ARM subsequently defined the ARMv6-M architecture (used by the CortexM0/M0+/M1) as a subset of the ARMv7-M profile with fewer instructions.

CPU modes

[edit]

Except in the M-profile, the 32-bit ARM architecture specifies several CPU modes, depending on the implemented architecture features. At any moment in time, the CPU can be in only one mode, but it can switch modes due to external events (interrupts) or programmatically.[104]

  • User mode: The only non-privileged mode.
  • FIQ mode: A privileged mode that is entered whenever the processor accepts afast interrupt request.
  • IRQ mode: A privileged mode that is entered whenever the processor accepts an interrupt.
  • Supervisor (svc) mode: A privileged mode entered whenever the CPU is reset or when an SVC instruction is executed.
  • Abort mode: A privileged mode that is entered whenever a prefetch abort or data abort exception occurs.
  • Undefined mode: A privileged mode that is entered whenever an undefined instruction exception occurs.
  • System mode (ARMv4 and above): The only privileged mode that is not entered by an exception. It can only be entered by executing an instruction that explicitly writes to the mode bits of the Current Program Status Register (CPSR) from another privileged mode (not from user mode).
  • Monitor mode (ARMv6 and ARMv7 Security Extensions, ARMv8 EL3): A monitor mode is introduced to support TrustZone extension in ARM cores.
  • Hyp mode (ARMv7 Virtualization Extensions, ARMv8 EL2): A hypervisor mode that supportsPopek and Goldberg virtualization requirements for the non-secure operation of the CPU.[105][106]
  • Thread mode (ARMv6-M, ARMv7-M, ARMv8-M): A mode which can be specified as either privileged or unprivileged. Whether the Main Stack Pointer (MSP) or Process Stack Pointer (PSP) is used can also be specified in CONTROL register with privileged access. This mode is designed for user tasks in RTOS environment but it is typically used in bare-metal for super-loop.
  • Handler mode (ARMv6-M, ARMv7-M, ARMv8-M): A mode dedicated for exception handling (except the RESET which are handled in Thread mode). Handler mode always uses MSP and works in privileged level.

Instruction set

[edit]

The original (and subsequent) ARM implementation was hardwired withoutmicrocode, like the much simpler8-bit6502 processor used in prior Acorn microcomputers.

The 32-bit ARM architecture (and the 64-bit architecture for the most part) includes the following RISC features:

  • Load–store architecture.
  • No support forunaligned memory accesses in the original version of the architecture. ARMv6 and later, except some microcontroller versions, support unaligned accesses for half-word and single-word load/store instructions with some limitations, such as no guaranteedatomicity.[107][108]
  • Uniform 16 × 32-bitregister file (including the program counter, stack pointer and the link register).
  • Fixed instruction width of 32 bits to ease decoding andpipelining, at the cost of decreasedcode density. Later, theThumb instruction set added 16-bit instructions and increased code density.
  • Mostly single clock-cycle execution.

To compensate for the simpler design, compared with processors like the Intel 80286 andMotorola 68020, some additional design features were used:

  • Conditional execution of most instructions reduces branch overhead and compensates for the lack of abranch predictor in early chips.
  • Arithmetic instructions altercondition codes only when desired.
  • 32-bitbarrel shifter can be used without performance penalty with most arithmetic instructions and address calculations.
  • Has powerful indexedaddressing modes.
  • Alink register supports fast leaf function calls.
  • A simple, but fast, 2-priority-levelinterrupt subsystem has switched register banks.

Arithmetic instructions

[edit]

ARM includes integer arithmetic operations for add, subtract, and multiply; some versions of the architecture also support divide operations.

ARM supports 32-bit × 32-bit multiplies with either a 32-bit result or 64-bit result, though Cortex-M0 / M0+ / M1 cores do not support 64-bit results.[109] Some ARM cores also support 16-bit × 16-bit and 32-bit × 16-bit multiplies.

The divide instructions are only included in the following ARM architectures:

  • Armv7-M and Armv7E-M architectures always include divide instructions.[110]
  • Armv7-R architecture always includes divide instructions in the Thumb instruction set, but optionally in its 32-bit instruction set.[111]
  • Armv7-A architecture optionally includes the divide instructions. The instructions might not be implemented, or implemented only in the Thumb instruction set, or implemented in both the Thumb and ARM instruction sets, or implemented if the Virtualization Extensions are included.[111]

Registers

[edit]
Registers across CPU modes
usrsyssvcabtundirqfiq
R0
R1
R2
R3
R4
R5
R6
R7
R8R8_fiq
R9R9_fiq
R10R10_fiq
R11R11_fiq
R12R12_fiq
R13R13_svcR13_abtR13_undR13_irqR13_fiq
R14R14_svcR14_abtR14_undR14_irqR14_fiq
R15
CPSR
SPSR_svcSPSR_abtSPSR_undSPSR_irqSPSR_fiq

Registers R0 through R7 are the same across all CPU modes; they are never banked.

Registers R8 through R12 are the same across all CPU modes except FIQ mode. FIQ mode has its own distinct R8 through R12 registers.

R13 and R14 are banked across all privileged CPU modes except system mode. That is, each mode that can be entered because of an exception has its own R13 and R14. These registers generally contain the stack pointer and the return address from function calls, respectively.

Aliases:

The Current Program Status Register (CPSR) has the following 32 bits.[112]

  • M (bits 0–4) is the processor mode bits.
  • T (bit 5) is the Thumb state bit.
  • F (bit 6) is the FIQ disable bit.
  • I (bit 7) is the IRQ disable bit.
  • A (bit 8) is the imprecise data abort disable bit.
  • E (bit 9) is the data endianness bit.
  • IT (bits 10–15 and 25–26) is the if-then state bits.
  • GE (bits 16–19) is the greater-than-or-equal-to bits.
  • DNM (bits 20–23) is the do not modify bits.
  • J (bit 24) is the Java state bit.
  • Q (bit 27) is the sticky overflow bit.
  • V (bit 28) is the overflow bit.
  • C (bit 29) is the carry/borrow/extend bit.
  • Z (bit 30) is the zero bit.
  • N (bit 31) is the negative/less than bit.

Conditional execution

[edit]

Almost every ARM instruction has a conditional execution feature calledpredication, which is implemented with a 4-bit condition code selector (the predicate). To allow for unconditional execution, one of the four-bit codes causes the instruction to be always executed. Most other CPU architectures only have condition codes on branch instructions.[113]

Though the predicate takes up four of the 32 bits in an instruction code, and thus cuts down significantly on the encoding bits available for displacements in memory access instructions, it avoids branch instructions when generating code for smallif statements. Apart from eliminating the branch instructions themselves, this preserves the fetch/decode/execute pipeline at the cost of only one cycle per skipped instruction.

An algorithm that provides a good example of conditional execution is the subtraction-basedEuclidean algorithm for computing thegreatest common divisor. In theC programming language, the algorithm can be written as:

intgcd(inta,intb){while(a!=b)// We enter the loop when a < b or a > b, but not when a == bif(a>b)// When a > b we do thisa-=b;else// When a < b we do that (no "if (a < b)" needed since a != b is checked in while condition)b-=a;returna;}

The same algorithm can be rewritten in a way closer to target ARMinstructions as:

loop:// Compare a and bGT=a>b;LT=a<b;NE=a!=b;// Perform operations based on flag resultsif(GT)a-=b;// Subtract *only* if greater-thanif(LT)b-=a;// Subtract *only* if less-thanif(NE)gotoloop;// Loop *only* if compared values were not equalreturna;

and coded inassembly language as:

; assign a to register r0, b to r1loop:CMPr0,r1; set condition "NE" if (a ≠ b),;               "GT" if (a > b),;            or "LT" if (a < b)SUBGTr0,r0,r1; if "GT" (Greater Than), then a = a − bSUBLTr1,r1,r0; if "LT" (Less    Than), then b = b − aBNEloop; if "NE" (Not Equal), then loopBlr; return

which avoids the branches around thethen andelse clauses. Ifr0 andr1 are equal then neither of theSUB instructions will be executed, eliminating the need for a conditional branch to implement thewhile check at the top of the loop, for example hadSUBLE (less than or equal) been used.

One of the ways that Thumb code provides a more dense encoding is to remove the four-bit selector from non-branch instructions.

Other features

[edit]

Another feature of theinstruction set is the ability to fold shifts and rotates into thedata processing (arithmetic, logical, and register-register move) instructions, so that, for example, the statement inC language:

a+=(j<<2);

could be rendered as a one-word, one-cycle instruction:[114]

ADDRa,Ra,Rj,LSL#2

This results in the typical ARM program being denser than expected with fewer memory accesses; thus the pipeline is used more efficiently.

The ARM processor also has features rarely seen in other RISC architectures, such asPC-relative addressing (indeed, on the 32-bit[1] ARM thePC is one of its 16 registers) and pre- and post-increment addressing modes.

The ARM instruction set has increased over time. Some early ARM processors (before ARM7TDMI), for example, have no instruction to store a two-byte quantity.

Pipelines and other implementation issues

[edit]

The ARM7 and earlier implementations have a three-stagepipeline; the stages being fetch, decode, and execute. Higher-performance designs, such as the ARM9, have deeper pipelines: Cortex-A8 has thirteen stages. Additional implementation changes for higher performance include a fasteradder and more extensivebranch prediction logic. The difference between the ARM7DI and ARM7DMI cores, for example, was an improved multiplier; hence the added "M".

Coprocessors

[edit]

The ARM architecture (pre-Armv8) provides a non-intrusive way of extending the instruction set using "coprocessors" that can be addressed using MCR, MRC, MRRC, MCRR, and similar instructions. The coprocessor space is divided logically into 16 coprocessors with numbers from 0 to 15, coprocessor 15 (cp15) being reserved for some typical control functions like managing the caches andMMU operation on processors that have one.

In ARM-based machines, peripheral devices are usually attached to the processor by mapping their physical registers into ARM memory space, into the coprocessor space, or by connecting to another device (a bus) that in turn attaches to the processor. Coprocessor accesses have lower latency, so some peripherals—for example, an XScale interrupt controller—are accessible in both ways: through memory and through coprocessors.

In other cases, chip designers only integrate hardware using the coprocessor mechanism. For example, an image processing engine might be a small ARM7TDMI core combined with a coprocessor that has specialised operations to support a specific set of HDTV transcoding primitives.

Debugging

[edit]
This sectionneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources in this section. Unsourced material may be challenged and removed.(March 2011) (Learn how and when to remove this message)

All modern ARM processors include hardware debugging facilities, allowing software debuggers to perform operations such as halting, stepping, and breakpointing of code starting from reset. These facilities are built usingJTAG support, though some newer cores optionally support ARM's own two-wire "SWD" protocol. In ARM7TDMI cores, the "D" represented JTAG debug support, and the "I" represented presence of an "EmbeddedICE" debug module. For ARM7 and ARM9 core generations, EmbeddedICE over JTAG was a de facto debug standard, though not architecturally guaranteed.

The ARMv7 architecture defines basic debug facilities at an architectural level. These include breakpoints, watchpoints and instruction execution in a "Debug Mode"; similar facilities were also available with EmbeddedICE. Both "halt mode" and "monitor" mode debugging are supported. The actual transport mechanism used to access the debug facilities is not architecturally specified, but implementations generally include JTAG support.

There is a separate ARM "CoreSight" debug architecture, which is not architecturally required by ARMv7 processors.

Debug Access Port

[edit]

The Debug Access Port (DAP) is an implementation of an ARM Debug Interface.[115]There are two different supported implementations, the Serial WireJTAG Debug Port (SWJ-DP) and the Serial Wire Debug Port (SW-DP).[116]CMSIS-DAP is a standard interface that describes how various debugging software on a host PC can communicate over USB to firmware running on a hardware debugger, which in turn talks over SWD or JTAG to a CoreSight-enabled ARM Cortex CPU.[117][118][119]

DSP enhancement instructions

[edit]

To improve the ARM architecture fordigital signal processing and multimedia applications, DSP instructions were added to the instruction set.[120] These are signified by an "E" in the name of the ARMv5TE and ARMv5TEJ architectures. E-variants also imply T, D, M, and I.

The new instructions are common indigital signal processor (DSP) architectures. They include variations on signedmultiply–accumulate,saturated add and subtract, andcount leading zeros.

First introduced in 1999, this extension of the core instruction set contrasted with ARM's earlier DSP coprocessor known as Piccolo, which employed a distinct, incompatible instruction set whose execution involved a separate program counter.[121] Piccolo instructions employed a distinct register file of sixteen 32-bit registers, with some instructions combining registers for use as 48-bit accumulators and other instructions addressing 16-bit half-registers. Some instructions were able to operate on two such 16-bit values in parallel. Communication with the Piccolo register file involvedload to Piccolo andstore from Piccolo coprocessor instructions via two buffers of eight 32-bit entries. Described as reminiscent of other approaches, notably Hitachi's SH-DSP and Motorola's 68356, Piccolo did not employ dedicated local memory and relied on the bandwidth of the ARM core for DSP operand retrieval, impacting concurrent performance.[122] Piccolo's distinct instruction set also proved not to be a "good compiler target".[121]

SIMD extensions for multimedia

[edit]

Introduced in the ARMv6 architecture, this was a precursor to Advanced SIMD, also namedNeon.[123]

Jazelle

[edit]
Main article:Jazelle

Jazelle DBX (Direct Bytecode eXecution) is a technique that allowsJava bytecode to be executed directly in the ARM architecture as a third execution state (and instruction set) alongside the existing ARM and Thumb-mode. Support for this state is signified by the "J" in the ARMv5TEJ architecture, and in ARM9EJ-S and ARM7EJ-S core names. Support for this state is required starting in ARMv6 (except for the ARMv7-M profile), though newer cores only include a trivial implementation that provides no hardware acceleration.

Thumb

[edit]

To improve compiled code density, processors since the ARM7TDMI (released in 1994[124]) have featured theThumbcompressed instruction set, which have their own state. (The "T" in "TDMI" indicates the Thumb feature.) When in this state, the processor executes the Thumb instruction set, a compact 16-bit encoding for a subset of the ARM instruction set.[125] Most of the Thumb instructions are directly mapped to normal ARM instructions. The space saving comes from making some of the instruction operands implicit and limiting the number of possibilities compared to the ARM instructions executed in the ARM instruction set state.

In Thumb, the 16-bit opcodes have less functionality. For example, only branches can be conditional, and many opcodes are restricted to accessing only half of all of the CPU's general-purpose registers. The shorter opcodes give improved code density overall, even though some operations require extra instructions. In situations where the memory port or bus width is constrained to less than 32 bits, the shorter Thumb opcodes allow increased performance compared with 32-bit ARM code, as less program code may need to be loaded into the processor over the constrained memory bandwidth.

Unlike processor architectures with variable length (16- or 32-bit) instructions, such as the Cray-1 andHitachiSuperH, the ARM and Thumb instruction sets exist independently of each other. Embedded hardware, such as theGame Boy Advance, typically have a small amount of RAM accessible with a full 32-bit datapath; the majority is accessed via a 16-bit or narrower secondary datapath. In this situation, it usually makes sense to compile Thumb code and hand-optimise a few of the most CPU-intensive sections using full 32-bit ARM instructions, placing these wider instructions into the 32-bit bus accessible memory.

The first processor with a Thumbinstruction decoder was the ARM7TDMI. All processors supporting 32-bit instruction sets, starting with ARM9, and including XScale, have included a Thumb instruction decoder. It includes instructions adopted from the HitachiSuperH (1992), which was licensed by ARM.[126] ARM's smallest processor families (Cortex M0 and M1) implement only the 16-bit Thumb instruction set for maximum performance in lowest cost applications. ARM processors that don't support 32-bit addressing also omit Thumb.

Thumb-2

[edit]

Thumb-2 technology was introduced in theARM1156 core, announced in 2003. Thumb-2 extends the limited 16-bit instruction set of Thumb with additional 32-bit instructions to give the instruction set more breadth, thus producing a variable-length instruction set. A stated aim for Thumb-2 was to achieve code density similar to Thumb with performance similar to the ARM instruction set on 32-bit memory.

Thumb-2 extends the Thumb instruction set with bit-field manipulation, table branches and conditional execution. At the same time, the ARM instruction set was extended to maintain equivalent functionality in both instruction sets. A new "Unified Assembly Language" (UAL) supports generation of either Thumb or ARM instructions from the same source code; versions of Thumb seen on ARMv7 processors are essentially as capable as ARM code (including the ability to write interrupt handlers). This requires a bit of care, and use of a new "IT" (if-then) instruction, which permits up to four successive instructions to execute based on a tested condition, or on its inverse. When compiling into ARM code, this is ignored, but when compiling into Thumb it generates an actual instruction. For example:

; if (r0 == r1)CMPr0,r1ITEEQ; ARM: no code ... Thumb: IT instruction; then r0 = r2;MOVEQr0,r2; ARM: conditional; Thumb: condition via ITE 'T' (then); else r0 = r3;MOVNEr0,r3; ARM: conditional; Thumb: condition via ITE 'E' (else); recall that the Thumb MOV instruction has no bits to encode "EQ" or "NE".

All ARMv7 chips support the Thumb instruction set. All chips in the Cortex-A series that support ARMv7, all Cortex-R series, and all ARM11 series support both "ARM instruction set state" and "Thumb instruction set state", while chips in theCortex-M series support only the Thumb instruction set.[127][128][129]

Thumb Execution Environment (ThumbEE)

[edit]

ThumbEE (erroneously calledThumb-2EE in some ARM documentation), which was marketed as Jazelle RCT[130] (Runtime Compilation Target), was announced in 2005 and deprecated in 2011. It first appeared in theCortex-A8 processor. ThumbEE is a fourth instruction set state, making small changes to the Thumb-2 extended instruction set. These changes make the instruction set particularly suited to code generated at runtime (e.g. byJIT compilation) in managedExecution Environments. ThumbEE is a target for languages such asJava,C#,Perl, andPython, and allowsJIT compilers to output smaller compiled code without reducing performance.[citation needed]

New features provided by ThumbEE include automatic null pointer checks on every load and store instruction, an instruction to perform an array bounds check, and special instructions that call a handler. In addition, because it utilises Thumb-2 technology, ThumbEE provides access to registers r8–r15 (where the Jazelle/DBX Java VM state is held).[131] Handlers are small sections of frequently called code, commonly used to implement high level languages, such as allocating memory for a new object. These changes come from repurposing a handful of opcodes, and knowing the core is in the new ThumbEE state.

On 23 November 2011, Arm deprecated any use of the ThumbEE instruction set,[132] and Armv8 removes support for ThumbEE.

Floating-point (VFP)

[edit]

VFP (Vector Floating Point) technology is afloating-point unit (FPU) coprocessor extension to the ARM architecture[133] (implemented differently in Armv8 – coprocessors not defined there). It provides low-costsingle-precision anddouble-precision floating-point computation fully compliant with theANSI/IEEE Std 754-1985 Standard for Binary Floating-Point Arithmetic. VFP provides floating-point computation suitable for a wide spectrum of applications such as PDAs, smartphones, voice compression and decompression, three-dimensional graphics and digital audio, printers, set-top boxes, and automotive applications. The VFP architecture was intended to support execution of short "vector mode" instructions but these operated on each vector element sequentially and thus did not offer the performance of truesingle instruction, multiple data (SIMD) vector parallelism. This vector mode was therefore removed shortly after its introduction,[134] to be replaced with the much more powerful Advanced SIMD, also namedNeon.

Some devices such as the ARM Cortex-A8 have a cut-downVFPLite module instead of a full VFP module, and require roughly ten times more clock cycles per float operation.[135] Pre-Armv8 architecture implemented floating-point/SIMD with the coprocessor interface. Other floating-point and/or SIMD units found in ARM-based processors using the coprocessor interface includeFPA, FPE,iwMMXt, some of which were implemented in software by trapping but could have been implemented in hardware. They provide some of the same functionality as VFP but are notopcode-compatible with it. FPA10 also providesextended precision, but implements correct rounding (required by IEEE 754) only in single precision.[136]

VFPv1
Obsolete
VFPv2
An optional extension to the ARM instruction set in the ARMv5TE, ARMv5TEJ and ARMv6 architectures. VFPv2 has 16 64-bit FPU registers.
VFPv3 or VFPv3-D32
Implemented on most Cortex-A8 and A9 ARMv7 processors. It is backward-compatible with VFPv2, except that it cannot trap floating-point exceptions. VFPv3 has 32 64-bit FPU registers as standard, adds VCVT instructions to convert between scalar, float and double, adds immediate mode to VMOV such that constants can be loaded into FPU registers.
VFPv3-D16
As above, but with only 16 64-bit FPU registers. Implemented on Cortex-R4 and R5 processors and theTegra 2 (Cortex-A9).
VFPv3-F16
Uncommon; it supportsIEEE754-2008 half-precision (16-bit) floating point as a storage format.
VFPv4 or VFPv4-D32
Implemented on Cortex-A12 and A15 ARMv7 processors, Cortex-A7 optionally has VFPv4-D32 in the case of an FPU with Neon.[137] VFPv4 has 32 64-bit FPU registers as standard, adds both half-precision support as a storage format andfused multiply-accumulate instructions to the features of VFPv3.
VFPv4-D16
As above, but it has only 16 64-bit FPU registers. Implemented on Cortex-A5 and A7 processors in the case of an FPU without Neon.[137]
VFPv5-D16-M
Implemented on Cortex-M7 when single and double-precision floating-point core option exists.

InDebianLinux and derivatives such asUbuntu andLinux Mint,armhf (ARM hard float) refers to the ARMv7 architecture including the additional VFP3-D16 floating-point hardware extension (and Thumb-2) above. Software packages and cross-compiler tools use the armhf vs. arm/armel suffixes to differentiate.[138]

Advanced SIMD (Neon)

[edit]

TheAdvanced SIMD extension (also known asNeon or "MPE" Media Processing Engine) is a combined 64- and128-bit SIMD instruction set that provides standardised acceleration for media and signal processing applications. Neon is included in all Cortex-A8 devices, but is optional in Cortex-A9 devices.[139] Neon can execute MP3 audio decoding on CPUs running at 10 MHz, and can run theGSMadaptive multi-rate (AMR) speech codec at 13 MHz. It features a comprehensive instruction set, separate register files, and independent execution hardware.[140] Neon supports 8-, 16-, 32-, and 64-bit integer and single-precision (32-bit) floating-point data and SIMD operations for handling audio and video processing as well as graphics and gaming processing. In Neon, the SIMD supports up to 16 operations at the same time. The Neon hardware shares the same floating-point registers as used in VFP. Devices such as the ARM Cortex-A8 and Cortex-A9 support 128-bit vectors, but will execute with 64 bits at a time,[135] whereas newer Cortex-A15 devices can execute 128 bits at a time.[141][142]

A quirk of Neon in Armv7 devices is that it flushes allsubnormal numbers to zero, and as a result theGCC compiler will not use it unless-funsafe-math-optimizations, which allows losing denormals, is turned on. "Enhanced" Neon defined since Armv8 does not have this quirk, but as ofGCC 8.2 the same flag is still required to enable Neon instructions.[143] On the other hand, GCC does consider Neon safe on AArch64 for Armv8.

ProjectNe10 is ARM's first open-source project (from its inception; while they acquired an older project, now namedMbed TLS). The Ne10 library is a set of common, useful functions written in both Neon and C (for compatibility). The library was created to allow developers to use Neon optimisations without learning Neon, but it also serves as a set of highly optimised Neon intrinsic and assembly code examples for common DSP, arithmetic, and image processing routines. The source code is available on GitHub.[144]

ARM Helium technology

[edit]

Helium is the M-Profile Vector Extension (MVE). It adds more than 150 scalar and vector instructions.[145]

Security extensions

[edit]

TrustZone (for Cortex-A profile)

[edit]

The Security Extensions, marketed as TrustZone Technology, is in ARMv6KZ and later application profile architectures. It provides a low-cost alternative to adding another dedicated security core to an SoC, by providing two virtual processors backed by hardware based access control. This lets the application core switch between two states, referred to asworlds (to reduce confusion with other names for capability domains), to prevent information leaking from the more trusted world to the less trusted world. This world switch is generally orthogonal to all other capabilities of the processor, thus each world can operate independently of the other while using the same core. Memory and peripherals are then made aware of the operating world of the core and may use this to provide access control to secrets and code on the device.[146]

Typically, a rich operating system is run in the less trusted world, with smaller security-specialised code in the more trusted world, aiming to reduce theattack surface. Typical applications includeDRM functionality for controlling the use of media on ARM-based devices,[147] and preventing any unapproved use of the device.

In practice, since the specific implementation details of proprietary TrustZone implementations have not been publicly disclosed for review, it is unclear what level of assurance is provided for a giventhreat model, but they are not immune from attack.[148][149]

Open Virtualization[150] is an open source implementation of the trusted world architecture for TrustZone.

AMD has licensed and incorporated TrustZone technology into itsSecure Processor Technology.[151] AMD'sAPUs include a Cortex-A5 processor for handling secure processing, which is enabled in some, but not all products.[152][153][154] In fact, the Cortex-A5 TrustZone core had been included in earlier AMD products, but was not enabled due to time constraints.[153]

Samsung Knox uses TrustZone for purposes such as detecting modifications to the kernel, storing certificates and attestating keys.[155]

TrustZone for Armv8-M (for Cortex-M profile)

[edit]

The Security Extension, marketed as TrustZone for Armv8-M Technology, was introduced in the Armv8-M architecture. While containing similar concepts to TrustZone for Armv8-A, it has a different architectural design, as world switching is performed using branch instructions instead of using exceptions. It also supports safe interleaved interrupt handling from either world regardless of the current security state. Together these features provide low latency calls to the secure world and responsive interrupt handling. ARM provides a reference stack of secure world code in the form of Trusted Firmware for M andPSA Certified.

No-execute page protection

[edit]

As of ARMv6, the ARM architecture supportsno-execute page protection, which is referred to asXN, foreXecute Never.[156]

Large Physical Address Extension (LPAE)

[edit]

The Large Physical Address Extension (LPAE), which extends the physical address size from 32 bits to 40 bits, was added to the Armv7-A architecture in 2011.[157]

The physical address size may be even larger in processors based on the 64-bit (Armv8-A) architecture. For example, it is 44 bits in Cortex-A75 and Cortex-A65AE.[158]

Armv8-R and Armv8-M

[edit]

TheArmv8-R andArmv8-M architectures, announced after the Armv8-A architecture, share some features with Armv8-A. However, Armv8-M does not include any 64-bit AArch64 instructions, and Armv8-R originally did not include any AArch64 instructions; those instructions were added toArmv8-R later.

Armv8.1-M

[edit]

The Armv8.1-M architecture, announced in February 2019, is an enhancement of the Armv8-M architecture. It brings new features including:

  • A new vector instruction set extension. The M-Profile Vector Extension (MVE), or Helium, is for signal processing and machine learning applications.
  • Additional instruction set enhancements for loops and branches (Low Overhead Branch Extension).
  • Instructions forhalf-precision floating-point support.
  • Instruction set enhancement for TrustZone management for Floating Point Unit (FPU).
  • New memory attribute in the Memory Protection Unit (MPU).
  • Enhancements in debug including Performance Monitoring Unit (PMU), Unprivileged Debug Extension, and additional debug support focus on signal processing application developments.
  • Reliability, Availability and Serviceability (RAS) extension.

64/32-bit architecture

[edit]
Main article:AArch64
Armv8-A Platform with Cortex A57/A53 MPCore big.LITTLE CPU chip

Armv8

[edit]

Armv8-A

[edit]
See also:Comparison of Armv8-A processors

Announced in October 2011,[13]Armv8-A (often called ARMv8 while the Armv8-R is also available) represents a fundamental change to the ARM architecture. It supports twoExecution states: a 64-bit state namedAArch64 and a 32-bit state namedAArch32. In the AArch64 state, a new 64-bitA64 instruction set is supported; in the AArch32 state, two instruction sets are supported: the original 32-bit instruction set, namedA32, and the 32-bit Thumb-2 instruction set, namedT32. AArch32 providesuser-space compatibility with Armv7-A. The processor state can change on an Exception level change; this allows 32-bit applications to be executed in AArch32 state under a 64-bit OS whose kernel executes in AArch64 state, and allows a 32-bit OS to run in AArch32 state under the control of a 64-bithypervisor running in AArch64 state.[1] ARM announced their Cortex-A53 and Cortex-A57 cores on 30 October 2012.[73] Apple was the first to release an Armv8-A compatible core in a consumer product (Apple A7 iniPhone 5S).AppliedMicro, using anFPGA, was the first to demo Armv8-A.[159] The first Armv8-ASoC fromSamsung is the Exynos 5433 used in theGalaxy Note 4, which features two clusters of four Cortex-A57 and Cortex-A53 cores in abig.LITTLE configuration; but it will run only in AArch32 mode.[160]

To both AArch32 and AArch64, Armv8-A makes VFPv3/v4 and advanced SIMD (Neon) standard. It also adds cryptography instructions supportingAES,SHA-1/SHA-256 andfinite field arithmetic.[161] AArch64 was introduced in Armv8-A and its subsequent revision. AArch64 is not included in the 32-bit Armv8-R and Armv8-M architectures.

An ARMv8-A processor can support one or both of AArch32 and AArch64; it may support AArch32 and AArch64 at lower Exception levels and only AArch64 at higher Exception levels.[162] For example, the ARM Cortex-A32 supports only AArch32,[163] theARM Cortex-A34 supports only AArch64,[164] and theARM Cortex-A72 supports both AArch64 and AArch32.[165] An ARMv9-A processor must support AArch64 at all Exception levels, and may support AArch32 at EL0.[162]

Armv8-R

[edit]

Optional AArch64 support was added to the Armv8-R profile, with the first ARM core implementing it being the Cortex-R82.[166] It adds the A64 instruction set.

Armv9

[edit]

Armv9-A

[edit]

Announced in March 2021, the updated architecture places a focus on secure execution andcompartmentalisation.[167][168]

Arm SystemReady

[edit]

Arm SystemReady is a compliance program that helps ensure the interoperability of an operating system on Arm-based hardware from datacenter servers to industrial edge and IoT devices. The key building blocks of the program are the specifications for minimum hardware and firmware requirements that the operating systems and hypervisors can rely upon. These specifications are:[169]

  • Base System Architecture (BSA)[170] and the market segment specific supplements (e.g., Server BSA supplement)[171]
  • Base Boot Requirements (BBR)[172] and Base Boot Security Requirements (BBSR)[173]

These specifications are co-developed byArm and its partners in the System Architecture Advisory Committee (SystemArchAC).

Architecture Compliance Suite (ACS) is the test tools that help to check the compliance of these specifications. The Arm SystemReady Requirements Specification documents the requirements of the certifications.[174]

This program was introduced byArm in 2020 at the firstDevSummit event. Its predecessor Arm ServerReady was introduced in 2018 at the Arm TechCon event. This program currently includes two bands:

  • SystemReady Band: this band focuses on operating system interoperability for Advanced Configuration and Power InterfaceACPI environments, where generic operating systems can be installed on either new or old hardware without modification. This band is relevant for systems usingWindows,Linux,VMware, andBSD environments.[175]
  • SystemReady Devicetree Band: this band optimizes install and boot for embedded systems wheredevicetree is the preferred method of describing hardware, with a focus on forward compatibility. This applies toLinux distributions andBSD environments specifically.[176]

PSA Certified

[edit]

PSA Certified, formerly named Platform Security Architecture, is an architecture-agnostic security framework and evaluation scheme. It is intended to help secureInternet of things (IoT) devices built on system-on-a-chip (SoC) processors.[177] It was introduced to increase security where a fulltrusted execution environment is too large or complex.[178]

The architecture was introduced byArm in 2017 at the annualTechCon event.[178][179] Although the scheme is architecture agnostic, it was first implemented on Arm Cortex-M processor cores intended for microcontroller use. PSA Certified includes freely available threat models and security analyses that demonstrate the process for deciding on security features in common IoT products.[180] It also provides freely downloadable application programming interface (API) packages, architectural specifications, open-source firmware implementations, and related test suites.[181]

Following the development of the architecture security framework in 2017, thePSA Certified assurance scheme launched two years later at Embedded World in 2019.[182] PSA Certified offers a multi-level security evaluation scheme for chip vendors, OS providers and IoT device makers.[183] The Embedded World presentation introduced chip vendors to Level 1 Certification. A draft of Level 2 protection was presented at the same time.[184] Level 2 certification became a usable standard in February 2020.[185]

The certification was created by PSA Joint Stakeholders to enable a security-by-design approach for a diverse set of IoT products. PSA Certified specifications are implementation and architecture agnostic, as a result they can be applied to any chip, software or device.[186][184] The certification also removes industry fragmentation forIoT product manufacturers and developers.[187]

Operating system support

[edit]

32-bit operating systems

[edit]

Historical operating systems

[edit]

The first 32-bit ARM-based personal computer, theAcorn Archimedes, was originally intended to run an ambitious operating system calledARX. The machines shipped withRISC OS, which was also used on later ARM-based systems from Acorn and other vendors. Some early Acorn machines were also able to run aUnix port calledRISC iX. (Neither is to be confused withRISC/os, a contemporary Unix variant for the MIPS architecture.)

Embedded operating systems

[edit]

The 32-bit ARM architecture is supported by a large number ofembedded andreal-time operating systems, including:

Mobile device operating systems

[edit]

As of March 2024, the 32-bit ARM architecture used to be the primary hardware environment for most mobile device operating systems such as the following but many of these platforms such as Android and Apple iOS have evolved to the 64-bit ARM architecture:

Formerly, but now discontinued:

Desktop and server operating systems

[edit]

The 32-bit ARM architecture is supported by RISC OS and by multipleUnix-like operating systems including:

64-bit operating systems

[edit]

Embedded operating systems

[edit]

Mobile device operating systems

[edit]

Desktop and server operating systems

[edit]

Porting to 32- or 64-bit ARM operating systems

[edit]

Windows applications recompiled for ARM and linked with Winelib, from theWine project, can run on 32-bit or 64-bit ARM in Linux, FreeBSD, or other compatible operating systems.[216][217] x86 binaries, e.g. when not specially compiled for ARM, have been demonstrated on ARM usingQEMU with Wine (on Linux and more),[citation needed] but do not work at full speed or same capability as with Winelib.

Notes

[edit]
  1. ^Using 32-bit words, 4 Mbit/s corresponds to 1MIPS.
  2. ^Available references do not mention which design team this was, but given the timing and known history of designs of the era, it is likely this was the National Semiconductor team whose NS32016 suffered from a large number of bugs.
  3. ^Matt Evans notes that it appears the faster versions were simply binned higher, and appear to have no underlying changes.[35]

See also

[edit]

References

[edit]

Citations

[edit]
  1. ^abcdefGrisenthwaite, Richard (2011)."ARMv8-A Technology Preview"(PDF). Archived fromthe original(PDF) on 11 November 2011. Retrieved31 October 2011.
  2. ^"6.1.2.1 VFP register usage conventions".Procedure Call Standard for the ARM Architecture.Arm Holdings. 6 October 2023. Retrieved22 August 2024.
  3. ^abWilson, Roger (2 November 1988)."Some facts about the Acorn RISC Machine".Newsgroupcomp.arch. Retrieved25 May 2007.
  4. ^abHachman, Mark (14 October 2002)."ARM Cores Climb into 3G Territory".ExtremeTech. Retrieved24 May 2018.
  5. ^Turley, Jim (18 December 2002)."The Two Percent Solution".Embedded. Retrieved14 February 2023.
  6. ^Cutress, Ian (22 June 2020)."New #1 Supercomputer: Fujitsu's Fugaku and A64FX take Arm to the Top with 415 PetaFLOPs".anandtech.com. Retrieved25 January 2021.
  7. ^"Arm Partners Have Shipped 200 Billion Chips".Arm (Press release). Retrieved3 November 2021.
  8. ^"Enabling Mass IoT connectivity as ARM partners ship 100 billion chips".community.arm.com. 27 February 2017. Retrieved8 April 2020.the cumulative deployment of 100 billion chips, half of which shipped in the last four years. [..] why not a trillion or more? That is our target, seeing a trillion connected devices deployed over the next two decades.
  9. ^"MCU Market on Migration Path to 32-bit and ARM-based Devices: 32-bit tops in sales; 16-bit leads in unit shipments". IC Insights. 25 April 2013. Retrieved1 July 2014.
  10. ^Turley, Jim (2002)."The Two Percent Solution". embedded.com. Archived fromthe original on 15 February 2023.
  11. ^"Arm Holdings eager for PC and server expansion".The Register. 1 February 2011.
  12. ^McGuire-Balanza, Kerry (11 May 2010)."ARM from zero to billions in 25 short years".Arm Holdings. Retrieved8 November 2012.
  13. ^ab"ARM Discloses Technical Details of the Next Version of the ARM Architecture" (Press release).Arm Holdings. 27 October 2011. Archived fromthe original on 1 January 2019. Retrieved20 September 2013.
  14. ^"Announcing the ARM Neoverse N1 Platform".community.arm.com. 20 February 2019. Retrieved8 April 2020.
  15. ^Fairbairn, Douglas (31 January 2012)."Oral History of Sophie Wilson"(PDF).Archived(PDF) from the original on 3 March 2016. Retrieved2 February 2016.
  16. ^Smith, Tony (30 November 2011)."The BBC Micro turns 30".The Register Hardware.Archived from the original on 12 December 2011. Retrieved12 December 2011.
  17. ^Polsson, Ken."Chronology of Microprocessors".Processortimeline.info.Archived from the original on 9 August 2018. Retrieved27 September 2013.
  18. ^Leedy, Glenn (April 1983)."The National Semiconductor NS16000 Microprocessor Family".Byte. pp. 53–66. Retrieved22 August 2020.
  19. ^Evans 2019, 6:00.
  20. ^Manners, David (29 April 1998)."ARM's way".Electronics Weekly. Archived fromthe original on 29 July 2012. Retrieved26 October 2012.
  21. ^Evans 2019, 5:30.
  22. ^abEvans 2019, 7:45.
  23. ^Evans 2019, 8:30.
  24. ^Sophie Wilson at Alt Party 2009 (Part 3/8).Archived from the original on 11 December 2021.
  25. ^Chisnall, David (23 August 2010).Understanding ARM Architectures. Retrieved26 May 2013.
  26. ^Evans 2019, 9:00.
  27. ^Furber, Stephen B. (2000).ARM system-on-chip architecture. Boston: Addison-Wesley.ISBN 0-201-67519-6.
  28. ^Evans 2019, 9:50.
  29. ^Evans 2019, 23:30.
  30. ^Evans 2019, 26:00.
  31. ^"ARM Instruction Set design history with Sophie Wilson (Part 3)". 10 May 2015.Archived from the original on 11 December 2021. Retrieved25 May 2020 – via YouTube.
  32. ^"Oral History of Sophie Wilson – 2012 Computer History Museum Fellow"(PDF).Computer History Museum. 31 January 2012. Retrieved25 May 2020.
  33. ^Harker, T. (Summer 2009)."ARM gets serious about IP (Second in a two-part series [Associated Editors' View]".IEEE Solid-State Circuits Magazine.1 (3):8–69.doi:10.1109/MSSC.2009.933674.ISSN 1943-0590.S2CID 36567166.
  34. ^Evans 2019, 20:30.
  35. ^Evans 2019, 22:00.
  36. ^Evans 2019, 21:30.
  37. ^Evans 2019, 22:0030.
  38. ^abEvans 2019, 14:00.
  39. ^"From one Arm to the next! ARM Processors and Architectures". Retrieved31 May 2022.
  40. ^Levy, Markus."The History of The ARM Architecture: From Inception to IPO"(PDF). Retrieved18 July 2022.
  41. ^Introducing the Commodore Amiga 3000(PDF). Commodore-Amiga, Inc. 1991.
  42. ^"Computer MIPS and MFLOPS Speed Claims 1980 to 1996".www.roylongbottom.org.uk. Retrieved17 June 2023.
  43. ^Santanu Chattopadhyay (2010).Embedded System Design. PHI Learning Pvt. Ltd. p. 9.ISBN 978-81-203-4024-4.
  44. ^Richard Murray."32 bit operation".
  45. ^"ARM Company Milestones".ARM. Archived fromthe original on 20 April 2015. Retrieved8 April 2015.
  46. ^Andrews, Jason (2005). "3 SoC Verification Topics for the ARM Architecture".Co-verification of hardware and software for ARM SoC design. Oxford, UK:Elsevier. pp. 69.ISBN 0-7506-7730-9.ARM started as a branch of Acorn Computer in Cambridge, England, with the formation of a joint venture between Acorn, Apple and VLSI Technology. A team of twelve employees produced the design of the first ARM microprocessor between 1983 and 1985.
  47. ^Weber, Jonathan (28 November 1990)."Apple to Join Acorn, VLSI in Chip-Making Venture".Los Angeles Times. Los Angeles. Retrieved6 February 2012.Apple has invested about $3 million (roughly 1.5 million pounds) for a 30% interest in the company, dubbed Advanced Risc Machines Ltd. (ARM) [...]
  48. ^"ARM Corporate Backgrounder"(PDF).ARM. Archived fromthe original(PDF) on 4 October 2006.
  49. ^Montanaro, James; et al. (1997)."A 160-MHz, 32-b, 0.5-W CMOS RISC Microprocessor"(PDF).Digital Technical Journal.9 (1):49–62.
  50. ^DeMone, Paul (9 November 2000)."ARM's Race to Embedded World Domination".Real World Technologies. Retrieved6 October 2015.
  51. ^"March of the Machines".technologyreview.com.MIT Technology Review. 20 April 2010. Archived fromthe original on 16 October 2015. Retrieved6 October 2015.
  52. ^Krazit, Tom (3 April 2006)."ARMed for the living room". CNET.
  53. ^abFitzpatrick, J. (2011)."An Interview with Steve Furber".Communications of the ACM.54 (5):34–39.doi:10.1145/1941487.1941501.
  54. ^Tracy Robinson (12 February 2014)."Celebrating 50 Billion shipped ARM-powered Chips".
  55. ^Sarah Murry (3 March 2014)."ARM's Reach: 50 Billion Chip Milestone". Archived fromthe original on 16 September 2015.
  56. ^Brown, Eric (2009)."ARM netbook ships with detachable tablet". Archived fromthe original on 3 January 2013. Retrieved19 August 2009.
  57. ^Peter Clarke (7 January 2016)."Amazon Now Sells Own ARM chips".
  58. ^"MACOM Successfully Completes Acquisition of AppliedMicro" (Press release). 26 January 2017.
  59. ^Frumusanu, Andrei."ARM Details Built on ARM Cortex Technology License".AnandTech. Retrieved26 May 2019.
  60. ^Cutress, Ian."ARM Flexible Access: Design the SoC Before Spending Money".AnandTech. Retrieved9 October 2019.
  61. ^"ARM Flexible Access Frequently Asked Questions".ARM. Retrieved9 October 2019.
  62. ^Nolting, Stephan."STORM CORE Processor System"(PDF).OpenCores. Retrieved1 April 2014.
  63. ^ZAP onGitHub
  64. ^"Cortex-M23 Processor". ARM. Retrieved27 October 2016.
  65. ^"Cortex-M33 Processor". ARM. Retrieved27 October 2016.
  66. ^"ARMv8-M Architecture Simplifies Security for Smart Embedded". ARM. Retrieved10 November 2015.
  67. ^Ltd, Arm."M-Profile Architectures".Arm | The Architecture for the Digital World. Retrieved29 August 2023.
  68. ^"ARMv8-R Architecture". Retrieved10 July 2015.
  69. ^Craske, Simon (October 2013)."ARM Cortex-R Architecture"(PDF). Arm Holdings. Archived fromthe original(PDF) on 6 April 2014. Retrieved1 February 2014.
  70. ^Smith, Ryan (20 September 2016)."ARM Announces Cortex-R52 CPU: Deterministic & Safe, for ADAS & More".AnandTech. Retrieved20 September 2016.
  71. ^"Cortex-A32 Processor". ARM. Retrieved10 October 2019.
  72. ^"Cortex-A35 Processor". ARM. Retrieved10 November 2015.
  73. ^ab"ARM Launches Cortex-A50 Series, the World's Most Energy-Efficient 64-bit Processors" (Press release).Arm Holdings. Retrieved31 October 2012.
  74. ^"Cortex-A72 Processor". ARM. Retrieved10 July 2015.
  75. ^"Cortex-A73 Processor". ARM. Retrieved2 June 2016.
  76. ^"ARMv8-A Architecture". Retrieved10 July 2015.
  77. ^"Cavium Thunder X ups the ARM core count to 48 on a single chip".SemiAccurate. 3 June 2014.
  78. ^"Cavium at Supercomputing 2014".Yahoo Finance. 17 November 2014. Archived fromthe original on 16 October 2015. Retrieved15 January 2017.
  79. ^Burt, Jeff (17 November 2014)."Cray to Evaluate ARM Chips in Its Supercomputers".eWeek.
  80. ^"Samsung Announces Exynos 8890 with Cat.12/13 Modem and Custom CPU".AnandTech.
  81. ^"Cortex-A34 Processor". ARM. Retrieved10 October 2019.
  82. ^"D21500 [AARCH64] Add support for Broadcom Vulcan".reviews.llvm.org.
  83. ^"Cortex-A55 Processor". ARM. Retrieved29 May 2017.
  84. ^"Cortex-A75 Processor". ARM. Retrieved29 May 2017.
  85. ^"Cortex-A76 Processor". ARM. Retrieved11 October 2018.
  86. ^Berenice Mann (April 2017)."ARM Architecture – ARMv8.2-A evolution and delivery".community.ARM.com.
  87. ^Frumusanu, Andrei."Samsung Announces the Exynos 9825 SoC: First 7nm EUV Silicon Chip".AnandTech. Retrieved11 October 2019.
  88. ^"Fujitsu began to produce Japan's billions of super-calculations with the strongest ARM processor A64FX".China IT News. Archived fromthe original on 20 June 2019. Retrieved17 August 2019.ARMv8 SVE (Scalable Vector Extension) chip, which uses 512bit floating point.
  89. ^"Cortex-A65AE – ARM".ARM. Retrieved8 April 2020.can execute two-threads in parallel on each cycle. Each thread can be at different exception levels and run different operating systems.
  90. ^Frumusanu, Andrei."Marvell Announces ThunderX3: 96 Cores & 384 Thread 3rd Gen ARM Server Processor".AnandTech. Retrieved26 May 2020.
  91. ^"AArch64: add support for newer Apple CPUs · apple/llvm-project@677da09".GitHub. Retrieved23 September 2022.
  92. ^"New features for the Armv8-A architecture - Architectures and Processors blog - Arm Community blogs - Arm Community".community.arm.com. 21 September 2020. Retrieved28 December 2021.
  93. ^"Arm's solution to the future needs of AI, security and specialized computing is v9".Arm. Retrieved16 August 2021.
  94. ^"First Armv9 Cortex CPUs for Consumer Compute".community.arm.com. 25 May 2021. Retrieved16 August 2021.
  95. ^"Documentation – Arm Developer".developer.arm.com. Retrieved3 October 2024.
  96. ^"Documentation – Arm Developer".developer.arm.com. Retrieved3 October 2024.
  97. ^"Apple M4 Support Added To The LLVM Compiler, Confirming Its ISA Capabilities".www.phoronix.com. Retrieved15 June 2024.
  98. ^"Arm A-Profile Architecture Developments 2021 - Architectures and Processors blog - Arm Community blogs - Arm Community".community.arm.com. 8 September 2021. Retrieved25 September 2023.
  99. ^"Arm A-Profile Architecture Developments 2022 - Architectures and Processors blog - Arm Community blogs - Arm Community".community.arm.com. 29 September 2022. Retrieved25 September 2023.
  100. ^"Arm A-Profile Architecture Developments 2023 - Architectures and Processors blog - Arm Community blogs - Arm Community".community.arm.com. 29 September 2022. Retrieved11 October 2024.
  101. ^"Arm A-Profile Architecture Developments 2024 - Architectures and Processors blog - Arm Community blogs - Arm Community".community.arm.com. 29 September 2022. Retrieved11 October 2024.
  102. ^"Line Card"(PDF). 2003. Retrieved1 October 2012.
  103. ^Parrish, Kevin (14 July 2011)."One Million ARM Cores Linked to Simulate Brain".EE Times. Retrieved2 August 2011.
  104. ^"Processor mode".Arm Holdings. Retrieved26 March 2013.
  105. ^"KVM/ARM"(PDF). Retrieved14 February 2023.
  106. ^Brash, David (August 2010).Extensions to the ARMv7-A Architecture. 2010 IEEE Hot Chips 22 Symposium (HCS). pp. 1–21.doi:10.1109/HOTCHIPS.2010.7480070.ISBN 978-1-4673-8875-7.S2CID 46339775.
  107. ^"How does the ARM Compiler support unaligned accesses?". 2011. Retrieved5 October 2013.
  108. ^"Unaligned data access". Retrieved5 October 2013.
  109. ^"Cortex-M0 r0p0 Technical Reference Manual"(PDF).Arm.
  110. ^"ARMv7-M Architecture Reference Manual". Arm. Retrieved18 July 2022.
  111. ^ab"ARMv7-A and ARMv7-R Architecture Reference Manual; Arm Holdings". arm.com. Retrieved19 January 2013.
  112. ^"ARM Information Center". Retrieved10 July 2015.
  113. ^"Condition Codes 1: Condition flags and codes".ARM Community. 11 September 2013. Retrieved26 September 2019.
  114. ^"9.1.2. Instruction cycle counts".
  115. ^"CoreSight Components: About the Debug Access Port".
  116. ^"The Cortex-M3: Debug Access Port (DAP)".
  117. ^Anderson, Mike."Understanding ARM HW Debug Options"(PDF).
  118. ^"CMSIS-DAP Debugger User's Guide".
  119. ^"CMSIS-DAP".
  120. ^"ARM DSP Instruction Set Extensions".arm.com.Archived from the original on 14 April 2009. Retrieved18 April 2009.
  121. ^abClarke, Peter (3 May 1999)."EPF: ARC, ARM add DSP extensions to their RISC cores".EE Times. Retrieved15 March 2024.
  122. ^Turley, Jim (18 November 1996)."ARM Tunes Piccolo for DSP Performance"(PDF).Microprocessor Report. Retrieved15 March 2024.
  123. ^"DSP & SIMD". Retrieved10 July 2015.
  124. ^"ARM7TDMI Technical Reference Manual"(PDF). p. ii.
  125. ^Jaggar, Dave (1996).ARM Architecture Reference Manual. Prentice Hall. pp. 6–1.ISBN 978-0-13-736299-8.
  126. ^Nathan Willis (10 June 2015)."Resurrecting the SuperH architecture".LWN.net.
  127. ^"ARM Processor Instruction Set Architecture". ARM.com.Archived from the original on 15 April 2009. Retrieved18 April 2009.
  128. ^"ARM aims son of Thumb at uCs, ASSPs, SoCs". Linuxdevices.com. Archived fromthe original on 9 December 2012. Retrieved18 April 2009.
  129. ^"ARM Information Center". Infocenter.arm.com. Retrieved18 April 2009.
  130. ^"Jazelle". ARM Ltd. Archived fromthe original on 2 June 2017.
  131. ^Tom R. Halfhill (2005)."ARM strengthens Java compilers: New 16-Bit Thumb-2EE Instructions Conserve System Memory"(PDF). Archived fromthe original(PDF) on 5 October 2007.
  132. ^ARM Architecture Reference Manual, Armv7-A and Armv7-R edition, issue C.b, Section A2.10, 25 July 2012.
  133. ^"ARM Compiler toolchain Using the Assembler – VFP coprocessor". ARM.com. Retrieved20 August 2014.
  134. ^"VFP directives and vector notation". ARM.com. Retrieved21 November 2011.
  135. ^ab"Differences between ARM Cortex-A8 and Cortex-A9". Shervin Emami. Retrieved21 November 2011.
  136. ^"FPA10 Data Sheet"(PDF).chrisacorns.computinghistory.org.uk. GEC Plessey Semiconductors. 11 June 1993. Retrieved26 November 2020.In relation to IEEE 754-1985, the FPA achieves conformance in single-precision arithmetic [...] Occasionally, double- and extended-precision multiplications may be produced with an error of 1 or 2 units in the least significant place of the mantissa.
  137. ^ab"Cortex-A7 MPCore Technical Reference Manual – 1.3 Features". ARM. Retrieved11 July 2014.
  138. ^"ArmHardFloatPort – Debian Wiki". Wiki.debian.org. 20 August 2012. Retrieved8 January 2014.
  139. ^"Cortex-A9 Processor".arm.com. Retrieved21 November 2011.
  140. ^"About the Cortex-A9 NEON MPE".arm.com. Retrieved21 November 2011.
  141. ^"US20050125476A1".
  142. ^"US20080141004A1".
  143. ^"ARM Options".GNU Compiler Collection Manual. Retrieved20 September 2019.
  144. ^Ne10: An open optimized software library project for the ARM Architecture onGitHub
  145. ^Joseph Yiu."Introduction to ARMv8.1-M architecture"(PDF). Retrieved18 July 2022.
  146. ^"Genode – An Exploration of ARM TrustZone Technology". Retrieved10 July 2015.
  147. ^"ARM Announces Availability of Mobile Consumer DRM Software Solutions Based on ARM TrustZone Technology" (Press release). News.thomasnet.com. Retrieved18 April 2009.
  148. ^Laginimaineb (8 October 2015)."Bits, Please!: Full TrustZone exploit for MSM8974".Bits, Please!. Retrieved3 May 2016.
  149. ^Di Shen."Attacking your 'Trusted Core' Exploiting TrustZone on Android"(PDF).Black Hat Briefings. Retrieved3 May 2016.
  150. ^"ARM TrustZone and ARM Hypervisor Open Source Software". Open Virtualization. Archived fromthe original on 14 June 2013. Retrieved14 June 2013.
  151. ^"AMD Secure Technology".AMD. Archived fromthe original on 23 July 2016. Retrieved6 July 2016.
  152. ^Smith, Ryan (13 June 2012)."AMD 2013 APUs to include ARM Cortex A5 Processor for Trustzone Capabilities".AnandTech. Retrieved6 July 2016.
  153. ^abShimpi, Anand Lal (29 April 2014)."AMD Beema Mullins Architecture A10 micro 6700T Performance Preview".AnandTech. Retrieved6 July 2016.
  154. ^Walton, Jarred (4 June 2014)."AMD Launches Mobile Kaveri APUs".AnandTech. Retrieved6 July 2016.
  155. ^"Root of Trust" (white paper).Samsung Electronics. April 2016.
  156. ^"ARM Architecture Reference Manual"(PDF). p. B4-8. Archived fromthe original(PDF) on 6 February 2009.APX and XN (execute never) bits have been added in VMSAv6 [Virtual Memory System Architecture]
  157. ^ARM Architecture Reference Manual, ARMv7-A and ARMv7-R edition. ARM Limited.
  158. ^"Cortex-A65AE".ARM Developer. Retrieved26 April 2019.
  159. ^"AppliedMicro Showcases World's First 64-bit ARM v8 Core" (Press release). AppliedMicro. 28 October 2011. Retrieved11 February 2014.
  160. ^"Samsung's Exynos 5433 is an A57/A53 ARM SoC".AnandTech. Retrieved17 September 2014.
  161. ^"ARM Cortex-A53 MPCore Processor Technical Reference Manual: Cryptography Extension". ARM. Retrieved11 September 2016.
  162. ^ab"Impact of implemented Exception levels".Learn the architecture - AArch64 Exception Model. Arm.
  163. ^"Cortex-A32".Arm Developer.
  164. ^"Cortex-A34".Arm Developer.
  165. ^"Cortex-A72".Arm Developer.
  166. ^Frumusanu, Andrei (3 September 2020)."ARM Announced Cortex-R82: First 64-bit Real Time Processor".AnandTech.
  167. ^Frumusanu, Andrei (30 March 2021)."Arm Announces Armv9 Architecture: SVE2, Security, and the Next Decade".AnandTech.
  168. ^Harrod, Alex (30 March 2021)."Arm's Solution to the Future Needs of AI, Security and Specialized Computing is v9" (Press release).Arm Holdings.
  169. ^"SystemReady Compliance Program".Arm.
  170. ^"Arm Base System Architecture".Arm.
  171. ^"Arm Server Base System Architecture".Arm.
  172. ^"Arm Base Boot Requirements".Arm.
  173. ^"Base Boot Security Requirements".Arm.
  174. ^"Arm SystemReady Requirements Specification".Arm.
  175. ^"Arm SystemReady Band".Arm.
  176. ^"Arm SystemReady Devicetree Band".Arm.
  177. ^Osborne, Charlie."ARM announces PSA security architecture for IoT devices". ZDNet.
  178. ^abWong, William (25 October 2017)."ARM's Platform Security Architecture Targets Cortex-M".Electronic Design.Archived from the original on 8 May 2019.
  179. ^Hoffenberg, Steve (31 October 2017)."ARM: Security Isn't Just a Technological Imperative, It's a Social Responsibility".VDC Research.Archived from the original on 28 September 2023.
  180. ^Armasu, Lucian (22 February 2018)."ARM Reveals More Details About Its IoT Platform Security Architecture".Tom's Hardware.
  181. ^Williams, Chris."ARM PSA IoT API? BRB... Toolbox of tech to secure net-connected kit opens up some more".The Register.
  182. ^Hayes, Caroline (25 February 2019)."Embedded World: Arm introduces fourth security element to PSA".Electronics Weekly.
  183. ^"PSA Certified: building trust in IoT".PSA Certified.
  184. ^ab"PSA Certified–building trust, building value".EE Times. 4 March 2019.
  185. ^"The $6trn importance of security standards and regulation in the IoT era". IoT Now. 16 March 2020.
  186. ^McGregor, Jim (4 March 2019)."Arm Introduces Security Certification Testing For IoT".Forbes.
  187. ^Speed, Richard (26 February 2019)."Azure IoT heads spaceward to maintain connectivity at the edge, courtesy of Inmarsat".TheRegister.
  188. ^"OS-9 Specifications".Microware.
  189. ^ab"Pharos".SourceForge. Retrieved24 May 2018.
  190. ^"PikeOS Safe and Secure Virtualization". Retrieved10 July 2013.
  191. ^ab"Safety Certified Real-Time Operating Systems – Supported CPUs".
  192. ^"ARM Platform Port". opensolaris.org. Archived fromthe original on 2 December 2012. Retrieved29 December 2012.
  193. ^"Green Hills Software's INTEGRITY-based Multivisor Delivers Embedded Industry's First 64-bit Secure Virtualization Solution".ghs.com. Retrieved14 March 2018.
  194. ^"Enea OSE real-time operating system for 5G and LTE-A | Enea".enea.com. Archived fromthe original on 1 January 2019. Retrieved17 April 2018.
  195. ^"Supported Platforms".docs.sel4.systems. Retrieved23 November 2018.
  196. ^"QNX Software Development Platform (SDP 7.0) | BlackBerry QNX".blackberry.qnx.com. Retrieved27 July 2020.
  197. ^"Wind River Releases 64-Bit VxWorks RTOS" (Press release).Wind River Systems. 28 February 2011. Retrieved24 October 2023.
  198. ^"Manjaro-ARM".Manjaro wiki. 20 June 2022.
  199. ^Linus Torvalds (1 October 2012)."Re: [GIT PULL] arm64: Linux kernel port".Linux kernel mailing list (Mailing list). Retrieved2 May 2019.
  200. ^Larabel, Michael (27 February 2013)."64-bit ARM Version of Ubuntu/Debian Is Booting".Phoronix. Retrieved17 August 2014.
  201. ^"Debian Project News – August 14th, 2014".Debian. 14 August 2014. Retrieved17 August 2014.
  202. ^"Ubuntu Server for ARM".ubuntu.com.
  203. ^"Architectures/AArch64". Retrieved16 January 2015.
  204. ^"Portal:ARM/AArch64". Retrieved16 January 2015.
  205. ^"SUSE Linux Enterprise 12 SP2 Release Notes". Retrieved11 November 2016.
  206. ^"Red Hat introduces ARM server support for Red Hat Enterprise Linux".redhat.com. Retrieved18 January 2019.
  207. ^"64-bit ARM architecture project update". The FreeBSD Foundation. 24 November 2014.
  208. ^"OpenBSD/arm64". Retrieved25 September 2023.
  209. ^"NetBSD/arm64". Retrieved5 August 2018.
  210. ^"HP, Asus announce first Windows 10 ARM PCs: 20-hour battery life, gigabit LTE".Ars Technica. Retrieved22 January 2018.This new version of Windows 10 is Microsoft's first 64-bit ARM operating system. It'll run x86 and 32-bit ARM applications from the Store, and in due course, 64-bit ARM applications. However, Microsoft hasn't yet finalised its 64-bit ARM SDK. Many pieces are in place (there's a 64-bit ARM compiler, for example), but the company isn't yet taking 64-bit ARM applications submitted to the Store, and there aren't any 64-bit ARM desktop applications either.
  211. ^Hassan, Mehedi (10 December 2016)."Windows 10 on ARM64 gets its first compiled apps".MSPoweruser.
  212. ^Filippidis, Katrina (1 June 2018)."VLC becomes one of first ARM64 Windows apps".Engadget.
  213. ^Sweetgall, Marc (15 November 2018)."Official support for Windows 10 on ARM development". Windows Developer.Windows Blogs.Microsoft. Retrieved17 December 2019.
  214. ^Gartenberg, Chaim (12 November 2020)."macOS Big Sur is now available to download".The Verge. Retrieved13 November 2020.
  215. ^Clover, Juli (23 June 2020)."Rosetta Won't Support x86 Virtualization Apps Running Windows".MacRumors. Retrieved13 November 2020.
  216. ^"ARM – The Official Wine Wiki". Retrieved10 July 2015.
  217. ^"ARM64 – The Official Wine Wiki". Retrieved10 July 2015.
  218. ^"ARM Security Updates".ARM Developer. Retrieved24 May 2018.

Bibliography

[edit]

Further reading

[edit]
See also:List of books about ARM Cortex-M

External links

[edit]
Wikimedia Commons has media related toARM architecture.

Architecture manuals

[edit]

Quick-reference cards

[edit]

Instructions

[edit]

Opcodes

[edit]
Application ARM-based chips
Application
processors
(32-bit)
ARMv7-A
Cortex-A5
Cortex-A7
Cortex-A8
Cortex-A9
Cortex-A15
Cortex-A17
Others
ARMv7-A
compatible
ARMv8-A
Others
Application
processors
(64-bit)
ARMv8-A
Cortex-A35
Cortex-A53
Cortex-A57
Cortex-A72
Cortex-A73
Others
ARMv8-A
compatible
ARMv8.1-A
ARMv8.1-A
compatible
ARMv8.2-A
Cortex-A55
Cortex-A75
Cortex-A76
Cortex-A77
Cortex-A78
Cortex-X1
Neoverse N1
Others
  • Cortex-A65, Cortex-A65AE, Cortex-A76AE, Cortex-A78C, Cortex-X1C,Neoverse E1
ARMv8.2-A
compatible
ARMv8.3-A
ARMv8.3-A
compatible
ARMv8.4-A
Neoverse V1
ARMv8.4-A
compatible
ARMv8.5-A
ARMv8.5-A
compatible
ARMv8.6-A
ARMv8.6-A
compatible
ARMv8.7-A
ARMv8.7-A
compatible
ARMv9.0-A
Cortex-A510
Cortex-A710
Cortex-A715
Cortex-X2
Cortex-X3
Neoverse N2
Neoverse V2
ARMv9.2-A
Cortex-A520
Cortex-A720
Cortex-A725
Cortex-X4
Cortex-X925
Neoverse N3
-
Neoverse V3
-
ARMv9.2-A
compatible
Embedded ARM-based chips
Embedded
microcontrollers
Cortex-M0
  • CypressPSoC 4000, 4100, 4100M, 4200, 4200DS, 4200L, 4200M
  • InfineonXMC1000
  • Nordic nRF51
  • NXPLPC1100, LPC1200
  • nuvoTon NuMicro
  • Sonix SN32F700
  • STMicroelectronicsSTM32 F0
  • Toshiba TX00
  • Vorago VA108x0
Cortex-M0+
  • Cypress PSoC 4000S, 4100S, 4100S+, 4100PS, 4700S, FM0+
  • Holtek HT32F52000
  • Microchip (Atmel)SAM C2, D0, D1, D2, DA, L2, R2, R3
  • NXPLPC800, LPC11E60, LPC11U60
  • NXP (Freescale) Kinetis E, EA, L, M, V1, W0
  • Raspberry PiRP2040
  • Renesas Synergy S1
  • Silicon Labs (Energy Micro)EFM32 Zero, Happy
  • STMicroelectronicsSTM32 L0
Cortex-M1
  • Altera FPGAs Cyclone-II, Cyclone-III, Stratix-II, Stratix-III
  • Microsemi (Actel) FPGAs Fusion, IGLOO/e, ProASIC3L, ProASIC3/E
  • Xilinx FPGAs Spartan-3, Virtex-2-3-4
Cortex-M3
Cortex-M4
  • Microchip (Atmel)SAM 4L, 4N, 4S
  • NXP (Freescale) Kinetis K, W2
  • Renesas RA4W1, RA6M1, RA6M2, RA6M3, RA6T1
Cortex-M4F
  • Cypress 6200, FM4
  • InfineonXMC4000
  • Microchip (Atmel)SAM 4C, 4E, D5, E5, G5
  • Microchip CEC1302
  • Nordic nRF52
  • NXPLPC4000, LPC4300
  • NXP (Freescale) Kinetis K, V3, V4
  • Renesas Synergy S3, S5, S7
  • Silicon Labs (Energy Micro)EFM32 Wonder
  • STMicroelectronicsSTM32 F3, F4, L4, L4+, WB
  • Texas Instruments LM4F/TM4C,MSP432
  • Toshiba TX04
Cortex-M7F
  • Microchip (Atmel)SAM E7, S7, V7
  • NXP (Freescale) Kinetis KV5x, i.MX RT 10xx, i.MX RT 11xx, S32K3xx
  • STMicroelectronicsSTM32 F7, H7
Cortex-M23
  • GigaDevice CD32E2xx
  • Microchip (Atmel)SAM L10, L11, and PIC 32CM-LE 32CM-LS
  • Nuvoton M23xx family, M2xx family, NUC1262, M2L31
  • Renesas S1JA, RA2A1, RA2L1, RA2E1, RA2E2
Cortex-M33F
  • Analog Devices ADUCM4
  • Dialog DA1469x
  • GigaDevice GD32E5, GD32W5
  • Nordic nRF91, nRF5340, nRF54
  • NXPLPC5500, i.MX RT600
  • ON RSL15
  • Renesas RA4, RA6
  • STSTM32 H5, L5, U5, WBA
  • Silicon Labs Wireless Gecko Series 2
Cortex-M35P
  • STMicroelectronics ST33K
Cortex-M55F
Cortex-M85F
  • Renesas RA8
Real-time
microprocessors
Cortex-R4F
  • Texas Instruments RM4, TMS570
  • Renesas RZ/T1
Cortex-R5F
Cortex-R7F
  • Renesas RZ/G2E, RZ/G2H, RZ/G2M, RZ/G2N
Cortex-R52F
  • NXP S32Z, S32E
  • Renesas RZ/N2L, RZ/T2L, RZ/T2M
Cortex-R52+F
  • STMicroelectronics Stellar G, Stellar P
Classic ARM-based chips
Classic
processors
ARM7
ARM9
ARM11
ARMv2a
compatible
ARMv4
compatible
ARMv5TE
compatible
  • Intel/MarvellXScale
  • Marvell Sheeva, Feroceon, Jolteon, Mohawk
  • Faraday FA606TE, FA616TE, FA626TE, FA726TE
Origins
In active development
Development discontinued
Main
Architectures
Word length
4-bit
8-bit
16-bit
32-bit
64-bit
Interfaces
Programming
Debugging
Lists
See also
Models
Architecture
Instruction set
architectures
Types
Instruction
sets
Execution
Instruction pipelining
Hazards
Out-of-order
Speculative
Parallelism
Level
Multithreading
Flynn's taxonomy
Processor
performance
Types
By application
Systems
on chip
Hardware
accelerators
Word size
Core count
Components
Functional
units
Logic
Registers
Control unit
Datapath
Circuitry
Power
management
Related
Authority control databases: NationalEdit this at Wikidata
Retrieved from "https://en.wikipedia.org/w/index.php?title=ARM_architecture_family&oldid=1280616371#VFP"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp