Movatterモバイル変換

[0]ホーム

Jump to content

F16C

Edit links

From Wikipedia, the free encyclopedia

Architectural instruction

This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(December 2013) (Learn how and when to remove this message)

For the fighter aircraft, seeF-16C.

TheF16C^[1] (previously/informally known asCVT16) instruction set is anx86 instruction set architecture extension which provides support for converting betweenhalf-precision and standard IEEEsingle-precision floating-point formats.

History

[edit]

The CVT16 instruction set, announced byAMD on May 1, 2009,^[2] is an extension to the 128-bitSSE core instructions in thex86 andAMD64 instruction set.

CVT16 is a revision of part of theSSE5 instruction set proposal announced on August 30, 2007, which is supplemented by theXOP andFMA4 instruction sets. This revision makes the binary coding of the proposed new instructions more compatible withIntel'sAVX instruction extensions, while the functionality of the instructions is unchanged.

In recent documents, the name F16C is formally used in bothIntel andAMD x86-64 architecture specifications.

Technical information

[edit]

There are variants that convert four floating-point values in anXMM register or 8 floating-point values in aYMM register.

The instructions are abbreviations for "vector convert packed half to packed single" and vice versa:

VCVTPH2PS xmmreg,xmmrm64 – convert four half-precision floating point values in memory or the bottom half of an XMM register to four single-precision floating-point values in an XMM register.
VCVTPH2PS ymmreg,xmmrm128 – convert eight half-precision floating point values in memory or an XMM register (the bottom half of a YMM register) to eight single-precision floating-point values in a YMM register.
VCVTPS2PH xmmrm64,xmmreg,imm8 – convert four single-precision floating point values in an XMM register to half-precision floating-point values in memory or the bottom half an XMM register.
VCVTPS2PH xmmrm128,ymmreg,imm8 – convert eight single-precision floating point values in a YMM register to half-precision floating-point values in memory or an XMM register.

The 8-bit immediate argument toVCVTPS2PH selects therounding mode. Values 0–4 select nearest, down, up, truncate, and the mode set inMXCSR.RC.

Support for these instructions is indicated by bit 29 of ECX afterCPUID with EAX=1.

CPUs with F16C

[edit]

AMD:
- Jaguar-based processors
- Puma-based processors
- "Heavy Equipment" processors
  - Bulldozer-based processors, Q4 2011^[3]
  - Piledriver-based processors, Q4 2012^[4]
  - Steamroller-based processors, Q1 2014
  - Excavator-based processors, Q2 2015
- Zen-based processors, Q1 2017, and newer
Intel:
- Ivy Bridge processors and newer

References

[edit]

^Chuck Walbourn (September 11, 2012)."DirectXMath: F16C and FMA".
^"128-Bit and 256-Bit XOP, FMA4 and CVT16 Instructions"(PDF).AMD64 Architecture Programmer's Manual. Vol. 6. 2009-05-01. Archived fromthe original(PDF) on 2009-05-20. Retrieved2022-07-05.
^Dave Christie (2009-05-07),Striking a balance, AMD Developer blogs, archived fromthe original on 2013-11-09, retrieved2012-01-17
^New "Bulldozer" and "Piledriver" Instructions(PDF), AMD, October 2012

External links

[edit]

New Bulldozer and Piledriver Instructions[1]Archived 2013-01-07 at theWayback Machine
DirectX math F16C and FMA[2]
AMD64 Architecture Programmer's Manual Volume 1[3]Archived 2013-12-14 at theWayback Machine
AMD64 Architecture Programmer's Manual Volume 2[4]
AMD64 Architecture Programmer's Manual Volume 3[5]Archived 2013-12-14 at theWayback Machine
AMD64 Architecture Programmer's Manual Volume 4[6]Archived 2021-11-14 at theWayback Machine
AMD64 Architecture Programmer's Manual Volume 5[7]Archived 2013-12-14 at theWayback Machine
IA32 Architectures Software Developer Manual[8]

AMD technology

Software

Platforms

Current	ROCm GPUOpen
Obsolete	Spider Dragon Horus

Technology

Instructions

v t e Intel technology
Platforms	Centrino Centrino 2 Viiv MID Tablet CULV Ultrabook Skulltrail NUC Galileo Edison Curie Evo
Discontinued	Common Building Block MultiProcessor Specification Intel Communication Streaming Architecture Intel Inboard 386 Intel Play MMC-1 MMC-2
Current	Advanced Programmable Interrupt Controller CNVi Intel Turbo Boost vPro Intel Secure Key Intel Management Engine Active Management Technology AMT versions High-bandwidth Digital Content Protection High Definition Audio Hub Architecture Rapid Storage Technology SpeedStep Serial Digital Video Out Host Embedded Controller Interface Hyper-threading Omni-Path Platform Environment Control Interface QuickPath Interconnect Platform Controller Hub System Management Bus Thunderbolt Ultra Path Interconnect
Upcoming	Silicon Photonics Link

v t e Instruction set extensions
SIMD (RISC)	Alpha MVI ARM NEON SVE MIPS MDMX MIPS-3D MXU MIPS SIMD PA-RISC MAX Power ISA VMX SPARC VIS
SIMD (x86)	MMX (1996) 3DNow! (1998) SSE (1999) SSE2 (2001) SSE3 (2004) SSSE3 (2006) SSE4 (2006) SSE5~~(2007)~~ AVX (2008) F16C (2009) XOP (2009) FMA (FMA4: 2011, FMA3: 2012) AVX2 (2013) AVX-512 (2015) AMX (2022) AVX10 (2023)
Bit manipulation	BMI (ABM: 2007, BMI1: 2012, BMI2: 2013, TBM: 2012) ADX (2014)
Compressed instructions	Thumb MIPS16e ASE RVC
Security andcryptography	PadLock (2003) AES-NI (2008); ARMv8 also has AES instructions CLMUL (2010) RDRAND (2012) SHA (2013) MPX (2015) SGX (2015) TDX (2021)
Transactional memory	TSX (2013) ASF
Virtualization	VT-x (2005) AMD-V (2006) VT-d (AMD-Vi)
Suspended extensions' dates are~~struck through~~.

Retrieved from "https://en.wikipedia.org/w/index.php?title=F16C&oldid=1227910930"

Categories:

Hidden categories:

[8]ページ先頭