Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

SSE3

From Wikipedia, the free encyclopedia
CPU instruction set
Not to be confused withSSSE3.

SSE3,Streaming SIMD Extensions 3, also known by itsIntel code namePrescott New Instructions (PNI),[1] is the third iteration of theSSE instruction set for theIA-32 (x86) architecture. Intel introduced SSE3 in 2004 with thePrescott revision of theirPentium 4 andCeleron D CPUs.[1] In April 2005,AMD introduced a subset of SSE3 in revision E (Venice and San Diego) of theirAthlon 64 CPUs.[2] The earlierSIMD instruction sets on thex86 platform, from oldest to newest, areMMX,3DNow! (developed by AMD, no longer supported on newer CPUs),SSE, andSSE2.

SSE3 contains 13 new instructions overSSE2.[3]

Changes

[edit]

The most notable change is the capability to work horizontally in a register, as opposed to the more or less strictly vertical operation of all previous SSE instructions. More specifically, instructions to add and subtract the multiple values stored within a single register have been added.[4] These instructions can be used to speed up the implementation of a number ofDSP and3D operations. There is also a new instruction to convert floating point values to integers without having to change the global rounding mode, thus avoiding costlypipeline stalls. Finally, the extension addsLDDQU, an alternative misaligned integer vector load that has better performance onNetBurst based platforms for loads that cross cacheline boundaries.[5]

CPUs with SSE3

[edit]

New instructions

[edit]

Common instructions

[edit]

Arithmetic

[edit]
ADDSUBPD
Add-Subtract-Packed-Double[8]
  • Input: { A0, A1 }, { B0, B1 }
  • Output: { A0 − B0, A1 + B1 }
ADDSUBPS
Add-Subtract-Packed-Single[8]
  • Input: { A0, A1, A2, A3 }, { B0, B1, B2, B3 }
  • Output: { A0 − B0, A1 + B1, A2 − B2, A3 + B3 }

AOS ( Array Of Structures )

[edit]
HADDPD
Horizontal-Add-Packed-Double[8]
  • Input: { A0, A1 }, { B0, B1 }
  • Output: { A0 + A1, B0 + B1 }
HADDPS
Horizontal-Add-Packed-Single[8]
  • Input: { A0, A1, A2, A3 }, { B0, B1, B2, B3 }
  • Output: { A0 + A1, A2 + A3, B0 + B1, B2 + B3 }
HSUBPD
Horizontal-Subtract-Packed-Double[8]
  • Input: { A0, A1 }, { B0, B1 }
  • Output: { A0 − A1, B0 − B1 }
HSUBPS
Horizontal-Subtract-Packed-Single[8]
  • Input: { A0, A1, A2, A3 }, { B0, B1, B2, B3 }
  • Output: { A0 − A1, A2 − A3, B0 − B1, B2 − B3 }
LDDQU
As stated above, this is an alternative misaligned integer vector load.[8] It can be helpful for video compression tasks.
MOVDDUP,MOVSHDUP,MOVSLDUP[4]
These are useful for complex numbers and wave calculation like sound.
FISTTP
Like the older x87FISTP instruction, but ignores the floating point control register's rounding mode settings and uses the "chop" (truncate) mode instead.[4] Allows omission of the expensive loading and re-loading of the control register in languages such as C where float-to-int conversion requires truncate behaviour by standard.

Other instructions

[edit]
MONITOR,MWAIT
TheMONITOR instruction is used to specify a memory address for monitoring, while theMWAIT instruction puts the processor into a low-power state and waits for a write event to the monitored address.[4]

References

[edit]
  1. ^abShimpi, Anand Lal; Wilson, Derek."Intel's Pentium 4 E: Prescott Arrives with Luggage".www.anandtech.com. Archived fromthe original on April 24, 2010. Retrieved2023-04-10.
  2. ^Shimpi, Anand Lal."Industry Update - Q4-2004: AMD adds SSE3 Support, Intel's 925/915 not selling and more".www.anandtech.com. Archived fromthe original on August 9, 2010. Retrieved2023-04-10.
  3. ^"Intel Instruction Set Extensions Technology".Intel. Retrieved2023-04-10.
  4. ^abcdWright, Christopher."SSE3 Instruction Set".softpixel.com. Retrieved2023-04-10.
  5. ^"LDDQU — Load Unaligned Integer 128 Bits".www.felixcloutier.com. Retrieved2023-04-10.
  6. ^Wilson, Derek."AMD K8 E4 Stepping: SSE3 Performance".www.anandtech.com. Archived fromthe original on July 6, 2010. Retrieved2023-04-10.
  7. ^"Intel Xeon 3.4GHz ['Nocona' core]".HEXUS. 2004-08-18. Retrieved2023-04-10.
  8. ^abcdefg"SSE3 Instructions - x86 Assembly Language Reference Manual".docs.oracle.com. Retrieved2023-04-10.

External links

[edit]
SIMD (RISC)
SIMD (x86)
Bit manipulation
  • BMI (ABM: 2007, BMI1: 2012, BMI2: 2013, TBM: 2012)
  • ADX (2014)
Compressed instructions
Security andcryptography
Transactional memory
Virtualization
General-purpose registers
Suspended extensions' dates arestruck through.
Retrieved from "https://en.wikipedia.org/w/index.php?title=SSE3&oldid=1309323038"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp