[mono] Implement AdvSimd #49260

Merged

imhameed merged 58 commits into dotnet:main from imhameed:mono-arm64-advsimd

Mar 12, 2021

Contributor

imhameed commented Mar 6, 2021 (edited)

This change adds AdvSimd and AdvSimd.Arm64 support to LLVM-enabled Mono.

Most aarch64 LLVM intrinsic functions are overloaded and have names determined
by an invariant base string prepended to a string representation of one or two
type parameters. Intrinsic functions used by an LLVM module must have a
declaration somewhere in memory when JITting or somewhere in the output bitcode
file when AOTing. Currently Mono maintains a hash table that maps internal
intrinsic IDs to LLVM intrinsic declarations. These IDs have been extended: a
simplified type representation is added to the key's upper bits. This
representation is not especially compact, and currently uses 9 bits to label 18
states, but it's easy to look at in a debugger. (A simple base-18 encoding
could encode three parameters in 13 bits.)
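
As a rough illustration of the tagging scheme described above, the sketch below packs a 9-bit type tag into the bits above a base intrinsic ID. The tag layout (three one-hot shape bits plus six one-hot element-type bits, giving 3 × 6 = 18 valid states), the 16-bit base ID width, and every name in the block are assumptions made for this example, not the definitions used in the PR. The last function shows the denser base-18 alternative: three parameters fit in 13 bits because 18³ = 5832 ≤ 2¹³ = 8192.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical one-hot tag: 3 shape bits + 6 element-type bits = 9 bits. */
enum {
    TAG_SCALAR    = 1 << 0,
    TAG_VECTOR64  = 1 << 1,
    TAG_VECTOR128 = 1 << 2,
    TAG_INT8      = 1 << 3,
    TAG_INT16     = 1 << 4,
    TAG_INT32     = 1 << 5,
    TAG_INT64     = 1 << 6,
    TAG_FLOAT32   = 1 << 7,
    TAG_FLOAT64   = 1 << 8,
    TAG_MASK      = 0x1ff,
    BASE_ID_BITS  = 16      /* assumed width of the untagged intrinsic ID */
};

/* Place the tag in the bits above the base ID to form the lookup key. */
static uint32_t tagged_id(uint32_t base_id, uint32_t tag)
{
    assert(base_id < (1u << BASE_ID_BITS));
    assert((tag & ~(uint32_t)TAG_MASK) == 0);
    return (tag << BASE_ID_BITS) | base_id;
}

static uint32_t tag_of(uint32_t key)
{
    return (key >> BASE_ID_BITS) & TAG_MASK;
}

static uint32_t base_of(uint32_t key)
{
    return key & ((1u << BASE_ID_BITS) - 1);
}

/* Denser alternative: each parameter is a state index in 0..17, and three
 * of them are packed as base-18 digits. The maximum value is 18^3 - 1. */
static uint32_t pack_base18(uint32_t a, uint32_t b, uint32_t c)
{
    assert(a < 18 && b < 18 && c < 18);
    return (a * 18 + b) * 18 + c;
}
```

The one-hot form costs more bits, but a tag such as `TAG_VECTOR128 | TAG_FLOAT32` can be read directly from a hex dump, which matches the "easy to look at in a debugger" trade-off mentioned above.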

These overload-tagged IDs can be passed to
OP_XOP_OVR{_,_SCALAR,_BYSCALAR}X_{X,X_X,X_X_X}. The return type of the
intrinsic that generates these mini ops is used to derive the overload tag to
find the corresponding LLVM intrinsic function declaration.
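
To make the naming rule concrete, here is a minimal sketch of how an overloaded intrinsic name could be formed from a base string and a return type. LLVM spells vector type suffixes as `v<lanes><i|f><bits>` (for example `v4i32`); the `VecType` struct and the `overload_name` helper are hypothetical and only cover the single-type-parameter case.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical description of a vector return type. */
typedef struct {
    int lanes;      /* number of elements, e.g. 4 */
    int elem_bits;  /* element width in bits, e.g. 32 */
    int is_float;   /* nonzero for floating-point elements */
} VecType;

/* Append the type suffix to the base name.
 * Returns 1 if the full name fit in buf, 0 otherwise. */
static int overload_name(char *buf, size_t len, const char *base, VecType ret)
{
    assert(base != NULL && strlen(base) > 0);
    assert(ret.lanes > 0 && ret.elem_bits > 0);
    int n = snprintf(buf, len, "%s.v%d%c%d", base, ret.lanes,
                     ret.is_float ? 'f' : 'i', ret.elem_bits);
    return n >= 0 && (size_t)n < len;
}
```

For a `Vector128<int>` return type and the base `llvm.aarch64.neon.sqshl`, this yields `llvm.aarch64.neon.sqshl.v4i32`.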

MonoLLVMModule::intrins_by_id is removed, because LLVM intrinsic lookup keys
are no longer small contiguous integers. It only seemed to serve as a lookup
table for data already contained in a hash table.
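
The reasoning can be illustrated with a toy map. Once a tag occupies the upper bits, keys are sparse, so a flat array indexed by ID would need millions of mostly empty slots, while a hash table handles them directly. The fixed-capacity, linear-probing table below is only a sketch with hypothetical names; it is not the hash table Mono uses.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum { MAP_SLOTS = 64 };  /* hypothetical fixed capacity for the sketch */

typedef struct {
    uint32_t key;      /* tagged intrinsic ID */
    const void *decl;  /* stand-in for an LLVM declaration handle */
    int used;
} Slot;

typedef struct {
    Slot slots[MAP_SLOTS];
} DeclMap;

/* Multiplicative hash; the top 6 bits select one of the 64 slots. */
static size_t slot_for(uint32_t key)
{
    return (size_t)((uint32_t)(key * 2654435761u) >> 26);
}

static void map_put(DeclMap *m, uint32_t key, const void *decl)
{
    size_t i = slot_for(key);
    for (size_t n = 0; n < MAP_SLOTS; ++n, i = (i + 1) % MAP_SLOTS) {
        if (!m->slots[i].used || m->slots[i].key == key) {
            m->slots[i].key = key;
            m->slots[i].decl = decl;
            m->slots[i].used = 1;
            return;
        }
    }
    assert(!"DeclMap is full");
}

static const void *map_get(const DeclMap *m, uint32_t key)
{
    size_t i = slot_for(key);
    for (size_t n = 0; n < MAP_SLOTS; ++n, i = (i + 1) % MAP_SLOTS) {
        if (!m->slots[i].used)
            return NULL;               /* empty slot ends the probe chain */
        if (m->slots[i].key == key)
            return m->slots[i].decl;
    }
    return NULL;
}
```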

The corresponding instructions for some of these .NET-level intrinsics take
immediate parameters. For some of these instructions, the LLVM IR code that
selects these immediate-argument instructions can emit a fallback for
non-constant parameters, either by using an equivalent instruction with a
register operand or by using a longer and less-efficient instruction sequence.
For the rest, a branching code sequence is emitted. Helper functions
(immediate_unroll_begin etc.) are added to make this a little less
repetitious.
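
The branching fallback can be pictured in plain C as a switch with one arm per legal immediate, where each arm uses a literal constant. This is only an analogy for the emitted LLVM IR: the macros and function below are hypothetical, and returning 0 for an out-of-range amount is an assumption of this sketch rather than the behaviour of the runtime.

```c
#include <assert.h>
#include <stdint.h>

/* Stand-in for an instruction that only accepts a constant shift amount. */
#define SHIFT_RIGHT_IMM(x, imm) ((uint8_t)((x) >> (imm)))

/* One arm of the unrolled branch: the immediate is a literal in each arm. */
#define UNROLL_CASE(imm) case imm: return SHIFT_RIGHT_IMM(x, imm)

/* Dispatch a run-time shift amount to the matching constant-immediate arm.
 * Right shifts of 8-bit elements accept immediates 1 through 8. */
static uint8_t shift_right_dynamic(uint8_t x, unsigned amount)
{
    switch (amount) {
    UNROLL_CASE(1);
    UNROLL_CASE(2);
    UNROLL_CASE(3);
    UNROLL_CASE(4);
    UNROLL_CASE(5);
    UNROLL_CASE(6);
    UNROLL_CASE(7);
    UNROLL_CASE(8);
    default:
        return 0;  /* assumption: out-of-range amounts yield 0 here */
    }
}
```

The `UNROLL_CASE` macro plays a role loosely similar to the helper functions mentioned above: it keeps each arm to one line so the repeated structure stays readable.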

Some operations take an immediate operand denoting a lane to select in a vector
before proceeding with another generic vector or scalar operation. These are
decomposed into a sequence of OP_ARM64_SELECT_SCALAR followed by the
non-lane-specific operation. LLVM can still optimize this to the lane-selecting
instruction when possible, and can generate fallback code for non-immediate
lane selection.
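
A scalar C model of this decomposition, using multiply-by-selected-scalar as the example, might look like the following. The `V4I32` type and all three functions are hypothetical stand-ins: `select_scalar` models the lane-selection step and `mul_by_scalar` models the generic operation that follows it.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical four-lane vector of 32-bit integers. */
typedef struct {
    int32_t lane[4];
} V4I32;

/* Step 1: pick one lane out of the vector (models the select-scalar op). */
static int32_t select_scalar(V4I32 v, unsigned lane)
{
    assert(lane < 4);
    return v.lane[lane];
}

/* Step 2: the non-lane-specific operation, here multiply by a scalar. */
static V4I32 mul_by_scalar(V4I32 a, int32_t s)
{
    V4I32 r;
    for (int i = 0; i < 4; ++i)
        r.lane[i] = a.lane[i] * s;
    return r;
}

/* The lane-selecting operation is the composition of the two steps. */
static V4I32 mul_by_selected_scalar(V4I32 a, V4I32 b, unsigned lane)
{
    return mul_by_scalar(a, select_scalar(b, lane));
}
```

Because the lane index is an ordinary argument to `select_scalar`, the same composition works whether or not the index is a compile-time constant, which is the property the decomposition relies on.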

The tables describing the intrinsics supported by the runtime are extended to
support intrinsics with different target instructions for signed, unsigned and
floating point parameters. Whenever possible, .NET-level intrinsics that
correspond to a single LLVM intrinsic function are stored as a single entry in
these tables. Unfortunately many intrinsics need to be translated into a
sequence of LLVM IR operations; for these, new mini IR opcodes are added to
select the LLVM IR builder code that should run.
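
One way to picture a table entry that carries separate targets for signed, unsigned and floating-point parameters is sketched below. The struct layout, the lookup function, and the choice of entries are assumptions for illustration; only the LLVM intrinsic base names are real, and the actual tables in the runtime are organised differently.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

typedef enum { ELEM_SIGNED, ELEM_UNSIGNED, ELEM_FLOAT } ElemKind;

/* Hypothetical table entry: one .NET-level name, up to three targets. */
typedef struct {
    const char *name;         /* .NET-level intrinsic name */
    const char *signed_op;    /* target for signed integer elements */
    const char *unsigned_op;  /* target for unsigned integer elements */
    const char *float_op;     /* target for floating-point elements, or NULL */
} IntrinsicEntry;

static const IntrinsicEntry intrinsic_table[] = {
    { "Max", "llvm.aarch64.neon.smax", "llvm.aarch64.neon.umax",
      "llvm.aarch64.neon.fmax" },
    { "AddSaturate", "llvm.aarch64.neon.sqadd", "llvm.aarch64.neon.uqadd",
      NULL },
};

/* Return the target for the given element kind, or NULL if the intrinsic
 * is unknown or has no variant for that kind. */
static const char *select_op(const char *name, ElemKind kind)
{
    size_t count = sizeof intrinsic_table / sizeof intrinsic_table[0];
    for (size_t i = 0; i < count; ++i) {
        const IntrinsicEntry *e = &intrinsic_table[i];
        if (strcmp(e->name, name) != 0)
            continue;
        switch (kind) {
        case ELEM_SIGNED:   return e->signed_op;
        case ELEM_UNSIGNED: return e->unsigned_op;
        case ELEM_FLOAT:    return e->float_op;
        }
    }
    return NULL;
}
```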

ghost added the area-Codegen-LLVM-mono label

Mar 6, 2021

imhameed force-pushed the mono-arm64-advsimd branch 17 times, most recently from 1bef631 to 7d9469c

March 8, 2021 05:32

This was referenced Mar 8, 2021

System.Threading.Tasks.Tests timed out on net5.0-Linux-Debug-arm64-Mono_release #42024

Closed

Mono SIGABRT in System.Drawing.Common affecting some test runs #37838

Closed

imhameed added 10 commits

March 8, 2021 12:23

Checkpoint

0848153

(Insert meaningful description here)

Add some shifts

706cf0e

Implement rounding

79514da

Implement ReverseElement{Bits,8,16,32}

6cc2596

Add reciprocal fp/u32 operations

f0e2e41

Add some bitwise operations

cca921b

Add negation

dad5fed

Add every AdvSimd symbol name

6341449

Minor cleanup

8580569

Checkpoint

a9bc75e

Remove `MonoLLVMModule::intrins_by_id`, which doesn't do anything other than serve as a lookup table for data contained in `intrins_id_to_intrins`

Don't emit table-driven intrinsics when the corresponding intrinsic group isn't fully supported.

Fix ShiftLogicalSaturateScalar, ShiftArithmeticRoundedSaturateScalar,…

d81b1f0

… ShiftArithmeticSaturateScalar, ShiftLeftLogicalSaturate and ShiftLeftLogicalSaturateScalar

Fix ShiftLeftLogicalSaturate and ShiftLeftLogicalSaturateScalar: decompose them into a promotion of the second argument into a vector followed by an overloaded invocation of @llvm.aarch64.neon.uqshl or @llvm.aarch64.neon.sqshl

vargaz approved these changes

Mar 9, 2021

imhameed added 4 commits

March 9, 2021 14:34

Fix ShiftLeftLogicalSaturateUnsignedScalar, ShiftLogicalRoundedSatura…

e69386d

…teScalar

ShiftLeftLogicalSaturateUnsignedScalar: move scalar-op-from-vector-op code into shared functions

Fix PopCount

5b49456

Fix ReverseElement8, ReverseElement16, ReverseElement32

f31aec1

Fix ExtractNarrowingSaturateScalar, ExtractNarrowingSaturateUnsignedS…

12fd7c9

…calar

imhameed force-pushed the mono-arm64-advsimd branch from 46fccb6 to 12fd7c9

March 9, 2021 22:34

imhameed added 4 commits

March 9, 2021 17:05

More test fixes:

d229944

MultiplyDoublingSaturateHighScalar
MultiplyDoublingScalarBySelectedScalarSaturateHigh
MultiplyDoublingWideningSaturateScalarBySelectedScalar
MultiplyDoublingWideningScalarBySelectedScalarAndAddSaturate
MultiplyDoublingWideningScalarBySelectedScalarAndSubtractSaturate
MultiplyRoundedDoublingByScalarSaturateHigh
MultiplyRoundedDoublingBySelectedScalarSaturateHigh
MultiplyRoundedDoublingSaturateHighScalar
MultiplyRoundedDoublingScalarBySelectedScalarSaturateHigh
    - remove unnecessary special cases
MultiplyDoublingWideningSaturateScalar
    - add support for the special-case scalar LLVM intrinsic for sqdmull

LoadAndReplicateToVector: coerce the source pointer to the element ty…

5baa928

…pe when loading a single element

Move OP_INSERT_* and OP_XCAST to a shared arm64/amd64 region

0d67d57

Address feedback: move IntrinsicId (and another LLVM-only anonymous e…

b14fc9a

…num) to a separate header

fanyang-mono approved these changes

Mar 10, 2021

Member

fanyang-mono left a comment

Thanks for this massive change!

Don't attempt to scalarize a non-scalar sqshlu

059bce5

naricc reviewed

Mar 10, 2021

src/mono/mono/mini/mini-llvm.c

		static void set_nonnull_load_flag (LLVMValueRef v);

		enum {
		INTRIN_scalar = 1 << 0,

Contributor

naricc Mar 10, 2021

Is there any particular reason we are defining some of these with constant bit shifts, some with decimal literals, and some with hex literals?

Contributor, Author

imhameed Mar 10, 2021

They're hints to the reader: the enumeration constants given values by constant bit shifts are meant to be used as bit selectors in a bit set, the enumeration constants given values by decimal literals are meant to be used to bound loop ranges, and the enumeration constants given values by hex literals are meant to be used as logical masks.
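
A small sketch of that convention, with hypothetical names rather than the constants from the PR: shifted values act as bit selectors, the decimal value bounds a loop, and the hex value is a mask.

```c
#include <assert.h>

enum {
    SHAPE_SCALAR    = 1 << 0,  /* constant shifts: bit selectors in a set */
    SHAPE_VECTOR64  = 1 << 1,
    SHAPE_VECTOR128 = 1 << 2,
    SHAPE_COUNT     = 3,       /* decimal literal: bounds a loop range */
    SHAPE_MASK      = 0x7      /* hex literal: a logical mask */
};

/* Return the index of the single shape bit set in tag, or -1 if the
 * masked field does not contain exactly one shape bit. */
static int shape_index(unsigned tag)
{
    unsigned bits = tag & SHAPE_MASK;
    for (int i = 0; i < SHAPE_COUNT; ++i) {
        if (bits == (1u << i))
            return i;
    }
    return -1;
}
```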

naricc approved these changes

Mar 10, 2021

MultiplyDoublingWideningSaturateScalar etc.: consistently place the s…

bd2e5b2

…calar or scalar-in-vector return value in a Vector64

Remove OP_ARM64_ZERO_UPPER, which is unused

imhameed force-pushed the mono-arm64-advsimd branch from fe39968 to a3f1171

March 10, 2021 21:13

imhameed added 2 commits

March 11, 2021 07:55

Explicitly zero out the unused bits in scalar ops built out of vector…

ca728df

… ops

undef can apparently pass through intrinsic functions during optimization, so bias towards slightly worse but correct codegen for now
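
The idea in this commit can be modelled in C as follows. C has no direct equivalent of LLVM's `undef`, so this is only an analogy: the scalar is placed in lane 0 of a vector whose other lanes are explicitly zero, the vector operation runs, and lane 0 is read back. The types and function names are hypothetical, and saturating addition is just a convenient example operation.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical four-lane vector of 16-bit integers. */
typedef struct {
    int16_t lane[4];
} V4I16;

/* Build the vector operand with the unused lanes explicitly zeroed. */
static V4I16 scalar_to_vector_zeroed(int16_t s)
{
    V4I16 v = {{0, 0, 0, 0}};
    v.lane[0] = s;
    return v;
}

static int16_t sat_add16(int16_t a, int16_t b)
{
    int32_t sum = (int32_t)a + (int32_t)b;
    if (sum > INT16_MAX)
        return INT16_MAX;
    if (sum < INT16_MIN)
        return INT16_MIN;
    return (int16_t)sum;
}

/* Stand-in for the vector intrinsic: saturating add on every lane. */
static V4I16 sat_add_vector(V4I16 a, V4I16 b)
{
    V4I16 r;
    for (int i = 0; i < 4; ++i)
        r.lane[i] = sat_add16(a.lane[i], b.lane[i]);
    return r;
}

/* Scalar operation built from the vector operation. */
static int16_t sat_add_scalar(int16_t a, int16_t b)
{
    V4I16 r = sat_add_vector(scalar_to_vector_zeroed(a),
                             scalar_to_vector_zeroed(b));
    return r.lane[0];
}
```

The explicit zeroing costs an extra initialisation, which corresponds to the "slightly worse but correct codegen" trade-off the commit message describes.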

Fix the vector concatenation overloads of Vector128/256

24a89e1

imhameed force-pushed the mono-arm64-advsimd branch from a3f1171 to 24a89e1

March 11, 2021 15:55

Sha1.FixedRotate is a scalar-in-vector op. TODO: refactor to use XOP_…

14602df

…SCALAR_X_X

imhameed merged commit 4e2491d into dotnet:main

Mar 12, 2021

imhameed mentioned this pull request

Mar 12, 2021

[mono] Tracking: Implement System.Runtime.Intrinsics.Arm.AdvSimd #42266

Closed

runfoappbot mentioned this pull request

Mar 12, 2021

[tests] System.Text.Json.Tests segfault, for Libraries Test Run release coreclr OSX x64 Release #47805

Closed

This was referenced Mar 22, 2021

Native Asset failure while System.Collections.Concurrent.Tests #48614

Closed

RunContinueWithStressTestsNoState timing out in CI #2271

Closed

System.Collections.Concurrent.Tests crashing in CI #45517

Closed

Test failure Wasm.Build.Tests.WasmBuildAppTest.InvariantGlobalization #49494

Closed

ghost locked as resolved and limited conversation to collaborators

Apr 11, 2021

karelz added this to the 6.0.0 milestone

May 20, 2021

Labels

area-Codegen-LLVM-mono

6 participants
