This PR generalizes the existing implementations of arithmetic operations between scalars and arrays to more combinations of types. This is especially useful for operations between complex and real types.

A couple of notes:

This removes the special handling of commutative operations. (Before, commutative operations with the scalar on the left side were implemented by calling the operation with the scalar on the right side.) IMO, implementations for more combinations of types are more important than possible differences in compile time due to reusing implementations.
The new&arr (op) scalar implementation brings a performance boost.
An alternative approach for the "scalar on lhs" operations would be to add more implementations for specific combinations of types, e.g.f32 andComplex<f32>. I chose the generic approach instead for its conciseness and flexibility.
This change is backwards compatible, except for possible changes in type inference due to the implementations for more combinations of types.

Fixes#781.

jturner314 added the enhancement label

Feb 15, 2020

jturner314 mentioned this pull request

Feb 15, 2020

Scalar operations with complex array and complex scalars#781

Open

Copy link

MemberAuthor

jturner314 commentedFeb 16, 2020

It appears that Rust 1.37 has a bug that prevents this PR from working properly. Fortunately, this bug isn't present in the latest stable compiler, but we'll need to wait until we bump the minimum required Rust version before merging this PR.

jturner314 added the postponed label

Feb 16, 2020

bluss added this to the0.14.0 milestone

Apr 22, 2020

Copy link

Member

bluss commentedApr 22, 2020

Delightful that the.to_owned() was so easy to remove just like that

bluss self-requested a review

December 9, 2020 15:53

bluss removed the postponed label

Dec 9, 2020

jturner314 added3 commits

December 29, 2020 19:59

Add benches for op with scalar and strided array

52ca234

Allow&arr (op) scalar output to be any elem type

65e13d9

This change has two benefits:* The new implementation applies to more combinations of types. For  example, it now applies to `&Array2<f32>` and `Complex<f32>`.* The new implementation avoids cloning the elements twice, and it  avoids iterating over the elements twice. (The old implementation  called `.to_owned()` followed by the arithmetic operation, while the  new implementation clones the elements and performs the arithmetic  operation in the same iteration.)On my machine, this change improves the performance for bothcontiguous and discontiguous arrays. (`scalar_add_1/2` go from ~530ns/iter to ~380 ns/iter, and `scalar_add_strided_1/2` go from ~1540ns/iter to ~1420 ns/iter.)

Generalize lhs scalar ops to more combos of types

b2a7d0b

This doesn't have a noticeable impact on the results of the`scalar_add_2` and `scalar_add_strided_2` benchmarks.

bluss force-pushed thecomplex-real-ops branch from19e35d3 tob2a7d0bCompare

December 29, 2020 19:00

Copy link

Member

bluss commentedDec 29, 2020

Rebased to current master

bluss reviewed

Dec 29, 2020

View reviewed changes

src/impl_ops.rs

		$scalar: Clone + $trt<A, Output=B>,
		A: Clone,
		S: Data<Elem=A>,
		D: Dimension,

Copy link

Member

blussDec 29, 2020•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

This impl somehow now breaks Rust -- see the failed tests -- and causes a recursion errror - for an expression that has typef32 +f32 which is quite strange/scary(!)

   --> tests/oper.rs:159:48    |159 |         .fold(f32::zero(), |acc, (&x, &y)| acc + x * y)    |                                                ^    |    = help: consider adding a `#![recursion_limit="256"]` attribute to your crate (`oper`)    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`    = note: required because of the requirements on the impl of `Add<&ndarray::ArrayBase<_, _>>` for `f32`

Unsure if this is a Rust bug - for example that the impl is accepted(?), but I think this impl is too general and has infinite descent.

Given the question iff32 implementsAdd<&ArrayBase<S, D>> look for other impl that hasf32: Add<A> whereS: Data<Elem=A> which looks recursive, is that it?

Copy link

MemberAuthor

jturner314Dec 30, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

It looks like a compiler bug to me. As you point out, the expression involves onlyf32, but for some reason, the error message indicates that one of the arguments is an array. It's also interesting that on my machine with Rust 1.48.0, the error message is slightly different, saying "impl ofAdd<ndarray::ArrayBase<_, _>> forf32" instead of the error message in your comment "impl ofAdd<&ndarray::ArrayBase<_, _>> forf32". (Note the&.)

The function fails to compile (with the same error message) even after adding type annotations:

fnreference_dot<'a,V1,V2>(a:V1,b:V2) ->f32whereV1:AsArray<'a,f32>,V2:AsArray<'a,f32>,{let a:ArrayView1<'a,f32> = a.into();let b:ArrayView1<'a,f32> = b.into();    a.iter().zip(b.iter()).fold(f32::zero(), |acc:f32,(&x,&y):(&f32,&f32)| acc + x* y)}

but if I remove the+ x * y, it compiles successfully:

fnreference_dot<'a,V1,V2>(a:V1,b:V2) ->f32whereV1:AsArray<'a,f32>,V2:AsArray<'a,f32>,{let a:ArrayView1<'a,f32> = a.into();let b:ArrayView1<'a,f32> = b.into();    a.iter().zip(b.iter()).fold(f32::zero(), |acc:f32,(&x,&y):(&f32,&f32)| acc)}

I don't see any reason other than a compiler bug for the first function to fail to compile when the second one compiles without errors, since the type annotations confirm that the closure is operating only onf32 values.

This also compiles successfully:

fnreference_dot2<'a>(a:ArrayView1<'a,f32>,b:ArrayView1<'a,f32>) ->f32{    a.iter().zip(b.iter()).fold(f32::zero(), |acc:f32,(&x,&y):(&f32,&f32)| acc + x* y)}

so the bug involves the.into() calls in some way. It's surprising that adding explicit type annotations for the results of the.into() calls, as in the first example, doesn't work around the bug.

Fwiw, I don't thinkimpl<'a, A, S, D, B> $trt<&'a ArrayBase<S, D>> for $scalar is infinitely recursive, since AFAIK it's not possible to have an array of (arrays of (arrays of (arrays of ... [infinite depth]))). The innermost array type can only have an element type that's not an array. You're right that there is recursion if you're dealing with arrays of arrays, but that's the correct behavior, and the recursion is not infinite.

For the particular function we're looking at, the impl doesn't apply, and I don't think the compiler should be trying to apply it. (I think it should only apply the impl if it knows the RHS has some type&ArrayBase<?S, ?D>, where?S and?D are inference variables.)

Copy link

Member

blussDec 30, 2020•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Interesting, the test runners for cross_test, stable, mips vs i686 disagree with each other about the error too, in the same way, even if they both use Rust 1.48

Copy link

MemberAuthor

jturner314Dec 31, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I reported the issue (with a simplified example) atrust-lang/rust#80542.

Copy link

Member

bluss commentedDec 30, 2020•
edited
Loading

I have considered the question of deprecating scalars as left hand side (LHS) operands. The reason would be because their implementation does not fit well with how trait impls are normally written, and the inevitable asymmetry between array + scalar and scalar + array in terms of which types are accepted.

jturner314 mentioned this pull request

Dec 31, 2020

Operator impl causes compilation error for an expression involving the operator with a different pair of typesrust-lang/rust#80542

Open

Copy link

MemberAuthor

jturner314 commentedDec 31, 2020•
edited
Loading

I have considered the question of deprecating scalars as left hand side (LHS) operands.

I agree that the implementations we have are somewhat unsatisfying, but IMO they're useful enough to keep. I would guess that the vast majority of users are dealing with the element types we implement the operators for, probably mostlyf32/f64, and the impls are useful because subtraction and division aren't commutative. (To perform subtraction/division with a scalar on the left side without these impls, you'd have to usemapv orazip, which are much more verbose.)

I suppose an alternative option to the existing impls would be aScalar wrapper type so that you could write expressions like this:

Scalar(2.) / array

which would work with any element type but would be less intuitive and would make expressions more verbose. I'm not sure aScalar wrapper type is much better than usingmapv.

Copy link

Member

bluss commentedJan 10, 2021•
edited
Loading

I think I have found out that ifthis (ugly) workaround is applied, the ScalarOperand trait is not needed anymore - meaning an unrestrictedArray1<A> + A would be allowed (withoutA: ScalarOperand). However, I'm unsure if it can be extended toArray1<A> + B - probably not.

I think that Scalar is a lot better thanmapv, just more work for us to introduce it with all the right impls.