ENH: Improve np.linalg.det performance #28649
Conversation
So far asv isn't showing much perf improvement on this branch on x86_64 Linux (i9-13900K):
asv continuous -E virtualenv -e -b "time_det.*" main linalg_refactor
BENCHMARKS NOT SIGNIFICANTLY CHANGED.
It may be because you're only making the first of the series of proposed changes.
Regardless of the performance changes, I suppose this is a reduction in lines of code, so maybe "ok" on its own anyway.
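For context, the -b "time_det.*" filter in the asv command above selects benchmarks whose method names match time_det. A minimal sketch of such a benchmark is below; the class name and array shapes are illustrative assumptions, not necessarily what NumPy's actual benchmark suite uses.

import numpy as np


class TimeDet:
    # asv runs setup() before timing each of the time_* methods
    def setup(self):
        rng = np.random.default_rng(42)
        self.small = rng.random((3, 3))          # single small matrix
        self.stacked = rng.random((100, 4, 4))   # stack of small matrices

    def time_det_small(self):
        np.linalg.det(self.small)

    def time_det_stacked(self):
        np.linalg.det(self.stacked)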
numpy/linalg/_linalg.py Outdated
@@ -152,8 +152,8 @@ def _commonType(*arrays):
     for a in arrays:
         type_ = a.dtype.type
         if issubclass(type_, inexact):
-            if isComplexType(type_):
-                is_complex = True
+            is_complex = is_complex or isComplexType(type_)
Is this really worth doing? It takes my brain longer to process and shouldn't matter much performance-wise?
I'm also not sure this routine is worth it, but if one goes for it, I'd start with something like
types = set(a.dtype.type for a in arrays)
which will generally reduce the number already, and then do
is_complex = any(isComplexType(type_) for type_ in types)
and something along similar lines for result_type (but with a check whether one can really not simply use the built-in np.result_type - not obvious to me).
But I'd do it in a separate PR.
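A self-contained sketch of that set-based approach could look as follows. _is_any_complex is a hypothetical helper for illustration, and isComplexType is re-implemented here only so the snippet runs on its own; this is not the actual _commonType refactor.

import numpy as np


def isComplexType(t):
    # same test as the helper of this name in numpy/linalg/_linalg.py
    return issubclass(t, np.complexfloating)


def _is_any_complex(*arrays):
    # Deduplicate the dtypes first, so repeated occurrences of the same
    # dtype are only inspected once, then test the (usually tiny) set.
    types = {a.dtype.type for a in arrays}
    return any(isComplexType(type_) for type_ in types)


print(_is_any_complex(np.ones(3), np.ones(3, dtype=np.complex128)))  # True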
Interesting idea! It might not work out because the common use case is with len(arrays) just 1 or 2, and the set overhead is too large. I will remove this change here and test it out in a new PR.
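One way to check that intuition about set overhead for one or two arrays is a quick timeit comparison along these lines (purely illustrative, not part of the PR):

import timeit

import numpy as np

a = np.ones((3, 3))
b = np.ones((3, 3), dtype=np.complex128)


def loop_version(*arrays):
    # current style: plain loop over the (usually 1 or 2) arrays
    is_complex = False
    for arr in arrays:
        if issubclass(arr.dtype.type, np.complexfloating):
            is_complex = True
    return is_complex


def set_version(*arrays):
    # suggested style: deduplicate the dtypes first
    types = {arr.dtype.type for arr in arrays}
    return any(issubclass(t, np.complexfloating) for t in types)


print(timeit.timeit(lambda: loop_version(a, b), number=100_000))
print(timeit.timeit(lambda: set_version(a, b), number=100_000))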
I like the cleanup, though I'm not surprised it doesn't have that much of an effect on speed. A suggestion in-line for an extra (if very minor) performance boost.
Also, I would suggest doing just the removal of _assert_stacked_square here.
@@ -1320,11 +1317,6 @@ def eigvalsh(a, UPLO='L'):
         w = gufunc(a, signature=signature)
     return w.astype(_realType(result_t), copy=False)

-def _convertarray(a):
Nice catch that this is not actually used!
numpy/linalg/_linalg.py Outdated
@@ -197,6 +197,9 @@ def _assert_stacked_2d(*arrays):
 def _assert_stacked_square(*arrays):
     for a in arrays:
         if a.ndim < 2:
I like the combination. If one really wants to get the most out of it, it could be

try:
    m, n = a.shape[-2:]
except ValueError:
    raise LinAlgError(f"{a.ndim}-dimensional...") from None
if m != n:
    ...

This uses the fact that these days try/except has no cost if no exception is raised.
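Putting the pieces together, the combined check could look roughly like this; a sketch based on the suggestion above, with illustrative error messages rather than the exact ones in the PR:

from numpy.linalg import LinAlgError


def _assert_stacked_square(*arrays):
    for a in arrays:
        try:
            # unpacking fails with ValueError for 0-D and 1-D inputs
            m, n = a.shape[-2:]
        except ValueError:
            raise LinAlgError(
                f"{a.ndim}-dimensional array given. Array must be "
                "at least two-dimensional") from None
        if m != n:
            raise LinAlgError(
                "Last 2 dimensions of the array must be square")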
Sorry, pushed this after submitting the review - above is the most relevant comment! (and not very relevant at that!)
The _assert_stacked_square check is about 10% faster using your suggestion; I updated the PR with this.
Looks good to me, thanks! Let's get it in.
Merged commit 422ca44 into numpy:main.
Thanks for reviewing! Next PR is #28686
ENH: Improve np.linalg.det performance (#28649)

* ENH: Improve np.linalg.det performance
* Update numpy/linalg/_linalg.py
* revert change to complex detection
* use suggestion
* whitespace
* add more small array benchmarks
* trigger build
We can improve the performance of np.linalg.det for small arrays by up to 40% with 3 changes:

1. Merge the _assert_stacked_2d check into _assert_stacked_square.
2. Speed up _commonType by using a cache.
3. Avoid r.astype(...) for scalar arguments making a copy of the data (and internally converting to an array and back).

In this PR we perform the first step.
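For the third proposed change (left for a later PR), the point is that calling r.astype(...) on a NumPy scalar copies the data via an array round-trip even when the dtype already matches. A hypothetical guard like the one below would skip that call when it is not needed; it only illustrates the idea and is not the change that was eventually made.

import numpy as np

r = np.float64(2.5)      # e.g. the scalar returned by det() for a 2-D input
result_t = np.float64

# Unconditional conversion: per the description above, this copies the data
# and internally converts the scalar to an array and back.
converted = r.astype(result_t, copy=False)

# Hypothetical guard: only convert when the dtype actually differs.
converted = r if r.dtype.type is result_t else r.astype(result_t, copy=False)
print(converted)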