NotificationsYou must be signed in to change notification settings
Fork33.3k
Star69.8k

gh-119372: recover inf's and zeros in _Py_c_quot#119457

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Merged

serhiy-storchaka merged 8 commits intopython:mainfromskirpichev:fix-complex-tdiv-119372

Jun 29, 2024

Merged

gh-119372: recover inf's and zeros in _Py_c_quot#119457

serhiy-storchaka merged 8 commits intopython:mainfromskirpichev:fix-complex-tdiv-119372

Jun 29, 2024

Conversation

Copy link

Contributor

skirpichev commentedMay 23, 2024•
edited by bedevere-appbot
Loading

In some cases, previosly computed as(nan+nanj), we could recover meaningful component values in the result, see e.g. the C11, Annex G.5.2, routine _Cdivd():

>>>from cmathimport inf, infj, nanj>>> (1+1j)/(inf+infj)# was (nan+nanj)0j>>> (inf+nanj)/complex(2**1000,2**-1000)(inf-infj)

Issue:Invalid corner cases (resulting in nan+nanj) in complex division #119372

pythongh-119372: recover inf's and zeros in _Py_c_quot

06ac53d

In some cases, previosly computed as (nan+nanj), we couldrecover meaningful component values in the result, seee.g. the C11, Annex G.5.2, routine _Cdivd():>>> from cmath import inf, infj, nanj>>> (1+1j)/(inf+infj)  # was (nan+nanj)0j>>> (inf+nanj)/complex(2**1000, 2**-1000)(inf-infj)

bedevere-appbot added the awaiting review label

May 23, 2024

bedevere-appbot mentioned this pull request

May 23, 2024

Invalid corner cases (resulting in nan+nanj) in complex division#119372

Closed

serhiy-storchaka approved these changes

May 23, 2024

View reviewed changes

Copy link

Member

serhiy-storchaka left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

LGTM.

Lib/test/test_complex.py OutdatedShow resolvedHide resolved

Lib/test/test_complex.py Outdated

		# test recover of infs if numerator has infinities and denominator is finite
		self.assertComplexesAreIdentical(complex('(inf-infj)')/(1+0j),complex('(inf-infj)'))
		self.assertComplexesAreIdentical(complex('(inf+infj)')/(0.0+1j),complex('(inf-infj)'))
		self.assertComplexesAreIdentical(complex('(nan+infj)')/complex(21000,2-1000),complex('(inf+infj)'))

Copy link

Member

serhiy-storchakaMay 23, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I wonder, how did you came to this test? Its result looks paradoxical at first glance, but makes sense after thinking about it more.

Copy link

ContributorAuthor

skirpichevMay 23, 2024•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

That was just a coverage test, that triggers underflow inratio. The hard part, as you noted, was to figure out why it's true. I've just checked the exact answer in the Diofant (probably will work in the SymPy too), e.g. for the real part:

In [43]: x = Symbol('x', extended_real=True)In [44]: y = Symbol('y', positive=True, finite=True)In [45]: expr = re(expand((x + I*oo)*(y-I/y)))In [46]: expr.subs({x: Dummy(real=True)})Out[46]: ∞In [47]: expr.subs({x: Dummy(positive=True)})Out[47]: ∞In [48]: limit(expr, x, -oo)Out[48]: ∞

BTW, Mathematica does more brave things, but hardly it's correct:

In[14]:= Hold[(x + I*Infinity)*(2^10-I/2^10)]  (* no assumptions on x ! *)                                 10    IOut[14]= Hold[(x + I Infinity) (2   - ---)]                                       10                                      2In[15]:= ReleaseHold[%]         (1 + 1048576 I) InfinityOut[15]= ------------------------           Sqrt[1099511627777]In[16]:= ReleaseHold[%14/.{x->1-I*Infinity}]Infinity::indet: Indeterminate expression -I Infinity + I Infinity encountered.Out[16]= Indeterminate

However, it looks I should restore remark about corner cases. Not all are handled by this code (have no chance, because one component computed as non-nan). Here is an example:

>>>complex('(1+infj)')/complex(2**1000,2**-1000)(nan+infj)

This is correctly handled by C code both for gcc and clang, but it turns they use a different algorithm for tdiv (not Smith's method, rather something like Priest's algorithm; see full _Cdivd() code before handling nan+nanj case). Maybe it worth to adopt this, but probably it's out of the scope of this pr.

Edit:
An example, where current algorithm fails (bad imaginary part) is$(2^{1023}+i 2^{-1023})/(2^{677}+i 2^{-677})\approx 2^{346}-i 2^{-1008}$ (taken from M.Baudin: "A Robust Complex Division in Scilab"). gcc v12.2 (and, probably, clang), that follows to example in C standard, does produce the correct answer.

example code and "fixed" Smith's algorithm (uses extended precision to compute ratio)

#include<complex.h>#include<stdio.h>#include<math.h>/* CPython's version */complextdiv_smith(complexa,complexb){constdoubleabs_breal=fabs(creal(b));constdoubleabs_bimag=fabs(cimag(b));if (abs_breal >=abs_bimag) {if (abs_breal!=0.0) {constdoubleratio=cimag(b) /creal(b);constdoubledenom=creal(b)+cimag(b)*ratio;returnCMPLX((creal(a)+cimag(a)*ratio) /denom,                         (cimag(a)-creal(a)*ratio) /denom);        }    }elseif (abs_bimag >=abs_breal) {constdoubleratio=creal(b) /cimag(b);constdoubledenom=creal(b)*ratio+cimag(b);returnCMPLX((creal(a)*ratio+cimag(a)) /denom,                     (cimag(a)*ratio-creal(a)) /denom);    }returnCMPLX(NAN,NAN);}complextdiv_smith2(complexa,complexb){constdoubleabs_breal=fabs(creal(b));constdoubleabs_bimag=fabs(cimag(b));if (abs_breal >=abs_bimag) {if (abs_breal!=0.0) {constlongdoubleratio= (longdouble)cimag(b) / (longdouble)creal(b);constdoubledenom=creal(b)+cimag(b)*ratio;returnCMPLX((creal(a)+cimag(a)*ratio) /denom,                         (cimag(a)-creal(a)*ratio) /denom);        }    }elseif (abs_bimag >=abs_breal) {constlongdoubleratio= (longdouble)creal(b) / (longdouble)cimag(b);constdoubledenom=creal(b)*ratio+cimag(b);returnCMPLX((creal(a)*ratio+cimag(a)) /denom,                     (cimag(a)*ratio-creal(a)) /denom);    }returnCMPLX(NAN,NAN);}intmain(){complexa=CMPLX(0x1p+1023,0x1p-1023);complexb=CMPLX(0x1p677,0x1p-677);complexz;z=a /b;printf("%e%+ei\n",creal(z),cimag(z));z=tdiv_smith(a,b);printf("%e%+ei\n",creal(z),cimag(z));z=tdiv_smith2(a,b);printf("%e%+ei\n",creal(z),cimag(z));return0;}

$ cc -std=c11 a.c -lm && ./a.out1.433437e+104-3.645561e-304i1.433437e+104+0.000000e+00i1.433437e+104-3.645561e-304i

Copy link

Member

mdickinsonMay 27, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

It's quite odd to see NaNs disappearing in an arithmetic operation on operands that include NaNs. However, that seems unavoidable if we want to follow the C standard Annex G semantics, since its rules imply thatcomplex(inf, nan) should be regarded as a complex infinity, and then the division of a finite complex number by a complex infinity should be a complex zero (so can't have any NaN components).

Copy link

ContributorAuthor

skirpichevMay 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

It's quite odd to see NaNs disappearing in an arithmetic operation on operands that include NaNs.

I think that could be reasonably interpreted only as shown above, i.e.nan in components means "anything from the extended real line".

since its rules imply that complex(inf, nan) should be regarded as a complex infinity

I don't think there is such thing in the C standard. It has two types of infinities: when one component is finite, nonzero and other is infinite; other type (with few cases) is a directed infinity (phase is finite and absolute value is infinite). I.e.(1+infj) and(inf+infj). That brings an interesting clash with CAS like Mathematica, that has no notion like the first type if infinities. E.g.ArcSin[Infinity] is just-I Infinity c.f.cmath.asin(). Seempmath/mpmath#793

But CPython already adopts Annex G semantics in this respect, viacmath.isinf() andcmath.isfinite().

@mdickinson, what do you think about changing Smith's algorithm to something more closer to C standard?

Copy link

Member

mdickinsonJun 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

@skirpichev

I don't think there is such thing in the C standard.

Take a look at Annex G. Quoting from §G.3 "Conventions" in C99, paragraph 1:

A complex or imaginary value with at least one infinite part is regarded as an infinity (even if its other part is a NaN).

The language is slightly changed in the most recent publicly available draft of C23 (n3096.pdf):

A complex or imaginary value with at least one infinite part is regarded as an infinity (even if its other part is a quiet NaN).

bedevere-appbot added awaiting merge and removed awaiting review labels

May 23, 2024

Tests reformatted

fc3f91d

serhiy-storchaka approved these changes

May 23, 2024

View reviewed changes

serhiy-storchaka requested a review frommdickinson

May 23, 2024 16:58

mdickinson reviewed

May 27, 2024

View reviewed changes

Copy link

Member

mdickinson left a comment•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Changes LGTM, and the new addition looks clean. This brings us in line with C's Annex G and the explicit requirements in §G.5.1p4 (n3096.pdf).

I'm a bit worried about backwards compatibility. This is admittedly a corner case, and I agree that the new semantics are better than the old ones, but it's also changing behaviour that's been there for decades.

I'm not too worried that there's "real" code out there whose mathematical correctness relies on the old division semantics - that seems unlikely. But it does seem likely that this change could break code that's relying on reproducing the Python semantics. The most obvious example would be the behaviour of the NumPy complex types - the NumPy folks probably have tests that ensure thatcomplex128 behaves the same ascomplex in corner cases.

It may be worth checking the NumPy behaviour here - ifcomplex128 currently has the same behaviour as Python'scomplex in these corner cases, then we could open an issue on the NumPy tracker alerting them to the upcoming change and giving them the chance to complain.

Objects/complexobject.cShow resolvedHide resolved

Objects/complexobject.c OutdatedShow resolvedHide resolved

skirpichev added2 commits

May 27, 2024 18:07

Address review: use Py_IS_* macros

99669a1

Merge branch 'main' into fix-complex-tdiv-119372

d4718bc

skirpichev mentioned this pull request

May 27, 2024

Deprecate Py_IS_NAN/INFINITY/FINITE?#119613

Closed

Copy link

Member

picnixz commentedMay 28, 2024

My 2 cents:

the NumPy folks probably have tests that ensure that complex128 behaves the same as complex in corner cases.

Here are some relevant places where the complex128 tests take place:

https://github.com/numpy/numpy/blob/main/numpy/_core/tests/test_umath.py#L632

(+ below and above code)

Most of the basic operations tests are inhttps://github.com/numpy/numpy/blob/main/numpy/_core/tests/test_umath.py but the corner cases being tested are not as exhaustive as here.

Technically, if the corner cases handling changes, then NumPy (by default) would just raise errors. However, with different error handler flags, you indeed might have breaking changes where a user code would incorrectly check that something is nan but not check that something is inf (or vice-versa). I think the more plausible breaking case is(1+1j)/(cmath.inf+cmath.infj) which currently givesnan+nanj instead of 0.

Merge branch 'main' into fix-complex-tdiv-119372

b1b3e76

vstinner mentioned this pull request

May 28, 2024

gh-119613: use C99+ functions instead of Py_IS_NAN/INFINITY/FINITE#119619

Merged

Copy link

ContributorAuthor

skirpichev commentedMay 28, 2024

@mdickinson, I'm not sure that numpy people are trying to reproduce CPython complex behaviour, here is the code:
https://github.com/numpy/numpy/blob/012e63d8b04636bc8e0a7b05312e5a8cc985f727/numpy/_core/src/npymath/npy_math_complex.c.src#L92-L120
Yes, they are using Smith's algorithm. But that's all. In some "corner cases" behaviour already different:

>>> np.__version__'1.24.4'>>> np.complex128(1)/np.complex128(0)<stdin>:1: RuntimeWarning: divide by zero encountered in scalar divide<stdin>:1: RuntimeWarning: invalid value encountered in scalar divide(inf+nanj)

we could open an issue on the NumPy tracker alerting them to the upcoming change and giving them the chance to complain.

FYI:numpy/numpy#26560

skirpichev added2 commits

May 30, 2024 18:09

Merge branch 'master' into fix-complex-tdiv-119372

f6f5510

Partially revert99669a1(don't use Py_IS_* macros)

1c9ccc4

skirpichev mentioned this pull request

Jun 2, 2024

BUG: fix corner cases in complex division (were nan+nanj)numpy/numpy#26595

Closed

Copy link

ContributorAuthor

skirpichev commentedJun 18, 2024

@serhiy-storchaka, it seems Mark OK with this change. We have numpy issue with pr, do we need something else?

Merge branch 'main' into fix-complex-tdiv-119372

6e17df6

serhiy-storchaka merged commit2cb84b1 intopython:main

Jun 29, 2024

bedevere-appbot removed the awaiting merge label

Jun 29, 2024

skirpichev deleted the fix-complex-tdiv-119372 branch

June 29, 2024 08:34

mrahtz pushed a commit to mrahtz/cpython that referenced this pull request

Jun 30, 2024

pythongh-119372: Recover inf's and zeros in _Py_c_quot (pythonGH-119457)

36994b1

In some cases, previously computed as (nan+nanj), we couldrecover meaningful component values in the result, seee.g. the C11, Annex G.5.2, routine _Cdivd().

noahbkim pushed a commit to hudson-trading/cpython that referenced this pull request

Jul 11, 2024

pythongh-119372: Recover inf's and zeros in _Py_c_quot (pythonGH-119457)

1c23bc3

In some cases, previously computed as (nan+nanj), we couldrecover meaningful component values in the result, seee.g. the C11, Annex G.5.2, routine _Cdivd().

estyxx pushed a commit to estyxx/cpython that referenced this pull request

Jul 17, 2024

pythongh-119372: Recover inf's and zeros in _Py_c_quot (pythonGH-119457)

b92b619

In some cases, previously computed as (nan+nanj), we couldrecover meaningful component values in the result, seee.g. the C11, Annex G.5.2, routine _Cdivd().

Labels

None yet

Movatterモバイル変換

Uh oh!

gh-119372: recover inf's and zeros in _Py_c_quot#119457

gh-119372: recover inf's and zeros in _Py_c_quot#119457

Uh oh!

Conversation

skirpichev commentedMay 23, 2024• edited by bedevere-appbotLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

serhiy-storchaka left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

serhiy-storchakaMay 23, 2024

Choose a reason for hiding this comment

Uh oh!

skirpichevMay 23, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mdickinsonMay 27, 2024

Choose a reason for hiding this comment

Uh oh!

skirpichevMay 29, 2024

Choose a reason for hiding this comment

Uh oh!

mdickinsonJun 29, 2024

Choose a reason for hiding this comment

Uh oh!

mdickinson left a comment• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

picnixz commentedMay 28, 2024

Uh oh!

skirpichev commentedMay 28, 2024

Uh oh!

skirpichev commentedJun 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

skirpichev commentedMay 23, 2024•
edited by bedevere-appbot
Loading

skirpichevMay 23, 2024•
edited
Loading

mdickinson left a comment•
edited
Loading