NotificationsYou must be signed in to change notification settings
Fork8.1k
Star22.1k

ticker.py: always use np.round?#27609

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Draft

doronbehar wants to merge4 commits intomatplotlib:main

base:main

Choose a base branch

fromdoronbehar:ticker-round

Draft

ticker.py: always use np.round?#27609

doronbehar wants to merge4 commits intomatplotlib:mainfromdoronbehar:ticker-round

Conversation

Copy link

Contributor

doronbehar commentedJan 7, 2024•
edited
Loading

PR summary

The error I experienced initially as described below, can be reproduced with:

importmatplotlib.pyplotaspltfig,ax=plt.subplots()ax.plot([1,2], [1,1e300])ax.set_yscale("log")plt.show()

The PR is currently a draft because I don't know how to best solve the issue. The last commit adds a test with some notes on the peculiarities of it. Ideas are welcome.

I still think that switching frommath tonumpy implementations of allround,isclose and mathematical functions should be a good idea, but I also understand the consideration of "if it ain't broken don't fix it" that recommends to keep the changes to a minimum.

Original observation of the error:

I encountered the following error when using a logarithmicyscale in a graph:

Traceback (most recent call last):  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/backends/backend_gtk3.py", line 277, in idle_draw    self.draw()  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/backends/backend_agg.py", line 388, in draw    self.figure.draw(self.renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 95, in draw_wrapper    result = draw(artist, renderer, *args, **kwargs)             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/figure.py", line 3154, in draw    mimage._draw_list_compositing_images(  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/image.py", line 132, in _draw_list_compositing_images    a.draw(renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axes/_base.py", line 3070, in draw    mimage._draw_list_compositing_images(  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/image.py", line 132, in _draw_list_compositing_images    a.draw(renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axis.py", line 1387, in draw    ticks_to_draw = self._update_ticks()                    ^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axis.py", line 1276, in _update_ticks    major_labels = self.major.formatter.format_ticks(major_locs)                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 216, in format_ticks    return [self(value, i) for i, value in enumerate(values)]           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 216, in <listcomp>    return [self(value, i) for i, value in enumerate(values)]            ^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 1066, in __call__    is_x_decade = _is_close_to_int(fx)                  ^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 2239, in _is_close_to_int    return math.isclose(x, round(x))                           ^^^^^^^^OverflowError: cannot convert float infinity to integer

Where the above is printed repeatedly, for ever, and the GUI is stuck. With this patch applied, I get this graph in a logarithmic scale:

The orange line goes upwards to huge values - which proves that the fit I tried to perform was wrong, but still matplotlib shouldn't almost crash due to such scientific error. Without this patch, I managed to fix one instance of the above error, using the following code:

import matplotlib.ticker as tickerimport mathdef my_is_close_to_int(x):    return math.isclose(x, np.round(x))ticker._is_close_to_int = my_is_close_to_int

But then I encountered this error:

Traceback (most recent call last):  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/backends/backend_gtk3.py", line 277, in idle_draw    self.draw()  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/backends/backend_agg.py", line 388, in draw    self.figure.draw(self.renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 95, in draw_wrapper    result = draw(artist, renderer, *args, **kwargs)             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/figure.py", line 3154, in draw    mimage._draw_list_compositing_images(  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/image.py", line 132, in _draw_list_compositing_images    a.draw(renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axes/_base.py", line 3070, in draw    mimage._draw_list_compositing_images(  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/image.py", line 132, in _draw_list_compositing_images    a.draw(renderer)  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/artist.py", line 72, in draw_wrapper    return draw(artist, renderer)           ^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axis.py", line 1387, in draw    ticks_to_draw = self._update_ticks()                    ^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/axis.py", line 1276, in _update_ticks    major_labels = self.major.formatter.format_ticks(major_locs)                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 216, in format_ticks    return [self(value, i) for i, value in enumerate(values)]           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 216, in <listcomp>    return [self(value, i) for i, value in enumerate(values)]            ^^^^^^^^^^^^^^  File "/nix/store/40yp1y2hd4l9pajlclp8nhzxpv5hksgm-python3-3.11.6-env/lib/python3.11/site-packages/matplotlib/ticker.py", line 1067, in __call__    exponent = round(fx) if is_x_decade else np.floor(fx)               ^^^^^^^^^OverflowError: cannot convert float infinity to integer

So I decided to discuss with you about the simple suggestion of replacing all of Python's builtin usages ofround, withnumpy's implementation, that can handleNaN orInf easily.

It was very hard to debug it, as I couldn't use even atry - except clause to catch the error. I'm sorry, but I also currently haven't yet generated a reproducible example, due to the structure that my data is currently held at. I'd like to hear what you think first before I'll write tests to this PR.

PR checklist

[N/A] "closes #0000" is in the body of the PR description tolink the related issue
new and changed code istested -It'd be nice to write a test that will prove the issue and that will be fixed in a commit afterwards.
[N/A]Plotting related features are demonstrated in anexample
[N/A]New Features andAPI Changes are noted with adirective and release note
[N/A] Documentation complies withgeneral anddocstring guidelines

github-actionsbot reviewed

Jan 7, 2024

View reviewed changes

Copy link

github-actionsbot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Thank you for opening your first PR into Matplotlib!

If you have not heard from us in a week or so, please leave a new comment below and that should bring it to our attention. Most of our reviewers are volunteers and sometimes things fall through the cracks.

You can also join uson gitter for real-time discussion.

For details on testing, writing docs, and our review process, please seethe developer guide

We strive to be a welcoming and open project. Please follow ourCode of Conduct.

Copy link

Member

oscargus commentedJan 7, 2024

Thanks, I think that this makes sense! As seen, we usenp.floor, just afterround in the second failure. Although it may just be a coincidence.

Can you please add a test for this? I think a smoke test, where you basically plot something (simple) that currently triggers the error should be enough. With a comment that it is a smoke test and point to this PR for motivation. In this way we can make sure that the error doesn't return later if someone thinks "well, I can replace that np.round with round"...

One can of course discuss if actually all round should be np.round, like now, or if one should read the code a bit more careful and see if any can stay. On the other hand, one can also question if round is actually that much better for any reason? It is not so obvious from the code why sometimes round is chosen and sometimes np.round (except when we know that there is a vector involved).

story645 added the topic: ticks axis labels label

Jan 11, 2024

dstansby added the status: needs revision label

Jan 23, 2024

Copy link

ContributorAuthor

doronbehar commentedApr 16, 2024

Hey so I tried to reproduce the issue I experienced, with somenp.inf data, and I didn't succeed. I also failed to retrieve the data that I had back then. I still think it is worth discussing with other developers, but I will leave my effort on this for now.

Copy link

Member

tacaswell commentedApr 16, 2024

In principle I agree this makes sense, but there might be some other changes needed (likenp.round returns floats where asround returns ints so we might need extra casts).

You can reproduce this with

importmatplotlib.pyplotaspltplt.ion()fig,ax=plt.subplots()importnumpyasnpax.loglog(np.logspace(0,10,10),np.logspace(0,10,10)**2)ax.set_yticks([1,10,np.inf])

which suggest that we should be doing better validation on the way intoFixedLocator. Put another way, if we are gettingnp.inf here then there is very likely a bug (or user error) someplace else that we should deal with instead.

Copy link

ContributorAuthor

doronbehar commentedApr 16, 2024

Wow thanks for creating this code example! I will put something similar in the tests I will hopefully write after discussing a solution.

Put another way, if we are gettingnp.inf here then there is very likely a bug (or user error) someplace else that we should deal with instead.

Definitely - havingnp.inf in your data is not scientific :), and perhaps a good explanatory warning should be printed when this is attempted. The minimum expectation is thatmatplotlib won't crash when it encounters such a situation.

Copy link

Member

tacaswell commentedApr 18, 2024

Havingnp.inf in thedata should be OK (we take care of this a bunch of places), it is asking for a tick atnp.inf that is the problem.

doronbehar force-pushed theticker-round branch from86b1aad to1ae9888Compare

May 29, 2024 13:22

github-actionsbot removed the topic: ticks axis labels label

May 29, 2024

doronbehar force-pushed theticker-round branch from1ae9888 to76edbd5Compare

May 29, 2024 13:23

Copy link

ContributorAuthor

doronbehar commentedMay 29, 2024

OK I have an issue with running the tests locally, so I hope it will be OK with you that I will use the PR's CI to do that. Now I pushed a new test, without the fix yet, expecting to see the only test I added failing, and I will push a fix afterwards, expecting the issue to be fixed with it.

doronbehar force-pushed theticker-round branch from76edbd5 to896afeaCompare

May 29, 2024 13:43

Copy link

ContributorAuthor

doronbehar commentedMay 29, 2024

OK tests failures from CI:

=================================== FAILURES ===================================_____________________________ test_yticks_with_inf _____________________________[gw2] darwin -- Python 3.12.3 /Library/Frameworks/Python.framework/Versions/3.12/bin/python    def test_yticks_with_inf():        fig, ax = plt.subplots()        ax.loglog(np.logspace(0, 10, 10), np.logspace(0, 10, 10)**2)        ax.set_yticks([1, 10, np.inf])>       fig.draw_without_rendering()lib/matplotlib/tests/test_ticker.py:1837: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ lib/matplotlib/figure.py:3171: in draw_without_rendering    self.draw(renderer)lib/matplotlib/artist.py:95: in draw_wrapper    result = draw(artist, renderer, *args, **kwargs)lib/matplotlib/artist.py:72: in draw_wrapper    return draw(artist, renderer)lib/matplotlib/figure.py:3155: in draw    mimage._draw_list_compositing_images(lib/matplotlib/image.py:132: in _draw_list_compositing_images    a.draw(renderer)lib/matplotlib/artist.py:72: in draw_wrapper    return draw(artist, renderer)lib/matplotlib/axes/_base.py:3113: in draw    mimage._draw_list_compositing_images(lib/matplotlib/image.py:132: in _draw_list_compositing_images    a.draw(renderer)lib/matplotlib/artist.py:72: in draw_wrapper    return draw(artist, renderer)lib/matplotlib/axis.py:1422: in draw    ticks_to_draw = self._update_ticks()lib/matplotlib/axis.py:1300: in _update_ticks    major_labels = self.major.formatter.format_ticks(major_locs)lib/matplotlib/ticker.py:217: in format_ticks    return [self(value, i) for i, value in enumerate(values)]lib/matplotlib/ticker.py:1088: in __call__    is_x_decade = _is_close_to_int(fx)_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ x = inf    def _is_close_to_int(x):>       return math.isclose(x, round(x))E       OverflowError: cannot convert float infinity to integerlib/matplotlib/ticker.py:2265: OverflowError------------------------------ Captured log call -------------------------------

doronbehar force-pushed theticker-round branch from896afea tod63c964Compare

May 29, 2024 14:01

github-actionsbot added the topic: ticks axis labels label

May 29, 2024

Copy link

ContributorAuthor

doronbehar commentedMay 29, 2024

Now pushed the suggested fix. Hopefully CI will be green 🤞 .

doronbehar force-pushed theticker-round branch 7 times, most recently from8188964 to7f6af99Compare

May 29, 2024 17:12

Copy link

ContributorAuthor

doronbehar commentedMay 29, 2024

Aish there are more errors then I anticipated... What if I will replace allmath. functions with theirnumpy equivalent? Not onlynp.round.. Would that be accepted?

doronbehar marked this pull request as ready for review

May 31, 2024 01:27

Copy link

ContributorAuthor

doronbehar commentedMay 31, 2024

🎉 Managed to finally fix the new test I added (took me a while because ofthis). CI is green now. Your review is welcome.

Copy link

ContributorAuthor

doronbehar commentedJul 3, 2024

BTW@tacaswell CI is green here as well for a while now :).

tacaswell reviewed

Jul 3, 2024

View reviewed changes

lib/matplotlib/ticker.py OutdatedShow resolvedHide resolved

tacaswell reviewed

Jul 3, 2024

View reviewed changes

lib/matplotlib/ticker.pyShow resolvedHide resolved

doronbehar force-pushed theticker-round branch frome87b5f6 to1679a7fCompare

July 17, 2024 14:40

tacaswell added this to thev3.10.0 milestone

Jul 18, 2024

tacaswell removed the status: needs revision label

Jul 18, 2024

Copy link

Member

tacaswell commentedJul 18, 2024

Can you try rebasing to see if we have a fix for the appevoyer failure on main?

Copy link

Member

tacaswell commentedJul 18, 2024

Does this only letnp.inf pass through as a value or does it also sort out/fix why a locator suggested putting a tick at inf?

doronbehar force-pushed theticker-round branch from1679a7f to1c60bf8Compare

July 18, 2024 08:05

Copy link

ContributorAuthor

doronbehar commentedJul 18, 2024

Does this only letnp.inf pass through as a value or does it also sort out/fix why a locator suggested putting a tick at inf?

I'm not sure I understood the 2nd part - what do you mean by "tick at inf"? If you are asking in general about the motivation for the PR, it is a bit hard for me to find the most accurate words to explain it, but for sure it fixes the test added in the 1st commit, and I believe it fixes many other potential issues with nan and infinite values.

timhoffm reviewed

Oct 29, 2024

View reviewed changes

Copy link

Member

timhoffm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'm a bit hesitant to change all the math <-> np calls. This may have surprising side-effects. How much of this is to handleinf inset_yticks? I'd rather error there and keep other changes minimal.

lib/matplotlib/tests/test_ticker.py Outdated

		deftest_yticks_with_inf():
		fig,ax=plt.subplots()
		ax.loglog(np.logspace(0,10,10),np.logspace(0,10,10)**2)
		ax.set_yticks([1,10,np.inf])

Copy link

Member

timhoffmOct 29, 2024•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

IMHO we should raise if asked to put a tick at infinity.

That check should be inFixedLocator.__init__.

Copy link

ContributorAuthor

doronbehar commentedOct 30, 2024

I'm a bit hesitant to change all the math <-> np calls. This may have surprising side-effects. How much of this is to handleinf inset_yticks? I'd rather error there and keep other changes minimal.

In the original code that produced the issue I encountered and tried to fix here, I didn't explicitly usedset_yticks with annp.inf in the argument, something internal inticker.py made it try to set these yticks for the set of y values I had in my code. It is indeed unfortunate that I am not able now to get a list of such y values, but I can imagine it might happen given the usage oflog to figure out where to put yticks.

Copy link

Member

timhoffm commentedOct 30, 2024

Are you able to produce a minimal example of the issue we are trying to fix here? Without that, we're basically blind:

we cannot judge whether the PR is the appropriate fix or whether another approach would be better
we cannot add tests to ensure that the fixed behavior is not unintentionally reverted through future code changes

The proposed math <-> np change, is substantial (in particular because the changes are subtle). We'd need a very informed decision to go that direction. One prerequisite is that we exactly know what problem we are fixing with that.

Copy link

ContributorAuthor

doronbehar commentedOct 30, 2024

Are you able to produce a minimal example of the issue we are trying to fix here?

I agree that the test I added here (suggested by@tacaswell inthis comment) might not be very satisfying because it is a bit superficial, so I'm happy to report I did manage now to create a less superficial MWE :):

importmatplotlib.pyplotaspltfig,ax=plt.subplots()ax.plot([1,2], [1,1e261])ax.set_yscale("log")plt.show()

Givesmath.isclose(x, round(x)) error right? Note how if you change the261 to260 you get no error. Another interesting thing to try is the following:

#!/usr/bin/env pythonimportnumpyasnpimportmathforoominrange(300,4400):x=np.float128(10**oom)print("oom(x) = ",oom,np.log(x),math.log(x))

The above prints inf in the last word afteroom = 309, andnp.float128 fails afteroom = 4300.

If I usenp.float64 (Numpy's default float), it fails whenoom = 309, exactly like the Python nativefloat function.

Without that, we're basically blind:
we cannot judge whether the PR is the appropriate fix or whether another approach would be better
we cannot add tests to ensure that the fixed behavior is not unintentionally reverted through future code changes
The proposed math <-> np change, is substantial (in particular because the changes are subtle). We'd need a very informed decision to go that direction. One prerequisite is that we exactly know what problem we are fixing with that.

I agree overall with the development workflow you try to impose, but I find it a bit peculiar to justify the current state in whichnumpy/math functions are chosen arbitrarily (at least it seems like so). The PR simply suggests to be consistent, which is something I thought would be accepted even without a test, simply because it makes sense.

I can replace the test I added with the above MWE and verify that the PR fixes the test, but I want to know first what is the direction you'd like to take this now that I managed to obtain further clues as for the bug I experienced.

Copy link

Member

timhoffm commentedOct 30, 2024•
edited
Loading

but I find it a bit peculiar to justify the current state in which numpy/math functions are chosen arbitrarily (at least it seems like so). The PR simply suggests to be consistent, which is something I thought would be accepted even without a test, simply because it makes sense.

Unfortunately, it's not that simple:

There's no clear right or wrong for using math vs. numpy. The behavior is subtly different (which you actually rely on for making your case work). But we have no insight whether the changed behavior has undesired effects in other cases.
Therefore, I'd try to minimize changes (if it ain't broken, don't fix it), which needs a clear insight why the current code is not suitable. -> Understand the problem and exactly solve that.

I can replace the test I added with the above MWE and verify that the PR fixes the test, but I want to know first what is the direction you'd like to take this now that I managed to obtain further clues as for the bug I experienced.

The next step is to follow the MWE code through a debugger and find out where the infinity comes in. There must be a place where 1e261 is turned into infinity, and most likely that's the code we have to fix.

Edit: Part of the issue is that we add whitespace around the data (aka.margin) so that data points are not at the edge of the Axes. On a log scale, that can be a significant numerical amount. To take that effect out, change the example to:

import matplotlib.pyplot as pltfig, ax = plt.subplots()ax.plot([1,2], [1, 1e269])ax.set_yscale("log")ax.margins(0)plt.show()

which still works, but will fail for 1e270. Or even better, don't bother with data but isolate the topics to the axis limits

import matplotlib.pyplot as pltfig, ax = plt.subplots()ax.set_ylim(1, 1e270)  # works with 1e269ax.set_yscale("log")plt.show()

doronbehar added3 commits

October 30, 2024 17:30

Add anologous tests for LogFormatter_call_big

b1bcc98

test_ticker: add a parameter set for np.inf in LogFormatterMathtext

b62f5fc

ticker.py: fix possible np.inf issue with LogFormatter{,Mathtext}

4a59114

doronbehar force-pushed theticker-round branch from1c60bf8 to4f8dc92Compare

October 30, 2024 16:29

doronbehar added a commit to doronbehar/matplotlib that referenced this pull request

Oct 30, 2024

Add test for LogFormatter with very big numbers

4f8dc92

Reported atmatplotlib#27609 . Values< 1e300 that trigger this test failure might be found, but it should bea bit better not to be on the edge of this issue.

Copy link

ContributorAuthor

doronbehar commentedOct 30, 2024

The next step is to follow the MWE code through a debugger and find out where the infinity comes in. There must be a place where 1e261 is turned into infinity, and most likely that's the code we have to fix.

OK, that's sort of what I did in the beginning, although I didn't have such a good MWE. I also described that process in the very first comment of this PR. In the meantime I reordered the commits a bit, and added a test that currently fails, as I have not yet found an optimal solution for it. Your view on this now is most welcome.

doronbehar marked this pull request as draft

October 30, 2024 16:30

doronbehar changed the title~~ticker.py: always use np.round~~ticker.py: always use np.round?

Oct 30, 2024

Add test for LogFormatter with very big numbers

2357bef

Reported atmatplotlib#27609 . Values< 1e300 that trigger this test failure might be found, but it should bea bit better not to be on the edge of this issue.

doronbehar force-pushed theticker-round branch from4f8dc92 to2357befCompare

October 30, 2024 16:38

tacaswell modified the milestones:v3.10.0,v3.11.0

Oct 30, 2024

Copy link

Member

tacaswell commentedOct 30, 2024

Given the recent discussion I'm going to move this to the 3.11 milestone.

Copy link

Member

timhoffm commentedOct 30, 2024•
edited
Loading

The actual MWE is

from matplotlib.ticker import LogLocatorLogLocator().tick_values(0, 1e270)

and the relevant code:

matplotlib/lib/matplotlib/ticker.py

Lines 2463 to 2464 in4468488

	decades=np.arange(math.floor(log_vmin)-stride,
	math.ceil(log_vmax)+2*stride,stride)

matplotlib/lib/matplotlib/ticker.py

Line 2473 in4468488

ticklocs=b**decades

decades is[-31 0 31 62 93 124 155 186 217 248 279 310] and withb=10.0 we end up on
ticklocs[1.e-031 1.e+000 1.e+031 1.e+062 1.e+093 1.e+124 1.e+155 1.e+186 1.e+217 1.e+248 1.e+279 inf]

with numpy 10.0 ** 310 == inf (pure python throws an OverflowError instead). Note that the largest number you can represent with 64-bit floats is ~ 1.8e308.

A tick locator should never place ticks at inf. The proper solution to this problem is to tune the tick location algorithm inLogLocator.tick_values().

Copy link

ContributorAuthor

doronbehar commentedOct 30, 2024

The actual MWE is
from matplotlib.ticker import LogLocatorLogLocator().tick_values(0, 1e270)
and the relevant code:
matplotlib/lib/matplotlib/ticker.py
Lines 2463 to 2464 in4468488
decades=np.arange(math.floor(log_vmin)-stride,
math.ceil(log_vmax)+2*stride,stride)
matplotlib/lib/matplotlib/ticker.py
Line 2473 in4468488
ticklocs=b**decades
decades is[-31 0 31 62 93 124 155 186 217 248 279 310] and withb=10.0 we end up onticklocs[1.e-031 1.e+000 1.e+031 1.e+062 1.e+093 1.e+124 1.e+155 1.e+186 1.e+217 1.e+248 1.e+279 inf]
with numpy 10.0 ** 310 == inf (pure python throws an OverflowError instead). Note that the largest number you can represent with 64-bit floats is ~ 1.8e308.
A tick locator should never place ticks at inf. The proper solution to this problem is to tune the tick location algorithm inLogLocator.tick_values().

That's true, and it'd indeed be nice to start with writing such a test specifically forLogLocator, but even if that would be fixed, it is not clear to me what would be the next step. I tried:

diff --git i/lib/matplotlib/ticker.py w/lib/matplotlib/ticker.pyindex 63a519f793..74cfac52e3 100644--- i/lib/matplotlib/ticker.py+++ w/lib/matplotlib/ticker.py@@ -2401,7 +2401,7 @@ class LogLocator(Locator):         have_subs = len(subs) > 1 or (len(subs) == 1 and subs[0] != 1.0)          decades = np.arange(math.floor(log_vmin) - stride,-                            math.ceil(log_vmax) + 2 * stride, stride)+                            math.ceil(log_vmax) + 2 * stride, stride, dtype=np.float128)          if have_subs:             if stride == 1:

But then I encountered an issue inLogFormatterSciNotation, here:

matplotlib/lib/matplotlib/ticker.py

Line 1101 in51e7230

fx=math.log(x)/math.log(b)

Which makesfx beinf. That is also fixable by:

diff --git i/lib/matplotlib/ticker.py w/lib/matplotlib/ticker.pyindex 74cfac52e3..5f3222fc1f 100644--- i/lib/matplotlib/ticker.py+++ w/lib/matplotlib/ticker.py@@ -1088,7 +1088,7 @@ class LogFormatterMathtext(LogFormatter):         b = self._base          # only label the decades-        fx = math.log(x) / math.log(b)+        fx = np.log(x) / np.log(b)         is_x_decade = _is_close_to_int(fx)         exponent = round(fx) if is_x_decade else np.floor(fx)         coeff = round(b ** (fx - exponent))

And then the originaltest_LogFormatter_almost_inf passes. However, it is not that satisfying, as I'm not capable of explaining up to what number willnp.float128 be good enough. And also equally importantly, I don't know how to testLogLocator'sdecades calculation without changing too many inner parts there. Not only that, if I use1e400 intest_LogFormatter_almost_inf, it won't fail but because 1e400 is already treated asinf, and it shouldn't, althoughnp.float128 is used whendecades are computed...

This rabbit hole is what made me use numpy's functions all the time, as they seem a bit more accurate almost all of the times. If we'll after all still go in such approach, it'd should be a good idea to use float128 all the time.

Copy link

Member

timhoffm commentedOct 30, 2024

Rewriting for float128 would be a non-trivial medium-sized task. Most users don't use float 128 anyway and introducing it only to have larger ticklabels feels overkill - or at least a completely separate discussion with much larger impact.

I'd rather try and manipulate the ticker so that it does not produce tick label positions beyond 1e308. The very naive approach would be to clip the nan position. One has to try whether that looks ok. Otherwise, you'd need to come up with a better decades algorithm for the edge case, e.g. choose something between vmax and 3e8 as the top tick position and a suitable spacing going down in regular decades.

Copy link

ContributorAuthor

doronbehar commentedOct 31, 2024

Apparently the maximal float size is actually1.7976931348623157e+308, and can be computed like this:

importsysimportnumpyasnpprint(np.finfo(np.float64).max)print(sys.float_info.max)

Rewriting for float128 would be a non-trivial medium-sized task. Most users don't use float 128 anyway and introducing it only to have larger ticklabels feels overkill - or at least a completely separate discussion with much larger impact.

I think, that the natural thing to do would be to inherit thedtype (float64 orfloat128) of the user's input. If they usefloat128 because they know they have very large floats, we shouldn't disallow them to do so. If they use integers OTH (which can grow infinitely large), we should disallow them to use integers bigger then the maximum offloat128. Perhaps some of this behavior should be handled at the level ofLocator.

Indeed that's a non-trivial task though, so I'm not sure I'd be able to invest time in it. TBH, the issue I encountered here is not an active issue for me anymore. I just kept giving this attention because I want to contribute back to Matplotlib. I think that the 3 commits prior to the last one are worth merging either way, and we can create a separate issue with all the details we have aggregated in the last few days.

I'd rather try and manipulate the ticker so that it does not produce tick label positions beyond 1e308.

I think that silently ignoring such inputs can be confusing for the users, so I'd raise an error much earlier.

Copy link

Member

timhoffm commentedOct 31, 2024

I think, that the natural thing to do would be to inherit the dtype (float64 or float128) of the user's input.

AFAIK, we don't have any guarantees on that, and it would be a massive undertaking. Basically, you'd have to make sure that all internal calculations do respect this. I don't think we can or should go there in the foreseeable future.

I'd rather try and manipulate the ticker so that it does not produce tick label positions beyond 1e308.
I think that silently ignoring such inputs can be confusing for the users, so I'd raise an error much earlier.

I'm not ignoring inputs. The inputs are the data range. The TickLocator tries to invent ticks at nice positions. And results incurrent ticks in below image. The naive solution would be to clip the invented right-most tick. The axis limits would still cover the whole data. The proposed alternative dcades algorithm could try to find the solution foralternative ticks.

Labels

topic: ticks axis labels

Movatterモバイル変換

Uh oh!

ticker.py: always use np.round?#27609

Are you sure you want to change the base?

ticker.py: always use np.round?#27609

Uh oh!

Conversation

doronbehar commentedJan 7, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

PR summary

PR checklist

Uh oh!

github-actionsbot left a comment

Choose a reason for hiding this comment

Uh oh!

oscargus commentedJan 7, 2024

Uh oh!

doronbehar commentedApr 16, 2024

Uh oh!

tacaswell commentedApr 16, 2024

Uh oh!

doronbehar commentedApr 16, 2024

Uh oh!

tacaswell commentedApr 18, 2024

Uh oh!

doronbehar commentedMay 29, 2024

Uh oh!

doronbehar commentedMay 29, 2024

Uh oh!

doronbehar commentedMay 29, 2024

Uh oh!

doronbehar commentedMay 29, 2024

Uh oh!

doronbehar commentedMay 31, 2024

Uh oh!

doronbehar commentedJul 3, 2024

Uh oh!

Uh oh!

Uh oh!

tacaswell commentedJul 18, 2024

Uh oh!

tacaswell commentedJul 18, 2024

Uh oh!

doronbehar commentedJul 18, 2024

Uh oh!

timhoffm left a comment

Choose a reason for hiding this comment

Uh oh!

timhoffmOct 29, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

doronbehar commentedOct 30, 2024

Uh oh!

timhoffm commentedOct 30, 2024

Uh oh!

doronbehar commentedOct 30, 2024

Uh oh!

timhoffm commentedOct 30, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

doronbehar commentedOct 30, 2024

Uh oh!

tacaswell commentedOct 30, 2024

Uh oh!

timhoffm commentedOct 30, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

doronbehar commentedOct 30, 2024

Uh oh!

timhoffm commentedOct 30, 2024

Uh oh!

doronbehar commentedOct 31, 2024

Uh oh!

timhoffm commentedOct 31, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

doronbehar commentedJan 7, 2024•
edited
Loading

timhoffmOct 29, 2024•
edited
Loading

timhoffm commentedOct 30, 2024•
edited
Loading

timhoffm commentedOct 30, 2024•
edited
Loading