lib/matplotlib/tests/test_axes.py Outdated

Comment on lines 1752 to 1753

		for x in [pd.Series([1, 2], dtype="float64"),
		pd.Series([1, 2], dtype="Float32")]:

Member

timhoffm Jan 9, 2022

Is this specifically for "Float32" or would the test also do with "Float64"? If it's only the custom type and precision does not matter, I'd go with "Float64" to communicate that we're primarily testing the custom pandas type.

Side-note: It seems the capital "Float" types are not yet documented. There's only a v1.2.0 change note and the GH issues linked therein.

Member Author

jklymak Jan 10, 2022

As far as I can tell it's undocumented. Maybe we should hold off on support, or maybe this PR can go in, but without the test?

I actually disagree with Pandas having a new type here - it seems people want this for pedantic reasons, but I think 99.999% of the world doesn't care if NaN means a failed computation or missing data. However, if they feel strongly, they should get numpy onboard, and then everyone will have this flag rather than making a new data type.

Member

tacaswell Jan 21, 2022

I think any of their capital F floats will work.

tacaswell added this to the v3.5.2 milestone

Jan 11, 2022

tacaswell reviewed

Jan 12, 2022

lib/matplotlib/cbook/__init__.py

		@@ -1649,7 +1650,7 @@ def index_of(y):
		The x and y values to plot.
		"""
		try:
		return y.index.values, y.values

Member

tacaswell Jan 12, 2022

Do we need both this change and the additional exception handling above?

Member Author

jklymak Jan 20, 2022

I don't know, but I think we should be in the habit of using to_numpy() when possible?

Member

tacaswell Jan 21, 2022

to_numpy came in with pandas 0.24 on Jan 25, 2019, so we can rely on it being there, and https://stackoverflow.com/a/54508052/380231 makes an argument in favor of to_numpy.

I think this change may be unrelated (or fixes this by chance), but I do not think it will avoid needing the other one, and it does no harm.

tacaswell previously approved these changes

Jan 21, 2022

timhoffm reviewed

lib/matplotlib/tests/test_axes.py Outdated

FIX: more holistic fix

509626d

jklymak linked an issue

[Bug]: possible regression with pandas 1.4 with plt.plot when using a single column dataframe as the x argument #22330

that may be closed by this pull request

Closed

jklymak mentioned this pull request

[Bug]: possible regression with pandas 1.4 with plt.plot when using a single column dataframe as the x argument #22330

Closed

Member Author

jklymak commented Jan 27, 2022

@tacaswell, thanks for your review, but dismissing as this approach is completely different, and attempts to remove the pandas-ness of the data right at the beginning.

jklymak dismissed tacaswell’s stale review

January 27, 2022 12:41

Approach now is completely different, so requires a re-review

jklymak commented

lib/matplotlib/cbook/__init__.py

jklymak added the Release critical label: For bugs that make the library unusable (segfaults, incorrect plots, etc) and major regressions.

tacaswell approved these changes

matplotlib/lib/matplotlib/axes/_base.py

Member

tacaswell left a comment

I have some concerns (mostly because any time we change complicated code we find that the complexity was there to handle some weird corner case that failed to get a test), but overall am 👍 on this as I think this will also fix other data containers. I think there is a risk that something subtle will break, but we are already dealing with something subtle breaking and I am optimistic that the cure will not be worse than the malady.

If you think of a DF as a 2D array with columns, then this is consistent with how we handle 2D arrays being passed into plot.
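The column-wise convention referred to above can be illustrated with NumPy alone. This is only a sketch of the idea that a 2D `y` is treated as one dataset per column; `split_columns` is a hypothetical name and is not matplotlib's code.

```python
import numpy as np


def split_columns(x, y):
    """Hypothetical illustration: pair x with each column of a 2D y."""
    x = np.atleast_1d(x)
    y = np.asarray(y)
    if y.ndim == 1:
        # Treat a 1D y as a single-column 2D array.
        y = y[:, np.newaxis]
    # One (x, column) dataset per column of y.
    return [(x, y[:, j]) for j in range(y.shape[1])]


x = np.arange(3)
y = np.arange(6).reshape(3, 2)  # three rows, two columns
datasets = split_columns(x, y)
```

A DataFrame converted with `to_numpy()` has the same (rows, columns) layout, so it would fall into the same column-wise path.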

Member Author

jklymak commented Jan 27, 2022

This gets used in _plot_args, usually via index_of (which adds a y if it is missing).

Lines 486 to 495 in f0593a6

	if len(xy) == 2:
	    x = _check_1d(xy[0])
	    y = _check_1d(xy[1])
	else:
	    x, y = index_of(xy[-1])

	if self.axes.xaxis is not None:
	    self.axes.xaxis.update_units(x)
	if self.axes.yaxis is not None:
	    self.axes.yaxis.update_units(y)

and it gets called before we check units. So I guess what I am proposing here would strip an object that has a to_numpy method but somehow was being used by somebody for units.

This does give me pause that we have maybe done the wrong thing here, and are using _check_1d to both coerce a singleton to an array and to convert pandas series and dataframes to arrays. Maybe the right thing to do is just return the DataFrame, run through the converter, and then coerce to numpy if it is not already.

Meh, it has done this conversion for a long time.
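The two jobs described above (stripping the container, then coercing a singleton to an array) can be sketched as follows. This is a minimal illustration under the assumption that any object exposing `to_numpy()` should be unwrapped; `check_1d_sketch` and `FakeFrame` are hypothetical names, and this is not the actual `_check_1d` from the PR.

```python
import numpy as np


def check_1d_sketch(x):
    """Hypothetical sketch, not matplotlib's _check_1d."""
    # Job 1: strip the container (pandas, xarray, ...) if it offers to_numpy().
    if hasattr(x, "to_numpy"):
        x = x.to_numpy()
    # Job 2: coerce scalars and sequences to an array with at least one dimension.
    return np.atleast_1d(x)


class FakeFrame:
    """Stand-in for a DataFrame-like container; only to_numpy() is provided."""

    def __init__(self, data):
        self._data = np.asarray(data)

    def to_numpy(self):
        return self._data


scalar_result = check_1d_sketch(3.0)
frame_result = check_1d_sketch(FakeFrame([[1, 2], [3, 4]]))
```

Because the conversion happens before `update_units` is called in the excerpt above, a units-aware object that also happened to expose `to_numpy()` would be unwrapped too, which is the concern raised in the comment.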

dstansby reviewed

Feb 18, 2022

lib/matplotlib/cbook/__init__.py Outdated

FIX: simplify a bit more

6dfa93a

dstansby approved these changes

Feb 19, 2022

Improve pandas/xarray/... conversion #22560

Member Author

jklymak commented Feb 21, 2022

@tacaswell, this simplified a bit more since your approval - if you wanted to double check, that would be appreciated.

QuLogic mentioned this pull request

Feb 25, 2022

Merged

2 tasks

Member

oscargus commented Feb 27, 2022

In #22560 I added cbook._unpack_pandas (so _unpack_pandas for this purpose), which basically does to_numpy(), but also with a fallback to values. I did not touch the lines you changed here, though.

I guess it can make sense to have a coordinated merge of this and that PR. If this is merged first, I'll update my PR. If my PR is merged first, it can make sense to use that function here.
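The behaviour described above (prefer `to_numpy()`, fall back to `values`) might look roughly like the sketch below. The function name `unpack_to_numpy` and the two stand-in classes are assumptions made for illustration; the real helper in #22560 may differ in name and details.

```python
import numpy as np


def unpack_to_numpy(x):
    """Hypothetical helper mirroring the description above."""
    if isinstance(x, np.ndarray):
        return x
    # Preferred path: containers that expose to_numpy().
    if hasattr(x, "to_numpy"):
        return x.to_numpy()
    # Fallback for containers that only expose a .values attribute.
    if hasattr(x, "values"):
        values = x.values
        # Guard against unrelated attributes, e.g. the dict.values method.
        if isinstance(values, np.ndarray):
            return values
    # Anything else is returned unchanged.
    return x


class OldStyleSeries:
    """Stand-in for a container that only has .values."""

    def __init__(self, data):
        self.values = np.asarray(data)


class NewStyleSeries(OldStyleSeries):
    """Stand-in for a container that also has to_numpy()."""

    def to_numpy(self):
        return np.asarray(self.values, dtype=float)


old = unpack_to_numpy(OldStyleSeries([1, 2]))
new = unpack_to_numpy(NewStyleSeries([1, 2]))
```

Keeping the duck typing in one function means the fallback order only has to be decided once, which is the argument made for factoring it out.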

Member Author

jklymak commented Feb 27, 2022

Well first it's not only pandas that has to_numpy so that is a bit of a misnomer. But also, why have a separate method at all?

Member

oscargus commented Feb 27, 2022

It was suggested to use a separate function. Right now, slightly different approaches are used at different locations in the code: sometimes with a fallback to values, sometimes not. A benefit is possibly that it is enough to modify it in a single location (and that old pandas versions are still supported with this fallback). Also, if we want to support new libraries that use some other name, it will be easy to do that. Well, all the standard reasons to factor out a piece of common code...

Regarding naming, I considered that, but I do not know which other libraries support that function. However, I do not see the name as either written in stone or something that should prohibit using a single function; there will be a name that is correct enough. (And as you can see, there are explicit comments mentioning pandas at all the other locations where it was used.)

Member

oscargus commented Feb 27, 2022

I then assume that we merge this first, find a good name for the function and, if you want to strongly object to using the function here, you can do that in #22560.

Member Author

jklymak commented Feb 27, 2022

I meant why have a separate method from check_1d? Our problem is inconsistent duck typing, so if we can have it all in one spot, that would be very helpful. If check_1d does more than duck type pandas, then sure, it could call the duck-type converter.

I do actually wonder if all of this should just be part of the unit conversion machinery rather than cbook calls

Member

oscargus commented Mar 1, 2022

> I meant why have a separate method than check_1d?

I do not have enough of an overview of the code base to see if one should/could have used check_1d (or check_2d?) instead. But if possible, that is of course even better.

tacaswell merged commit 0359832 into matplotlib:main

meeseeksmachine pushed a commit to meeseeksmachine/matplotlib that referenced this pull request

Backport PR matplotlib#22141: Fix check 1d

4715aff

meeseeksmachine mentioned this pull request

Backport PR #22141 on branch v3.5.x (Fix check 1d) #22592

Merged

QuLogic added a commit that referenced this pull request