Movatterモバイル変換

Preparations for multivariate plotting#29877

This was referencedApr 6, 2025

Open

Multivariate plotting in imshow, pcolor and pcolormesh#29221

Open

oscargus reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

oscargus reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

oscargus reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

Apr 7, 2025

lib/matplotlib/colors.py Outdated

		@@ -2320,6 +2320,16 @@ def __init__(self, vmin=None, vmax=None, clip=False):
		self._scale = None
		self.callbacks = cbook.CallbackRegistry(signals=["changed"])

		@property
		def n_input(self):

Copy link

Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Perhaps input_dims, output_dims for consistency with Transforms?

Copy link

ContributorAuthor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Dimensions is probably somewhat confusing term in this context, because it typically means the number of axes in an array, i.e.np.zeros((4,5,6)) has three dimensions, but the correspondingn_input would be four [if shown as an image of size (5,6)].

What do you think ofinput_variates andoutput_variates, orinput_vars andoutput_vars?

Copy link

Contributor

anntzerApr 7, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Sure, although transforms use the same terminology? No strong opinion there.
However this does raise another issue, which is that you seem to put the variates as thefirst dimension. Intuitively I'd rather put them as thelast dimension (just like if you pass a 2D array to plot, thelast dimension are the different variates). I'm sure this design choice must have been discussed somewhere, can you link that discussion?

Copy link

ContributorAuthor

trygvradApr 7, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

We had a brief discussion on this here#14168 (comment), but we followed that up on a weekly meeting 14 September 2023https://hackmd.io/@matplotlib/Skaz1lrh6#Mutlivariate-colormapping .

The argument is not formulated well in the log, but basically(V, N, M) because there areV cmaps and norms, and in this way the input for the data mirrors that of the vmin, vmax and norm keywords, i.e.:
ax.imshow((data0, data1), vmin=(vmin0, vmin1), cmap='BiOrangeBlue')
or

ax.imshow((preassure,temperature),vmin=(vmin_preassure,vmin_temperature),cmap='BiOrangeBlue')

Quite often the data is qualitatively different, like the example at the bottom of the page here:https://trygvrad.github.io/mpl_docs/Multivariate%20colormaps.html whereGDP_per_capita andAnnual_growth are plotted together. I find it is not very intuitive to wrap these in a single array.

(also: this allows the different variates to have different data types)

Copy link

Contributor

anntzerApr 10, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Sure, looks like I already lost that discussion once so let's not relitigate it :)
I'll just ping@timhoffm who may have some idea as to the best name for n_input/input_dims (or some other name).

Copy link

ContributorAuthor

trygvradMay 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Thank you@timhoffm :D

Why do we have to distinguish input and output? They are both the same here. Do you anticipate that there can be norms with different inputs and outputs? What would be an example for that? - I'm asking because that decides whether we need the input/output parts in the name.

The thinking was that there is no requirement that they be equal, so as to be future-proof they are here implemented as separate. I do believe this was mentioned at a meeting at one point, but I have spent the last day trying to imagine a use case where it would be beneficial to have them separate. I can imagine some use cases where one might want to map a single scalar to two colorbars, such as height above and below sea level on separate colorbars, but even as I stretch my mind to find a use case where one scalar maps to multiple colorbars, I cannot find a use case where it is more intuitive to integrate that into the norm than to handle the additional complexity a separate pre-processing step and use a norm where the number of inputs and outputs are the same.

With that in mind I think we should change the names from n_input/n_output to a single variable name.

In my mind the following are good options:

n_variates
n_channels
n_dims
Or we could go even shorted and just usevariates,channels ordims.

I have no strong opinion now, but this has been my logic on this topic previously:
Whenn_variates was implemented inMultivariateColormap my thinking was that we have the ability to define our own vocabulary in this case, and whatever word we choose, it is better if we are consistent, thus we ended up withMultivariateColormap andn_variates, but we could alternatively haveMultichannelColormap andn_channels. If we choose to usen_variates also forMultiNorm it would give an indication to that this relates toMultivariateColormap. Similarly, the word 'dims' is used in the context of transform, while the word 'channel' is weakly connected to the concept of an alpha-channel. My hunch has been that using a less-used word [variates] gives us more power to imbue that word with meaning, and that makes it easier to be specific in our vocabulary going forward.

How often will these properties be used by end users? - I'm asking, because if they are rarely used, we may affort longer names for more clarity.

I don't intend for users to use them. The number is implicit from the creation of aMultiNorm, i.e.MultiNorm(['log', None]) or evenplt.imshow([a, b], cmap='2AddA', norm=['log', None]).

Their importance is only for error-handling, to make sure that the data, norm and colormap are compatible, and give the user a reasonable error.

@anntzer I do not hold strong opinion, so if you let me know what you prefer, I will update this PR accordingly.
As this is not something we expect users to interact with much, the main concern should be the maintainers :)

Copy link

Member

timhoffmMay 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I agree that a single variable is sufficient. It's not the responsibility of the Norm to create or condense information. So it can only beN -> N. It could possibly even be1 -> 1 if we allow a list of Norm instances for theN -> N case. Advantage would be that we don't need to create any new norms and the logic stays simpler. The only thing we'd loose is the ability to couple the variables, but I don't see that that's needed. - We could always later introduce MultiNorm. To be checked: Maybe it's favorable internally to collect these into a single instance, but that would not need to be public. What do you think?

I asked ChatGPT "would you use variate as a standalone term?":

Excellent question — and honestly,"variate" isn’t super commonly used by itself in casual conversation or even in a lot of applied work, but itdoes exist as a proper term in stats.
Technically:
Avariate is a random variable, or a measurable quantity that can take on different values.
Example: If you're studying people's heights, each person’s height would be a value of the "height variate."
In practice:
You’re way more likely to hear "variable" or "random variable" than "variate."
"Variate" mostly shows up in more formal, old-school, or academic statistical texts.
Example: "The sample contains 100 observations of the variate X."
It really shines in compound terms like:
Univariate → one variable
Bivariate → two variables
Multivariate → multiple variables
But you wouldn’t usually say, “Let’s analyze this variate.” You’d just say, “variable.”

I don't intend for users to usen_variates. Then (if we still want to expose MultiNorm), let's just call thisn_variables and keep it private.

Copy link

ContributorAuthor

trygvradMay 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Lets go forn_variables then, it's a good name :)

So it can only be N -> N. It could possibly even be 1 -> 1 if we allow a list of Norm instances for the N -> N case. Advantage would be that we don't need to create any new norms and the logic stays simpler. The only thing we'd loose is the ability to couple the variables, but I don't see that that's needed. - We could always later introduce MultiNorm. To be checked: Maybe it's favorable internally to collect these into a single instance, but that would not need to be public. What do you think?

We have discussed this before in#28428 where I initially implemented multivariate color mapping with list of norms instead of MultiNorm#28428 (comment) . This was then later changed to MultiNorm.

However, this was before the introduction of theColorizer as a container for norm→colormap.

The choice of having a list of norms or a MultiNorm object is now somewhat a question of where we put the additional complexity for supporting norms for multivariate/bivariate colormaps. Do we put the complexity in A: Colorizer or B: MultiNorm. My hypothesis is that having MultiNorm as a separate class simplifies the colorizer class, as we can call methods on theColorizer.norm regardless of if the data is multivariate or not. In my mind this will make it easier to maintain.

The top-level plotting functions should be largely unaffected by this choice, as they should operate through the colorizer interface, and should not use the norm directly.

If we choose to have a MultiNorm class, it is then a separate question if MultiNorm should be private. To me, this boils down to what should the user recieve if requesting the norm after plotting a bivariate/multivariate image:

im=plt.imshow((a,b),cmap='BiOrangeBlue')im.norm# ← what should this be?

My hypothesis here has been that it is easier for the userìm.norm has type stability, i.e. it is always a subclass ofcolors.Normalize, and therefore we make MultiNorm public.

@timhoffm Let me know if you want me to implement a version with a list of norms instead of MultiNorm, it will have to be prototyped as a complete solution (with working top-level plotting methods) similar to#29221 , but if we go for that solution I could then break it into smaller PRs for review.

Copy link

Member

timhoffmMay 6, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Well, you don't really have type stability. While MultiNorm is formally a subclass of Normalize, it violates LSP (*), the call you need to do onim.norm depends on the type.

(*) I'm not a LSP nazi, it's sometimes ok to violate it. But you can't argue for that with type stability.

My hypothesis is that having MultiNorm as a separate class simplifies the colorizer class

This is a valid argument and we should at least have MultiNorm internally.

To me, this boils down to what should the user recieve if requesting the norm after plotting a bivariate/multivariate image

Yes. Technically that's a separate question, and I don't have a strong preference here. But since the class exist, it may be ok to expose it.

Copy link

ContributorAuthor

trygvradMay 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

All true, I was a bit inaccurate in my comment.
I have changed from n_input/n_output to n_variables as agreed :)

anntzer reviewed

Apr 7, 2025

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

Apr 7, 2025

lib/matplotlib/scale.py OutdatedShow resolvedHide resolved

github-actionsbot removed the topic: transforms and scales label

Apr 10, 2025

QuLogic reviewed

Apr 17, 2025

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

lib/matplotlib/colors.pyShow resolvedHide resolved

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

lib/matplotlib/colors.py Outdated

Comment on lines 4103 to 4105

		in the case where an invalid string is used. This cannot use
		`_api.check_getitem()`, because the norm keyword accepts arguments
		other than strings.

Copy link

Member

QuLogicApr 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'm confused because this function is only called forisinstance(norm, str).

Copy link

ContributorAuthor

trygvradApr 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

So this function exists because the norm keyword acceptsNormalize objects in addition to strings.
This is fundamentally the same error you get if you give an invalid norm to aColorizer object.

In main, the@norm.setter oncolorizer.Colorizer reads:

@norm.setterdefnorm(self,norm):_api.check_isinstance((colors.Normalize,str,None),norm=norm)ifnormisNone:norm=colors.Normalize()elifisinstance(norm,str):try:scale_cls=scale._scale_mapping[norm]exceptKeyError:raiseValueError("Invalid norm str name; the following values are "f"supported:{', '.join(scale._scale_mapping)}"                )fromNonenorm=_auto_norm_from_scale(scale_cls)()    ...

The_get_scale_cls_from_str() exists in this PR because this functionality is now needed by bothcolorizer.Colorizer.norm() andcolors.MultiNorm.
Note this PR does not include changes tocolorizer.Colorizer.norm() so that it makes use of_get_scale_cls_from_str(). These changes follow in the next PR:#29877 .

lib/matplotlib/colors.pyi OutdatedShow resolvedHide resolved

trygvrad force-pushed themultivariate-plot-prapare branch from55b85e3 tof42d65bCompare

April 17, 2025 15:18

anntzer reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

anntzer reviewed

Copy link

Contributor

anntzer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Just some minor points, plus re-pinging@timhoffm in case he has an opinion re: n_input / input_dims naming?

Copy link

ContributorAuthor

trygvrad commentedMay 4, 2025

Thank you for the feedback@anntzer !
Hopefully we can hear if@timhoffm has any thoughts on n_input / input_dims naming within the coming week.

Copy link

Member

timhoffm commentedMay 5, 2025

See#29876 (comment)

Copy link

ContributorAuthor

trygvrad commentedMay 7, 2025

Thank you@timhoffm
The PR should now be as we agreed (#29876 (comment)) :)

QuLogic reviewed

May 23, 2025

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

lib/matplotlib/colors.pyShow resolvedHide resolved

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

lib/matplotlib/colors.pyShow resolvedHide resolved

Copy link

ContributorAuthor

trygvrad commentedJun 1, 2025

@QuLogic Thank you again and apologies for my tardiness (I was sick)
@timhoffm Do you think you could approve this PR now?

timhoffm reviewed

Jun 2, 2025

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

lib/matplotlib/colors.py Outdated

		@@ -3219,6 +3224,224 @@ def inverse(self, value):
		return value


		class MultiNorm(Normalize):
		"""
		A mixin class which contains multiple scalar norms

Copy link

Member

timhoffmJun 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

AFAICS, this is not a mixin. It's a "normal" subclass.

Speaking of which, have we discussed whether we want to make this a subclass of Normalize? This breaks LSP, not that I'm a hardcore LSP advocate, but I feel we may become unnecessarily sloppy.

It may be reasonable to introduce a commonNorm base class (or possibly just a protocol). Which bothNormalize andMultiNorm (should we call thisMultiNormalize?) derive from/implement.

Copy link

ContributorAuthor

trygvradJun 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

@timhoffm I don't think we have discussed this, but maybe we should. What solution do you think is preferable?
@story645 Should we add this a topic for the next weekly meeting?

Copy link

Member

timhoffmJun 4, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'd feel more comfortable ifMultiNorm is not a subclass ofNormalize.Normalize is already weird enough in that the linear norm is the base class for all other (scalar) norms. Having a genericNorm concept. Makes sense to me. I've not looked in to the implementation to judge whether abstract base class or protocol is the better implementation. - Unfortunately, I cannot predict whether I can make it to the next weekly meeting.

Copy link

ContributorAuthor

trygvradJun 5, 2025•
edited by story645
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

To me an abstract base class seems more intuitive than a protocol, but I have no strong preference.
(I find one use of an abstract class in the extisting codebase (animation.AbstractMovieWriter), and no use of a Protocol)

In that case I have some questions:

Should it only have methods with@abstractmethod or also include some implementations (I'm thinking of@property def n_variables(self) anddef _changed(self).
There are a lot of required functions for a norm so that it can function in all cases,vmin, vmax, inverse, autoscale, autoscale_none, scaled, clip should the abstract class define all of them with@abstractmethod, or only a subset.
I assume we will need to change the docstrings for Colorizer immediately. Once the top level functions can accept a MultiNorm, we will have to change the docstrings here as well, to specifiy that input can beNorm instead ofNormalize

I'm thinking something along these lines:

classNorm(ABC):@propertydefn_variables(self):# To be overridden by subclasses with multiple inputsreturn1def_changed(self):"""        Call this whenever the norm is changed to notify all the        callback listeners to the 'changed' signal.        """self.callbacks.process('changed')@property@abstractmethoddefvmin(self):pass@property@abstractmethoddefvmax(self):pass@property@abstractmethoddefclip(self):returnself._clip@abstractmethoddef__call__(self,value,clip=None):pass@abstractmethoddefinverse(self,value):passdefautoscale(self,A):passdefautoscale_None(self,A):pass@abstractmethoddefscaled(self):pass

I'm somewhat tempted to write it in such a way thatNoNorm could in theory inherit only fromNorm and not fromNormalize. I don't think there is any practical reason to do so, but if I was designing the architecture from scratch I would have made at leastNormalize,NoNorm andMultiNorm inherit from the abstract classNorm.

Copy link

Member

timhoffmJun 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

(I find one use of an abstract class in the extisting codebase (animation.AbstractMovieWriter), and no use of a Protocol)

We have several base classe (FigureBase,_ImageBase,TransformNode, ...`) that are not technically abstract, but mainly define API and common functionality but are not meaningful on their own.

There are no Protocols in Matplotlib because Protocols are much younger than Matplotlib and on top, they are only really helpful if you use typing, which we have just adopted.

IMHO the decision between abstract class and protocol should be taken on how much common code and (internal) logic there is. A protocol is less invasive because you can keep theNormalize class exactly as is and just define a compatible standaloneMultiNorm next to it. The compatibility is ensured through adding the protocol. From your code example it looks like only_changed() andclip are duplicated, which I find small enough to justify a protocol. But no strong opinion either.

In case of going with an abstract class, it'd be advisable to try and introduce this frist in a separate PR.

To the questions:

There's no real value in pure abstract classes (in contrast, that would rather point towards a protocol). If you have shared functionality, that can be implemented in the abstract base class.
The abstract class should define the complete common API such that somedef f(norm: Norm): norm.somefunc will not be flagged by type checkers.
Yes.
I think I would not bother withNoNorm right now. It's working as is. While it's logically a bit awkward that it derives fromNormalize, the same holds true for all the non-linear norms. Additionally, puttingNoNorm directly underNorm would require that you implement all the abstract methods / the complete protocol.

Copy link

ContributorAuthor

trygvradJun 5, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

@tacaswell expressed a preference for a Protocol. The consensus in the meeting was that this will prevent it from growing into its own beast.

I will try to make this in a separate branch, and tag you@timhoffm. I will probably need some feedback on how to document it.

EDIT: the new PR is in#30149

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

		vmin, vmax : float, None, or list of float or None
		Limits of the constituent norms.
		If a list, each value is assigned to each of the constituent
		norms. Single values are repeated to form a list of appropriate size.

Copy link

Member

timhoffmJun 2, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Is broadcasting reasonable here? I would assume that most MultiNorms have different scales and thus need per-element entries anyway. It could also be an oversight to pass a single value instead of multiple values.

I'm therefore tempted to not allow scalars here but require exactly n_variables values. A more narrow and explicit interface may be the better start. We can always later expand the API to broadcast scalars if we see that's a typical case and reasonable in terms of usability.

Copy link

ContributorAuthor

trygvradJun 4, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

@timhoffm Perhaps this is also a topic for the weekly meeting :)

Copy link

ContributorAuthor

trygvradJun 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'm perfectly fine with removing this here, and perhaps that is a good starting point.

My entry into this topic was a use case (dark-field X-ray microscopy, DFXRM) where we typically wantvmax0 = vmax1 = -vmin0 =-vmin1, i.e. equal normalizations, and centered on zero, and given that entry point it felt natural to me to include broadcasting.

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

QuLogic reviewed

Jun 4, 2025

Creation of the Norm Protocol#30149

lib/matplotlib/colors.py OutdatedShow resolvedHide resolved

trygvrad force-pushed themultivariate-plot-prapare branch from6b86d63 to32247f5Compare

June 4, 2025 21:01

trygvrad mentioned this pull request

Jun 6, 2025

Closed

Copy link

ContributorAuthor

trygvrad commentedJun 6, 2025

This is on hold until we sort out#30149 (Norm Protocol)
see#29876 (comment)

story645 added the status: waiting for other PR label

Jun 6, 2025

trygvrad mentioned this pull request

Jun 15, 2025

Abstract base class for Normalize#30178

Merged

github-actionsbot added the status: needs rebase label

Jun 28, 2025

trygvradand others added8 commits

June 29, 2025 12:46

MultiNorm class

f521bc3

This commit introduces the MultiNorm calss to prepare for the introduction of multivariate plotting methods

updates based on feedback from review,@oscargus,@anntzer

1557de1

Apply suggestions from code review

eefc23c

Thank you@QuLogicCo-authored-by: Elliott Sales de Andrade <quantum.analyst@gmail.com>

Updates based on feedback from@anntzer

d308403

change MultiNorm.n_intput to n_variables

c1a51e2

Updates from code review

342c5b7

Thank you@QuLogic for the feedback

update to conform to linter

b3c4d4d

updates based on feedback from@timhoffm(and@QuLogic)

bd8e726

trygvrad force-pushed themultivariate-plot-prapare branch from32247f5 tod49737fCompare

June 29, 2025 11:04

github-actionsbot removed the status: needs rebase label

Jun 29, 2025

Copy link

ContributorAuthor

trygvrad commentedJun 29, 2025

I have rebased this PR and updated the MultiNorm to inherit from the Norm ABC now that#30178 has been merged :)

trygvrad force-pushed themultivariate-plot-prapare branch fromd49737f to47b5116Compare

June 29, 2025 13:38

Let MultiNorm inherit from Norm ABC

63feb6b

trygvrad force-pushed themultivariate-plot-prapare branch from47b5116 to63feb6bCompare

June 29, 2025 14:09

timhoffm removed the status: waiting for other PR label

Jun 29, 2025

timhoffm reviewed

Jun 30, 2025

		@property
		@abstractmethod
		def n_variables(self):
		# Returns the number of variables supported by this normalization

Copy link

Member

timhoffmJun 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Why is this a comment and not a docstring?

Suggested change

	# Returns the number of variables supported by this normalization
	"""
	Thenumberofnormalizedvariables.

	Thisisnumberofelementsoftheparameterto``__call__``andof
	vmin,vmax.
	"""

		"""
		Normalize the data and return the normalized data.

		Each variate in the input is assigned to the constituent norm.

Copy link

Member

timhoffmJun 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	Eachvariateintheinputisassignedtotheconstituentnorm.
	Eachvariateintheinputisassignedtotheconstituentnorm.

Per#29876 (comment) let's not use "variate".

Alternatives here (in order of my preference): elment, component, variable

		Data to normalize. Must be of length `n_variables` or be a structured
		array or scalar with `n_variables` fields.
		clip : list of bools or bool, optional
		See the description of the parameter clip in Normalize.

Copy link

Member

timhoffmJun 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	SeethedescriptionoftheparameterclipinNormalize.
	Determinesthebehaviorformappingvaluesoutsidetherange
	``[vmin,vmax]``.Seethedescriptionoftheparameterclipin
	`.Normalize`.

At least give the one-sentence summary to give the idea on what this is about so that people can judge whether it's relevant for them and worth looking up the details.

Comment on lines +3416 to +3417

		list
		Normalized input values as a list of length `n_variables`

Copy link

Member

timhoffmJun 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I think a list is not the right structure. Either we do a tuple of lengthn_variables - likevmin/vmax, or we return a 2D array - need to think on dimensionality here as well. Also, if the input is a structured array, should the output also be a structured array?

Comment on lines +3406 to +3408

		value : array-like
		Data to normalize. Must be of length `n_variables` or be a structured
		array or scalar with `n_variables` fields.

Copy link

Member

timhoffmJun 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

We need to be more precise. Normalize takes either a scalar or an array-like. How do we generalize? If we have two norms, do you expect[scalar1, scalar2],[array-like2, array_like2]? is it reasonable to accepts a 2d array, if so what is the dimensionality, (N, 2) or (2, N)?