Movatterモバイル変換

This comment was marked as outdated.

the-knights-who-say-ni added the CLA not signed label

sweeneyde reviewed

Objects/longobject.cShow resolvedHide resolved

gvanrossum reviewed

Copy link

Member

gvanrossum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I tried to compile and found a bunch of trivial errors. Mind fixing those?

Comment on lines +81 to +83

		* These integers have a capacity of 62bits on 64-bit architectures:
		one bit for the "is inlined" flag, and one sign bit. This is 30 bits on
		32-bit architectures (for the same reasons).

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Shouldn't that be called a capacity of 63 (or 31) bits? When we use the full 64-bit word we don't say that the capacity is 63 bits plus sign, we usually just say "64-bit signed integers".

		if (sizeof(long) == sizeof(uint64_t)) {
		return _Py_popcount64(x);
		}
		_Py_UNREACHABLE();

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

The actual macro has no leading_.

		return 0;
		}
		// __builtin_clzl() is undefined for x = 0.
		Py_BUILT_ASSERT(sizeof(long) <= sizeof(uint32_t));

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Did you mean>=?


		static inline int _Py_bit_length(unsigned long x)
		{
		_Py_BUILD_ASSERT(sizeof(x) == sizeof(uint32_t) \|\| sizeof(x) == sizeof(uint64_t));

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

No leading_.

		_Py_UNREACHABLE();
		}

		static inline bool _Py_add_overflow32(int32_t a, int32_t b, int32_t *result)

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

Copy link

ContributorAuthor

lpereiraFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

A lot of these little things will be ironed out whenever the compiler yells at me. I still haven't compiled this, so I'm sure I'll hear a lot whenever I'm ready to do that.

Copy link

Member

gvanrossumFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Well, just understand that I find it easier to review when there aren't little details that could be fixed by trying to compile and fixing what it finds. :-)

		}
		} else {
		assert(!is_inlined);
		x_p = (PyLongObject )_PyLongFromSignedSize(-value);

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

_PyLong_FromSignedSize (_ after_PyLong) here and a few times below.

gvanrossum reviewed

Copy link

Member

gvanrossum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Couple more. (I'm sending these as I come across them, I'm reviewing this in dribs and drabs between other activities. :-)

		/* Long Integer Representation
		---------------------------

		There are two representations of long objects: the inlined

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Technically the optimized representation isn't "inlined" -- I would think that that term might be reserved for a version using tagged pointer values.

Copy link

ContributorAuthor

lpereiraFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'll change this to "small num" and "big num", or something like that, to make this clearer.


		For inlined longs, their value can be obtained with this expression:

		ob_size >> 1

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Also document the macro one is suppsed to use. :-)

Comment on lines +97 to +98

		* These numbers can also be normalized. In a normalized number,
		ob_digit[abs(ob_size)-1] (the most significant digit) is never zero.

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Isn't there also a stronger guarantee that when the object leaves longobject.c it is always normalized? (I.e. unnormalized longs only exist as intermediary results.)

Comment on lines +108 to +109

		aware that ints abuse ob_size's sign bit and its least significant
		bit.

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'd rather say that they abuse the ob_size value. :-)

Comment on lines +213 to +215

		int upper_bits = _Py_bit_length32((uint32_t)(x >> 32));
		int lower_bits = _Py_bit_length32((uint32_t)x);
		return upper_bits + lower_bits;

Copy link

Member

gvanrossumFeb 26, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

That doesn't look right? It should use the bit length of the upper bits, plus 32, if the upper bits are nonzero, else the bit length of the lower bits, or something like that.

(I wonder if there should be a mode where you don't use the GCC/clang/MSVC versions for any of these functions, to test that the fallback versions are correct.)

Copy link

ContributorAuthor

lpereiraFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

You're right. Good catch. I was really tired when I wrote this code and ended up not testing/proving this is correct.

gvanrossum reviewed

Feb 28, 2022

Copy link

Member

gvanrossum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Flushing a few comments I had pending. I will wait with more review activities until you've decided how to address the issues Markbrought up. (If you want me to review more, please point me to something specific and I'll take a look, of course.)

		_Py_UNREACHABLE();
		}

		static inline bool _Py_add_overflow32(int32_t a, int32_t b, int32_t *result)

Copy link

Member

gvanrossumFeb 27, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

(Also our C style guide requires a line break between the return type and the function name.)

Comment on lines +50 to +52

		Py_ssize_t value;
		(void)as_inlined_int(x, &value);
		returnvalue == 0;

Copy link

Member

gvanrossumFeb 27, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Couldn't this just check whether the size is zero? I.e.return Py_SIZE(x) == 0;

(Same foris_negative() below.)

Copy link

ContributorAuthor

lpereiraFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I don't see that working, because the LSB is used as a flag to distinguish between inlined numbers and big numbers.

What could be done, though, to avoid shifting, is doing a unsigned, lesser-than-or-equal-to comparison against1ULL. If the size is either 1 (inlined number with LSB set) or 0 (big number with LSB unset), then the number is certainly zero.

		Py_LOCAL_INLINE(Py_ssize_t)
		encode_size(Py_ssize_t value)
		{
		assert(!fits_in_size_field(value));

Copy link

Member

gvanrossumFeb 27, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Isn't this assert reversed? Assuming the argument is the number of digits, it had better fit in the size field. :-)

Copy link

ContributorAuthor

lpereiraFeb 28, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

No, this is used to encode the number of digits for big nums. It's not supposed to be used when we're encoding the value that will be used inob_size for inlined ints (they have the LSB set).

Maybe the function has to have a better name to avoid this confusion.