Movatterモバイル変換

		@@ -30,7 +30,7 @@

		# The maximum field value for int32 id's -- which is also the maximum
		# number of simultaneous in-flight requests.
		INT32_MAX = (2**31) - 1
		INT32_MAX = (2**32) - 1

Copy link

Contributor

alanwguoMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

is this safe? This is assuming it's an unsigned int?

Copy link

ContributorAuthor

alexeykudinkinMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Python ints are bigints so there's no overflow.

Also can leave this change out if folks are nervous about it (perfectionist in me couldn't pass this one and not try to correct it)

Copy link

ContributorAuthor

alexeykudinkinMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

cc@edoakes @jjyao

Copy link

Contributor

alanwguoMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I don't know how this is used, but if all of it's usages are safe for this change, I'm fine with it.

Copy link

Contributor

alanwguoMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

One usage seems related to pass the value into a grpc client, so it might not stay in pythonland

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

this is definitely a little dangerous given it seems to be used when interacting with our cpp IDs & gRPC. let's not couple it together with this change

Copy link

Contributor

rynewangMay 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

why do we need to define our own value, instead ofsys.maxsize

Copy link

ContributorAuthor

alexeykudinkinMay 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Ack, will get rid of this one

alanwguo approved these changes

May 3, 2024

		@@ -30,7 +30,7 @@

		# The maximum field value for int32 id's -- which is also the maximum
		# number of simultaneous in-flight requests.
		INT32_MAX = (2**31) - 1
		INT32_MAX = (2**32) - 1

Copy link

Contributor

alanwguoMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I don't know how this is used, but if all of it's usages are safe for this change, I'm fine with it.

		@@ -30,7 +30,7 @@

		# The maximum field value for int32 id's -- which is also the maximum
		# number of simultaneous in-flight requests.
		INT32_MAX = (2**31) - 1
		INT32_MAX = (2**32) - 1

Copy link

Contributor

alanwguoMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

One usage seems related to pass the value into a grpc client, so it might not stay in pythonland

edoakes reviewed

May 3, 2024

dashboard/modules/job/job_agent.py

Comment on lines +193 to +194

		# NOTE: Sending chunk over the web-socket is an async operation,
		# allowing sync tailing iteration to yield the event-loop

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

What does this mean?tail_job_logs is an async iterator. so don't know what you mean by "sync tailing iteration"

Comment on lines -16 to +21

		NUM_LOG_LINES_ON_ERROR = 10
		#Maximum number of characters to print out of the logs to avoid
		MAX_LOG_LINES_ON_ERROR = 10
		#Max number of characters to print out of the logs to avoid

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

let's please avoid pure renaming/refactoring in addition to logic changes. it makes our lives harder as reviewers to cut through the noise and increases the chance of bugs slipping through

dashboard/modules/job/utils.py

Comment on lines +64 to +66

		# log_tail_iter can return batches of lines at a time.
		for line in lines:
		log_tail_deque.append(line)

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	# log_tail_iter can return batches of lines at a time.
	forlineinlines:
	log_tail_deque.append(line)
	# log_tail_iter can return batches of lines at a time.
	log_tail_deque.extend(lines)

		@@ -84,8 +88,8 @@ def file_tail_iterator(path: str) -> Iterator[Optional[List[str]]]:
		# - We accumulated at least MAX_CHUNK_CHAR_LENGTH total chars

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

update comment above with hardcoded10

Comment on lines +29 to +34

		def tail_logs(
		self,
		job_id: str,
		*,
		max_lines_per_chunk: int = MAX_LINES_PER_CHUNK,
		) -> Iterator[List[str]]:

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

it might make more sense to convert this iterator toasync directly because this is called in other places where we similarly need to avoid blocking the loop

Comment on lines -40 to +53

		num_log_lines: The number of lines to return.
		max_log_lines: The number of lines to return.

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

again please avoid unnecessary pure refactoring

Comment on lines +72 to +73

		if read_lines_count % max_lines_per_chunk == 1:
		await asyncio.sleep(0)

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

couldn't this avoid yielding if batch sizes not matchingmax_lines_per_chunk are returned repeatedly?

given thatself.tail_logs already takes themax_lines_per_chunk, why not just always yield the loop each iteration?

		@@ -30,7 +30,7 @@

		# The maximum field value for int32 id's -- which is also the maximum
		# number of simultaneous in-flight requests.
		INT32_MAX = (2**31) - 1
		INT32_MAX = (2**32) - 1

Copy link

Collaborator

edoakesMay 3, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

this is definitely a little dangerous given it seems to be used when interacting with our cpp IDs & gRPC. let's not couple it together with this change

rynewang reviewed

May 4, 2024

dashboard/modules/job/utils.py

		*,
		max_lines_per_chunk: int = MAX_LINES_PER_CHUNK,
		max_chunk_char_size: int = MAX_CHUNK_CHAR_SIZE,
		) -> Iterator[Optional[List[str]]]:
		"""Yield lines from a file as it's written.

		Returns lines in batches of up to 10 lines or 20000 characters,

Copy link

Contributor

rynewangMay 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

this docstring can be updated?

dashboard/modules/job/utils.py

		@@ -105,9 +109,6 @@ def file_tail_iterator(path: str) -> Iterator[Optional[List[str]]]:
		# Add line to current chunk
		lines.append(curr_line)
		chunk_char_count += len(curr_line)

Copy link

Contributor

rynewangMay 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

here thetime.sleep(1) is removed making it a busy loop if all logs are read. What about:

change the signature to returnAsyncIterator[Optional[List[str]]] or evenAsyncIterator[List[str]], and if it reached EOF, doasyncio.sleep(1).

And consequently changeJobLogStorageClient.tail_logs to returnAsyncIterator.

In fact this works nicely soJobManager.tail_job_logs no longer needs its ownasyncio.sleep.

		@@ -30,7 +30,7 @@

		# The maximum field value for int32 id's -- which is also the maximum
		# number of simultaneous in-flight requests.
		INT32_MAX = (2**31) - 1
		INT32_MAX = (2**32) - 1

Copy link

Contributor

rynewangMay 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

why do we need to define our own value, instead ofsys.maxsize