NotificationsYou must be signed in to change notification settings
Fork5.2k
Star17.2k

datas tuning fix#98743

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Jump to bottom

Merged

Maoni0 merged 2 commits intodotnet:mainfromMaoni0:datas_tuning

Feb 23, 2024

Merged

datas tuning fix#98743

Maoni0 merged 2 commits intodotnet:mainfromMaoni0:datas_tuning

Feb 23, 2024

Conversation

Copy link

Member

Maoni0 commentedFeb 21, 2024•
edited
Loading

Change the HC (heap count) adjustment based on history and how successful the previous adjustment was -
- looking at the trending of this buffer and using it to detect if things look stable or if they are
  trending up/down (and if so how fast is that trend) and make a decision if we want to grow/shrink according to our calculation
- previous we barely ever shrank the HC, with this change we shrink as needed
- if we just grew and the calculation says to grow again, we grow more aggressively
- if we just shrink but the tcp didn't come down, and the calculation says to shrink again, we should avoid shrinking for a while
One of the reasons for outliers is something temporarily affected GC work. We pick the min tcp if the survival is very stable to avoid counting these outliers.
Added simple gen2 handling for BGC.
Bug fixes -
- When we change the heap count, we should not be refreshing all new heaps' budget which will cause a spike in heap size. If the budget is already partially used we should use up the existing budget and let the next GC will refresh it.
- Don't carry stcp over when HC is changed - it doesn't make sense since the estimated stcp is bogus
- Don't add the first sample as it's artificially skewed by startup time

There are a few issues with these that will be addressed in future checkins -

the aggressiveness factor needs to be capped and also it needs to discard history if history is too distant
growth is too aggressive for large tcps which causes an initial spike when we look at heap counts
recognize when the slope direction changes, ie, trending upward <-> downward and discard older entries as appropriate

ghost added the area-GC-coreclr label

Feb 21, 2024

ghost assignedMaoni0

Feb 21, 2024

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Worth checking if size > 0 as a precondition check?

Copy link

MemberAuthor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I've added an assert in slope which makes more sense.

Copy link

Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

kind of similar to log_with_base, is the assertion/condition intended formean or callers ofmean? If it's a precondition formean, then I would expect the precondition check to be inmean (or bothmean and the callers).

Or ifmean is supposed to support some callers with a negative size, then the final return probably needs to be something likereturn (size > 0) ? (sum / size) : 0

mrsharm reviewed


		size_t gc_heap::get_num_completed_gcs ()
		float log_with_base (float x, float base)
		{

		uint64_telapsed_between_gcs;// time between gcs in microseconds (this should really be between_pauses)
		uint64_tgc_pause_time;// pause time for this GC
		uint64_tmsl_wait_time;
		size_tgc_survived_size;

	size_tgc_survived_size;
	size_tgc_survived_size;// total survived size across all relevant generations for this GC

		//
		// We need to observe the history of tcp's so record them in a small buffer.
		//
		floatrecorded_tcp_rearranged[recorded_tcp_array_size];

		floatrecorded_tcp_rearranged[recorded_tcp_array_size];
		floatrecorded_tcp[recorded_tcp_array_size];
		intrecorded_tcp_index;
		inttotal_recorded_tcp;

	inttotal_recorded_tcp;
	inttotal_recorded_tcp;// can exceed the array size

		recorded_tcp_index++;
		if (recorded_tcp_index==recorded_tcp_array_size)
		{
		recorded_tcp_index=0;
		}

		if (total_recorded_tcp >=recorded_tcp_array_size)
		{
		intearlier_entry_size=recorded_tcp_array_size-recorded_tcp_index;
		memcpy (recorded_tcp_rearranged, (recorded_tcp+recorded_tcp_index), (earlier_entry_size*sizeof (float)));

		returncopied_count;
		}

		inthighest_avg_recorded_tcp (intcount,floatavg,float*highest_avg)

		floathighest_sum=0.0;
		inthighest_count=0;

		for (inti=0;i<count;i++)

		}

		float mean (float* arr, int size)
		{

	recorded_tcp_index++;
	if (recorded_tcp_index==recorded_tcp_array_size)
	{
	recorded_tcp_index=0;
	}
	recorded_tcp_index= (recorded_tcp_index+1) %recorded_tcp_array_size;

		// each time our calculation tells us to shrink.
		intdec_failure_count;
		intdec_failure_recheck_threshold;

		floatbelow_target_accumulation;
		floatbelow_target_threshold;

		// Currently only used for dprintf.

		// Recording the gen2 GC indices so we know how far apart they are. Currently unused
		// but we should consider how much value there is if they are very far apart.
		size_tgc_index;
		// This is (gc_elapsed_time / time inbetween this and the last gen2 GC)

		// at the beginning of a BGC and the PM triggered full GCs
		// fall into this case.
		PER_HEAP_ISOLATED_FIELD_DIAG_ONLYuint64_tsuspended_start_time;
		// Right now this is diag only but may be used functionally later.

		dynamic_heap_count_data.sample_index = (dynamic_heap_count_data.sample_index + 1) % dynamic_heap_count_data_t::sample_size;
		(dynamic_heap_count_data.current_samples_count)++;

	float avg_x = (float)sum_x / n;
	float avg_x = ((float)sum_x) / n;

		// Change it to a desired number if you want to print.
		int max_times_to_print_tcp = 0;

		// Return the slope, and the average values in the avg arg.

		}

		float median_throughput_cost_percent = median_of_3 (throughput_cost_percents[0], throughput_cost_percents[1], throughput_cost_percents[2]);
		float avg_throughput_cost_percent = (float)((throughput_cost_percents[0] + throughput_cost_percents[1] + throughput_cost_percents[2]) / 3.0);

		if (dynamic_heap_count_data.dec_failure_count)
		{
		(dynamic_heap_count_data.dec_failure_count)++;
		}
		else
		{
		dynamic_heap_count_data.dec_failure_count = 1;
		}


		if (shrink_p && step_down_int && (new_n_heaps > step_down_int))
		{
		// TODO - if we see that it wants to shrink by 1 heap too many times, we do want to shrink.

Movatterモバイル変換

datas tuning fix#98743

datas tuning fix#98743

Uh oh!

Conversation

Maoni0 commentedFeb 21, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

ghost commentedFeb 21, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Maoni0 commentedFeb 22, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

markplesFeb 23, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

markplesFeb 23, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

markples left a comment

Choose a reason for hiding this comment

Uh oh!

sebastienros commentedApr 5, 2024• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Maoni0 commentedFeb 21, 2024•
edited
Loading

markplesFeb 23, 2024•
edited
Loading

markplesFeb 23, 2024•
edited
Loading

sebastienros commentedApr 5, 2024•
edited
Loading