
Add more columns/dimensions to pg_wait_sampling#97

Open

Medvecrab wants to merge 13 commits into master from new_dimensions

Medvecrab (Contributor) commented Mar 26, 2025

Following the discussion in #94, I decided to add more columns to pg_wait_sampling_current/history/profile.

I've added most of the columns requested in #94, but some fields from "pg_stat_activity" (= localBackendStatusTable) such as query_text, query_start and query_id (if we actually took it from "pg_stat_activity") work in the following way: they are set at the start of a query and are NOT cleared when the query ends. So after executing "select 1;" in some client backend, we would keep sampling its query_text and query_start until that backend executes another query. We specifically avoided this kind of sampling when we implemented our own query_id tracking with Executor hooks.
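To make the difference concrete, here is a minimal standalone sketch of the two behaviours described above. It is not code from the PR: `DemoBackend`, `demo_query_start` and `demo_query_end` are hypothetical names, and the sketch assumes that a query_id of 0 means "no query running".

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical per-backend state; not the extension's real structs. */
typedef struct
{
    uint64_t activity_query_id; /* pg_stat_activity style: set at start, never cleared */
    uint64_t hook_query_id;     /* executor-hook style: cleared when the query ends */
} DemoBackend;

/* Called when a query starts: both values are updated. */
static void
demo_query_start(DemoBackend *b, uint64_t query_id)
{
    assert(query_id != 0);      /* 0 means "no query" in this sketch */
    b->activity_query_id = query_id;
    b->hook_query_id = query_id;
}

/* Called when a query ends: only the hook-tracked value is reset. */
static void
demo_query_end(DemoBackend *b)
{
    b->hook_query_id = 0;
}
```

A sampler reading `activity_query_id` after `demo_query_end` would still attribute waits of the idle backend to the finished query, while `hook_query_id` would not.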

All new columns are added to new views with suffix "_extended". Those views COULD be changed between extension versions, while existing views without those suffixes SHOULD NOT be changed.

One more thing to highlight: since we have to look into localBackendStatusTable when we sample some columns, performance is reduced. This is unavoidable and is noted in the updated README. BUT, in PostgreSQL 13-16 we can't reliably link the ProcGlobal and localBackendStatusTable arrays, and that is how get_beentry_by_procpid was born. For each interesting process from ProcGlobal we have to iterate through localBackendStatusTable. This has O(n^2) time complexity, where n is the total number of backends. Very inefficient. I could probably rework the collector code to iterate through localBackendStatusTable first and then find the corresponding entries in ProcGlobal, but I have not investigated it and am not sure it would be faster (it may be, depending on the cost of ProcGlobal access/iteration).

There could be some sloppy code, including possible "you should copy the struct here, not use a pointer" moments.
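The cost described above can be sketched outside PostgreSQL. The code below is only an illustration under stated assumptions: `DemoProc` and `DemoBeEntry` are hypothetical stand-ins for PGPROC and backend status entries, and `demo_find_linear` mimics the per-process scan in spirit, not the actual get_beentry_by_procpid. The sorted variant is one possible alternative that the PR does not implement: sort a private copy of the entries once per sampling pass and use binary search.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-ins for PGPROC and backend status entries. */
typedef struct { int pid; } DemoProc;
typedef struct { int pid; int backend_type; } DemoBeEntry;

/* Linear scan by pid: O(n) per call. */
static const DemoBeEntry *
demo_find_linear(const DemoBeEntry *tab, size_t n, int pid)
{
    for (size_t i = 0; i < n; i++)
        if (tab[i].pid == pid)
            return &tab[i];
    return NULL;
}

/* One scan per process: O(nprocs * nentries), i.e. O(n^2) overall. */
static size_t
demo_count_matched(const DemoProc *procs, size_t nprocs,
                   const DemoBeEntry *tab, size_t nentries)
{
    size_t matched = 0;

    for (size_t i = 0; i < nprocs; i++)
        if (demo_find_linear(tab, nentries, procs[i].pid) != NULL)
            matched++;
    return matched;
}

static int
demo_cmp_pid(const void *a, const void *b)
{
    int pa = ((const DemoBeEntry *) a)->pid;
    int pb = ((const DemoBeEntry *) b)->pid;

    return (pa > pb) - (pa < pb);
}

/* Assumed alternative: sort a private copy once per pass, O(n log n). */
static void
demo_sort_by_pid(DemoBeEntry *tab, size_t n)
{
    qsort(tab, n, sizeof(DemoBeEntry), demo_cmp_pid);
}

/* Binary search on the sorted copy: O(log n) per call. */
static const DemoBeEntry *
demo_find_sorted(const DemoBeEntry *sorted, size_t n, int pid)
{
    DemoBeEntry key = { pid, 0 };

    assert(sorted != NULL || n == 0);
    return bsearch(&key, sorted, n, sizeof(DemoBeEntry), demo_cmp_pid);
}
```

Whether sorting a copy would pay off in the real collector depends on how expensive it is to read the status table in the first place, which this sketch does not model.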

Everyone is welcome to review the code and share their thoughts.

Medvecrab mentioned this pull request on Mar 26, 2025: additional dimensions #94 (Open)

DmitryNFomin commented Mar 26, 2025

@Medvecrab
many thanks for this.

The performance penalty is exactly the risk I would like to avoid, while still having some extra dimensions.
If we add fields from PGPROC (isBackgroundWorker, databaseId and roleId, like here in #95), we do not increase the observer effect.

So my suggestion is to add fields from PGPROC to the "main" views and fields from localBackendStatusTable to the _extended views.

Medvecrab (Contributor, Author) commented Mar 26, 2025

> Performance penalty is exact risk that I would like to avoid while still want to have some extra dimensions. If we add fields from PGPROC (isBackgroundWorker, databaseId and roleId, like here in #95) we do not get increased observer effect.
> So my suggestion to add fields from PGPROC to the "main" views and fields from localBackendStatusTable to _extended views

The idea of *_extended views is to add ALL additional fields to them (also any that could be added in the future) so the original views remain the same for backwards compatibility. I agree that if we were rewriting pg_wait_sampling anew, it would make some sense to distribute columns differently, but alas. This is also the reason that with my patch we still set profiling per profile/query_id with only the old GUC variables, not with the new ones

When all fields from localBackendStatusTable are turned off (that is, none of them are listed in either history_dimensions or profile_dimensions), performance shouldn't take a hit (there are specific guards for those cases in the PR).

DmitryNFomin commented Mar 27, 2025
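A guard of this kind can be sketched as a bitmask test. This is a rough illustration, not the PR's actual code: the `DEMO_DIM_*` bit names and values are hypothetical, and which dimensions come from PGPROC versus localBackendStatusTable is assumed here for the sake of the example.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical dimension bits; the PR's real names and values may differ. */
#define DEMO_DIM_PID          (1u << 0)
#define DEMO_DIM_EVENT        (1u << 1)
#define DEMO_DIM_QUERY_ID     (1u << 2)
#define DEMO_DIM_DATABASE_ID  (1u << 3)  /* assumed readable from PGPROC */
#define DEMO_DIM_BACKEND_TYPE (1u << 4)  /* assumed to need the status table */
#define DEMO_DIM_CLIENT_ADDR  (1u << 5)  /* assumed to need the status table */

/* All dimensions that require a status table lookup in this sketch. */
#define DEMO_BEENTRY_DIMS (DEMO_DIM_BACKEND_TYPE | DEMO_DIM_CLIENT_ADDR)

/* Compile-time check that the cheap dimensions stay out of the costly set. */
static_assert((DEMO_BEENTRY_DIMS & (DEMO_DIM_PID | DEMO_DIM_EVENT)) == 0,
              "cheap dimensions must not require the status table");

/*
 * True if at least one enabled dimension, in either the history or the
 * profile mask, needs the status table. When false, the collector can
 * skip the lookup entirely.
 */
static bool
demo_needs_beentry(uint32_t history_mask, uint32_t profile_mask)
{
    return ((history_mask | profile_mask) & DEMO_BEENTRY_DIMS) != 0;
}
```

With both masks limited to PGPROC-derived dimensions, `demo_needs_beentry` returns false and the sampling loop never touches the status table.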

Agree with your point. Just one more comment: can we add isBackgroundWorker from PGPROC? I understand that backend_type is more detailed, but we can get isBackgroundWorker without a performance penalty, and it already lets us distinguish regular backends in performance analysis.

Medvecrab (Contributor, Author) commented Mar 27, 2025

Seems like a fair request, will probably add after/during review

Oleg Tselebrovskiy and others added 11 commits

June 26, 2025 11:44

Add profile_extended and history_extended views with additional dimensions

6cd4efa

Sometimes it can be useful to have additional info collected with wait_events, so we add two new views/functions that include more information. The structure of those views could be changed in new versions of pg_wait_sampling extension

Update README to include information about new *_extended views

9208eb2

Also fix some typos/reword some sentences

Fixes after review

f0ee939

Fixes after review, part 2

292aaa9

Add serialization

da75db0

Fixing serialization

46af7da

Add history reset

64efec7

remove old functions for good

f6a2203

Do as pg_stat_statements does with different extension versions

09fc662

Fixes after self-review

42b4828

Fix and add info about sampling dimensions to README

01c45a4

Medvecrab force-pushed the new_dimensions branch from 1597439 to 01c45a4

June 26, 2025 04:47

shinderuk mentioned this pull request

Aug 21, 2025

Add WAIT_EVENT_CLIENT_READ_IN_TRANSACTION wait event #102

Closed

Medvecrab added 2 commits

September 23, 2025 15:16

Fixes after review

885496f

- Rename some columns in code and in GUC allowed values
- Change client_addr type to inet as in pg_stat_activity
- event value for dimension GUC now turns both wait_event and wait_event_type on and off; event_type is not supported
- README formatting
- Hide leader_pid for leader process
- Reset profile and history after reloading configuration, but only if dimensions have changed
- Disallow passing empty string as dimensions
- pid and queryid are now NULL if they are not in dimensions
- empty client_hostname is now NULL like in pg_stat_activity
- empty appname is now shown as empty string as in pg_stat_activity
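Several of these points concern how the dimensions GUC value is interpreted. The following is a minimal sketch of such parsing, assuming a comma-separated list without whitespace. Only the `event` value and its effect on both wait_event and wait_event_type come from the commit message; the other element names, the `DEMO_DIM_*` bits and the function names are hypothetical, and the real check hook may well behave differently.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical dimension bits for this sketch. */
#define DEMO_DIM_PID             (1u << 0)
#define DEMO_DIM_WAIT_EVENT_TYPE (1u << 1)
#define DEMO_DIM_WAIT_EVENT      (1u << 2)
#define DEMO_DIM_QUERY_ID        (1u << 3)

/* Parse a comma-separated list into a bitmask; returns false on bad input. */
static bool
demo_parse_dimensions(const char *value, uint32_t *mask_out)
{
    uint32_t    mask = 0;
    const char *p = value;

    assert(value != NULL && mask_out != NULL);
    if (*p == '\0')
        return false;           /* empty string is rejected */

    while (true)
    {
        size_t len = strcspn(p, ",");   /* length of the current element */

        if (len == 3 && strncmp(p, "pid", 3) == 0)
            mask |= DEMO_DIM_PID;
        else if (len == 5 && strncmp(p, "event", 5) == 0)
            mask |= DEMO_DIM_WAIT_EVENT_TYPE | DEMO_DIM_WAIT_EVENT;
        else if (len == 7 && strncmp(p, "queryid", 7) == 0)
            mask |= DEMO_DIM_QUERY_ID;
        else
            return false;       /* unknown or empty element, e.g. "event_type" */

        if (p[len] == '\0')
            break;
        p += len + 1;           /* skip past the comma */
    }

    *mask_out = mask;
    return true;
}

/* Reset collected data on reload only if the set of dimensions changed. */
static bool
demo_should_reset(uint32_t old_mask, uint32_t new_mask)
{
    return old_mask != new_mask;
}
```

Comparing masks rather than raw strings means that reordering the same dimensions in the configuration would not trigger a reset in this sketch.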

More fixes after review

5bc04b5

- Remove unneeded palloc
- Move initialization of possibly-null fields behind mask-checking (in fill_values_and_nulls function)
- Switch api_version and parameter amount in *_internal functions, this makes safety check look better
- Remove pgstat_clear_*** from pg_wait_sampling_get_current, it could mess with other functions unexpectedly
- Use correct bitmask in get_profile/history_internal
- Add dimensions_mask to Profile and History to save it through multiple calls to get_profile/history
- Use statically allocated variables ts and count in deserialize_array
- Take care of padding in deserialize_array
- Leave only common_dimensions in probe_waits
- Remove unneeded +1 in serialize_item
- Remove serialized_key in probe_waits, it's not needed
- Add pfree to stop leaking serialized_item
- Always copy only count to hash table in probe_waits
