a newapi.insightsUserStatusCountsOverTime endpoint to the API
which calls a newGetUserStatusCountsOverTime query from postgres
which relies on two new tablesuser_status_changes anduser_deleted
which are populated by a new trigger and function that tracks updates to the users table

The graph itself will be added in a subsequent PR

github-actionsbot assignedSasSwart

Jan 3, 2025

SasSwart requested a review frommafredri

January 3, 2025 07:11

SasSwart mentioned this pull request

Jan 3, 2025

feat(site): display user status counts over time as an indicator of license usage#15893

Closed

SasSwart requested a review fromEmyrk

January 3, 2025 07:15

SasSwart mentioned this pull request

Jan 3, 2025

DAU graph misleads admins on license utilization#15297

Closed

6 tasks

SasSwart marked this pull request as ready for review

January 3, 2025 07:50

SasSwart mentioned this pull request

Jan 3, 2025

feat(site): display user status history as an indication of license usage#16020

Closed

Emyrk reviewed

Jan 3, 2025

View reviewed changes

Copy link

Member

Emyrk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Small nit, can we use the wordChart instead ofGraph in various comments when referring to the frontend display?

For a moment I was wondering why we needed a graph data structure.

On deleting a user more than once. We should enforce this at the db_schema level to not allow it if it's going to break our logic. An index could be made on theuser_deleted(user_id).

coderd/database/migrations/000282_user_status_changes.up.sql

Comment on lines 24 to 28

		CREATETABLEuser_deleted (
		id uuidPRIMARY KEY DEFAULT gen_random_uuid(),
		user_id uuidNOT NULLREFERENCES users(id),
		deleted_attimestamptzNOT NULL DEFAULTCURRENT_TIMESTAMP
		);

Copy link

Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Will we ever be able to "un-delete" a user? Would is be easy to implement this to handle that case if we ever support it?

Copy link

Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Why we don't implement this as a user status change? Perhaps as an additional field on the table or modify the enum to include deleted. Wrt to changing the enum, it doesn't make much sense to me to differentiate between e.g. deleted/active or deleted/dormant users (or whatever the possibilities are), so changing it makes sense IMO.

Copy link

Member

EmyrkJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

That is true. I assume we can still make an index with a conditional expression onstatus == deleted. We might want to have some trigger or something to prevent "un-deletion" if we want to prevent that.

Copy link

ContributorAuthor

SasSwartJan 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I considered these options, but avoided them simply because they expand the scope beyond a reasonable time cost for this feature. We could relatively easy migrate from this to a space wheredeleted is just another user status.

I'm trying to remain agnostic of the massive rabbit-hole shaped question of what the business rule should be around un/re-deletion.

The trigger and function right now would handle un/re-deletion correctly, but the query would break as identified by@Emyrk in another comment below.

I would prefer that we stick to the current implementation because it is good enough and I can't find anywhere that we support un/re-deletion right now. If y'all would like to explicitly request a specific change I'd be happy to consider it.

Copy link

Member

EmyrkJan 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

@SasSwart can we open an issue to address un-deletion and mention this query if we decide to support it?

Just to track it

coderd/database/migrations/000282_user_status_changes.up.sqlShow resolvedHide resolved

coderd/database/querier.go OutdatedShow resolvedHide resolved

coderd/database/queries/insights.sql

Comment on lines +798 to +799

		WHERE changed_at> @start_time::timestamptz
		AND changed_at< @end_time::timestamptz

Copy link

Member

EmyrkJan 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

If it is inclusive, should it be?

Suggested change

	WHERE changed_at> @start_time::timestamptz
	AND changed_at< @end_time::timestamptz
	WHERE changed_at>= @start_time::timestamptz
	AND changed_at<= @end_time::timestamptz

Copy link

Member

mafredriJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I think that's covered by the initial and final union.

Personally I prefer start_time inclusive, end_time exclusive though, as most of the time that mirrors the intuitive understanding of data. Say we want to view Monday to Sunday (1 week), we ultimately want Monday 00:00 -> Sunday 23:59 but it's simplest to request this as Monday 00:00 -> Monday 00:00 instead of subtracting an arbitrary unit of time or using an end-of-day function. Alternatively, if we include Monday 00:00 at the end, we include data from the beyond the range.

Copy link

ContributorAuthor

SasSwartJan 9, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I think that's covered by the initial and final union.

Correct.

Monday 00:00 -> Monday 00:00 instead of subtracting an arbitrary unit of time or using an end-of-day function

I think the inclusive approach is good enough. There are higher impact changes to make. If you'd like me to change it, please ask me directly to do so and I will. My own opinion is that for how this chart is going to be displayed inclusive works well enough.

coderd/database/queries/insights.sql OutdatedShow resolvedHide resolved

coderd/database/queries/insights.sql Outdated

		usc.new_status,
		usc.changed_at
		FROM user_status_changes usc
		LEFT JOIN user_deleted udONusc.user_id=ud.user_id

Copy link

Member

EmyrkJan 3, 2025•
edited
Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

If a user is deleted twice in it's history (should not be possible, but would be nice to be durable against), then this join will duplicate the status changed rows for each deletion event.

We could make the table a subquery that returns 1 row, or we should enforce a user can only be deleted once.

Minimal example:https://www.db-fiddle.com/f/4jyoMCicNSZpjMt4jFYoz5/15847

Emyrk reviewed

Jan 3, 2025

View reviewed changes

coderd/insights.goShow resolvedHide resolved

mafredri reviewed

Jan 8, 2025

View reviewed changes

Copy link

Member

mafredri left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Nice work and good job adding all that test coverage. A few suggestions and questions, but nothing major.

coderd/database/migrations/000282_user_status_changes.up.sql

Comment on lines 24 to 28

		CREATETABLEuser_deleted (
		id uuidPRIMARY KEY DEFAULT gen_random_uuid(),
		user_id uuidNOT NULLREFERENCES users(id),
		deleted_attimestamptzNOT NULL DEFAULTCURRENT_TIMESTAMP
		);

Copy link

Member

mafredriJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

coderd/database/queries/insights.sql

Comment on lines +798 to +799

		WHERE changed_at> @start_time::timestamptz
		AND changed_at< @end_time::timestamptz

Copy link

Member

mafredriJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I think that's covered by the initial and final union.

coderd/database/queries/insights.sql OutdatedShow resolvedHide resolved

coderd/database/queries/insights.sql

		rsc1.new_status
		FROM dates_of_interest d
		LEFT JOIN relevant_status_changes rsc1ONrsc1.changed_at<=d.date
		)

Copy link

Member

mafredriJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'm a bit worried about the number of CTEs and performance, but I think this is fine for now and let's not prematurely optimize. Just raising this so you're aware that in some cases a CTE performs worse than a pure query with joins and subqueries. That's mainly because the result of a CTE lacks indexes.

Copy link

Member

EmyrkJan 8, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I was wondering about performance too, but user count and user status changes is going to be way less data than say workspace builds. Like on the order of 1000s. 🤷

Might be worth a benchmark, but we don't have a good way to populate "large" datasets atm.

Copy link

ContributorAuthor

SasSwartJan 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I did it this way for legibility. Trying to write the optimal solution here made my own query inscrutable even to myself. I don't consider this to be on a particularly hot path. We do have metrics for this query. If it needs to be optimised, we can do so.

Copy link

ContributorAuthor

SasSwartJan 9, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Just raising this so you're aware that in some cases a CTE performs worse than a pure query with joins and subqueries

Also these CTEs were designed to reduce the amount of data being handled as early as possible. We use the existing indices while we have them to identify exactly the data we need ASAP and then once we need to join it all the hope is that we've brought down the volume low enough that it's performant regardless of the lack of indices on CTEs.

coderd/insights.goShow resolvedHide resolved

coderd/insights.go OutdatedShow resolvedHide resolved

coderd/coderd.go OutdatedShow resolvedHide resolved

coderd/database/querier_test.go OutdatedShow resolvedHide resolved

SasSwart added20 commits

January 9, 2025 10:33

add user_status_changes table

cd953a3

add GetUserStatusCountsByDay

ec16728

rename unused variable

0d97e82

Test GetUserStatusCountsByDay

eb6e249

make gen

7b2c259

fix dbauthz tests

89177b2

do the plumbing to get sql, api and frontend talking to one another

877517f

rename migration

0b3e0e6

move aggregation logic for GetUserStatusChanges into the SQL

6462cc2

use window functions for efficiency

ccd0cdf

ensure we use the same time zone as the start_time param

12a274f

ensure we use the same time zone as the start_time param

fcfd76e

make gen

7c0cade

update field names and fix tests

ecffc8b

exclude deleted users from the user status graph

f3a2ce3

GetUserStatusChanges now passes all querier tests

5067a63

renumber migrations

254d436

add partial fixture for CI

3e86522

fix migration numbers

2e49e4c

rename and document sql function

ff59729

SasSwart added4 commits

January 9, 2025 10:34

make gen

b1ad074

Remove unwanted comments from the generated interface

de4081f

review notes

4de334f

make gen

8fca0c5

SasSwart force-pushed thejjs/dau-history-backend branch from699ee8a to8fca0c5Compare

January 9, 2025 10:48

SasSwart added3 commits

January 9, 2025 10:50

remove frontend changes

b06179c

rename GetUserStatusCountsOverTime to GetUserStatusCounts

213b288

fix tests

9457ac8

mafredri approved these changes

Jan 10, 2025

View reviewed changes

coderd/database/queries/insights.sql OutdatedShow resolvedHide resolved

SasSwartand others added5 commits

January 10, 2025 12:42

Update coderd/database/queries/insights.sql

63128a3

Co-authored-by: Mathias Fredriksson <mafredri@gmail.com>

Provide basic durability against multiple deletions

89f0a11

Provide basic durability against multiple deletions

89ebab2

populate the user_deleted_table

012f14c

formatting

c2efd97

SasSwart merged commit4543b21 intomain

Jan 13, 2025

36 checks passed

SasSwart deleted the jjs/dau-history-backend branch

January 13, 2025 11:08

github-actionsbot locked and limited conversation to collaborators

Jan 13, 2025

Labels

None yet

Movatterモバイル変換

feat(coderd/database): track user status changes over time#16019

feat(coderd/database): track user status changes over time#16019

Uh oh!

Conversation

SasSwart commentedJan 3, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Uh oh!

Emyrk left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

SasSwartJan 9, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

EmyrkJan 3, 2025• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mafredri left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SasSwart commentedJan 3, 2025•
edited
Loading

SasSwartJan 9, 2025•
edited
Loading

EmyrkJan 3, 2025•
edited
Loading