feat: add status watcher to MCP server #18320

Merged: code-asher merged 15 commits into main from asher/report-task on Jun 13, 2025.

Conversation

code-asher (Member) commented on Jun 11, 2025 (edited):

This is meant to complement the existing task reporter, since the LLM does not call it reliably.

I also did some refactoring to use the common agent flags/env vars; those changes are in separate commits.

For #18163, but the modules will need to be updated to make use of it.

code-asher changed the title from "Add status watcher to MCP server" to "feat: add status watcher to MCP server" on Jun 11, 2025.
code-asher force-pushed the asher/report-task branch 5 times, most recently from fd1b20f to f8d7628, on June 11, 2025 10:38.
hugodutka (Contributor) left a comment:


Idea for getting rid of that "a message from the screen watcher and a message from the LLM could arrive out of order if the timing is just right" race condition (a rough sketch follows the list):

  • maintain a state object that keeps track of
    • an array of pending updates with timestamps
    • the latest status reported to the server, with a timestamp
  • instead of submitting status updates to a queue, push them to the array of pending updates
  • in a separate loop that ticks every second, process the pending updates in batch: act on a "screen stable" update only if it is at least 1 second old and no "screen changing" update arrived after it; leave it in the pending array if it is not old enough; drop it if a later "screen changing" update superseded it
  • keep the existing deduplication logic that takes into account conversation length and latest status
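
A minimal sketch of that tick loop, assuming hypothetical names (statusKind, pendingUpdate, watcher) and a caller-supplied report callback standing in for the call to coderd; it illustrates the idea rather than the PR's actual code:

```go
package statuswatch

import (
	"sync"
	"time"
)

type statusKind int

const (
	screenStable statusKind = iota
	screenChanging
)

type pendingUpdate struct {
	kind statusKind
	at   time.Time
}

type watcher struct {
	mu       sync.Mutex
	pending  []pendingUpdate // pending updates with timestamps
	lastSent *pendingUpdate  // latest status reported to the server
}

// push records an update instead of submitting it straight to a queue.
// Both the screen watcher and the LLM-facing tool would call this.
func (w *watcher) push(kind statusKind) {
	w.mu.Lock()
	defer w.mu.Unlock()
	w.pending = append(w.pending, pendingUpdate{kind: kind, at: time.Now()})
}

// tick processes the pending updates in batch; drive it once per second,
// for example from a time.Ticker goroutine.
func (w *watcher) tick(report func(statusKind)) {
	w.mu.Lock()
	defer w.mu.Unlock()
	var keep []pendingUpdate
	for i, u := range w.pending {
		if u.kind == screenStable {
			superseded := false
			for _, later := range w.pending[i+1:] {
				if later.kind == screenChanging {
					superseded = true
					break
				}
			}
			if superseded {
				continue // a "screen changing" update arrived after it: drop
			}
			if time.Since(u.at) < time.Second {
				keep = append(keep, u) // not old enough yet: leave pending
				continue
			}
		}
		// Stand-in for the existing deduplication logic (conversation
		// length and latest status).
		if w.lastSent == nil || u.kind != w.lastSent.kind {
			report(u.kind)
			u := u // copy before taking the address
			w.lastSent = &u
		}
	}
	w.pending = keep
}
```

Because "screen stable" is only acted on after a full second with no subsequent "screen changing", an LLM status that lands just before or after the screen settles can no longer race it out of order.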

Up to you if you want to implement it. I'm fine with merging the current version once you address the feedback.


// tryCreateAgentClient returns a new client from the command context. It works
// just like createAgentClient, but does not error.
func (r *RootCmd) tryCreateAgentClient() (*agentsdk.Client, error) {
hugodutka (Contributor) commented:

I'd expect the names to be swapped: tryCreateAgentClient should return an error, createAgentClient should not. "try" implies the possibility of failure.

code-asher (Member, Author) commented on Jun 12, 2025 (edited):

Very fair. For this, though, I just mirrored TryInitClient. I think the logic is that try here is more like attempt, in the sense that failing is OK, or maybe in the way that try in try/catch suppresses errors. Happy to rename it, but maybe we should rename both if we do.

hugodutka (Contributor) replied:

Renaming both would be perfect.

code-asher force-pushed the asher/report-task branch 2 times, most recently from 43350ec to 5512b1a, on June 12, 2025 02:11.
For now, the old function has been renamed to tryCreateAgentClient because it is not clear to me if it is safe to error in these cases or if it was intentional to create a client even if there was no valid URL or token.

Before, we were separately reading the environment, but we can reuse the global flags and the function for creating an agent client. Also, since the tests no longer need to set environment variables, they can now be run in parallel. This is not fully backwards compatible in that the old method would fall back to the non-agent client URL if the agent URL was not set, but that seems like undesired behavior since we are not doing that anywhere else.

Since we can now get status updates from two places, they are placed in a queue so we can handle them one at a time.

Instead of the pop phase. This ensures we do not queue up updates that will just end up being discarded once they are popped (which could take some time due to latency to coderd). It also has the side effect of preserving summaries even when the queue gets too big, because now we preserve them as part of pushing, before they might get lost by being dropped while we wait on coderd.
code-asher (Member, Author) commented:
I force-pushed to break out the UI fix, but the remaining commits are unchanged.

Then I moved the update predicates to the push phase instead of the pop phase, so now, if we are waiting on coderd, we will not queue up a bunch of duplicate updates that will just end up getting discarded later anyway. Now they get discarded immediately.

This could still use channels; I would just need a mutex over lastReport/lastMessageID. But although I know we are unlikely to hit the limit, I do still like that the queue can drop the oldest update.

I am also keeping the last summary from the agent during the push phase instead, which is nice because now, even if an agent update is dropped while waiting on coderd because the queue got too big, we can still log it with the next agentapi status update. Granted, with a buffer of 512 this is probably not going to actually happen.
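
A condensed sketch of that shape, assuming hypothetical update and updateQueue types rather than the PR's real code: the predicates run at push time, so duplicates are discarded immediately, and a full queue drops its oldest entry while carrying the summary forward.

```go
package statuswatch

import "sync"

type update struct {
	status  string
	summary string
}

type updateQueue struct {
	mu    sync.Mutex
	items []update
	limit int    // e.g. 512
	last  update // last accepted update, for dedup at push time
}

// push applies the update predicates at enqueue time, so duplicate
// updates are discarded now instead of after waiting on coderd.
func (q *updateQueue) push(u update) {
	q.mu.Lock()
	defer q.mu.Unlock()
	if u == q.last {
		return // duplicate: discard immediately
	}
	if len(q.items) >= q.limit {
		// Drop the oldest update, but keep its summary so it can still
		// be logged with the next status update.
		if dropped := q.items[0]; u.summary == "" {
			u.summary = dropped.summary
		}
		q.items = q.items[1:]
	}
	q.last = u
	q.items = append(q.items, u)
}

// pop hands updates to the coderd reporter one at a time.
func (q *updateQueue) pop() (update, bool) {
	q.mu.Lock()
	defer q.mu.Unlock()
	if len(q.items) == 0 {
		return update{}, false
	}
	u := q.items[0]
	q.items = q.items[1:]
	return u, true
}
```

A channel-based version, as mentioned above, would swap the slice for a buffered channel plus a mutex over lastReport/lastMessageID; the tradeoff is that a full channel blocks or drops the newest update, while this queue can drop the oldest.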

code-asher marked this pull request as ready for review on June 12, 2025 23:30.
hugodutka (Contributor) left a comment (edited):

That's awesome. I set it up in a local deployment and it works! Both for reporting status if the AI agent doesn't, and for completing statuses the AI agent didn't.

Minor nits on naming: I believe it's more accurate to refer to Claude Code and others as "AI agents" rather than LLMs. Also, whenever referring to the URL of a coder/agentapi process, I think we should say "AI agentapi URL" rather than "LLM agent URL" for accuracy.

Also, TODO: we need to update the Claude Code and Goose Terraform modules to use this.

Also, any references to the API are renamed to "AgentAPI" rather than just "agent".
code-asher merged commit 4bd5609 into main on Jun 13, 2025.
35 checks passed
code-asher deleted the asher/report-task branch on June 13, 2025 20:53.
The github-actions bot locked and limited conversation to collaborators on Jun 13, 2025.

Reviewers: hugodutka (approved these changes)

Assignees: code-asher

Labels: none yet

Milestone: no milestone

Participants: code-asher, hugodutka
