Commit a25eb96

Adding docs for MCP sampling (#2027)

1 parent 473b2ce, commit a25eb96

File tree

20 files changed: +335 -87 lines changed

.gitignore
docs
- api/models
  - mcp-sampling.md
- common-tools.md
- dependencies.md
- evals.md
- graph.md
- input.md
- mcp
- testing.md
- tools.md
mkdocs.yml
pydantic_ai_slim/pydantic_ai
- mcp.py
- models
  - mcp_sampling.py
- settings.py
pydantic_graph/pydantic_graph
- graph.py
tests
- example_modules
  - mcp_server.py
- test_examples.py
- test_mcp.py

`‎.gitignore`

Lines changed: 1 addition & 0 deletions

Original file line number	Diff line number	Diff line change
`@@ -19,3 +19,4 @@ examples/pydantic_ai_examples/.chat_app_messages.sqlite`
`19`	`19`	`node_modules/`
`20`	`20`	`**.idea/`
`21`	`21`	`.coverage*`
	`22`	`+/test_tmp/`

`‎docs/api/models/mcp-sampling.md`

Lines changed: 3 additions & 0 deletions

Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+# pydantic_ai.models.mcp_sampling`
	`2`	`+`
	`3`	`+::: pydantic_ai.models.mcp_sampling`

`‎docs/common-tools.md`

Lines changed: 2 additions & 2 deletions

Original file line number	Diff line number	Diff line change
`@@ -20,7 +20,7 @@ pip/uv-add "pydantic-ai-slim[duckduckgo]"`
`20`	`20`
`21`	`21`	`Here's an example of how you can use the DuckDuckGo search tool with an agent:`
`22`	`22`
`23`		-```py {title="main.py" test="skip"}
	`23`	+```py {title="duckduckgo_search.py" test="skip"}
`24`	`24`	`from pydantic_ai import Agent`
`25`	`25`	`from pydantic_ai.common_tools.duckduckgo import duckduckgo_search_tool`
`26`	`26`
`@@ -103,7 +103,7 @@ pip/uv-add "pydantic-ai-slim[tavily]"`
`103`	`103`
`104`	`104`	`Here's an example of how you can use the Tavily search tool with an agent:`
`105`	`105`
`106`		-```py {title="main.py" test="skip"}
	`106`	+```py {title="tavily_search.py" test="skip"}
`107`	`107`	`import os`
`108`	`108`
`109`	`109`	`from pydantic_ai.agent import Agent`

`‎docs/dependencies.md`

Lines changed: 1 addition & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -276,7 +276,7 @@ async def application_code(prompt: str) -> str: # (3)!`
`276`	`276`
`277`	`277`	`_(This example is complete, it can be run "as is")_`
`278`	`278`
`279`		-```python {title="test_joke_app.py" hl_lines="10-12" call_name="test_application_code"}
	`279`	+```python {title="test_joke_app.py" hl_lines="10-12" call_name="test_application_code" requires="joke_app.py"}
`280`	`280`	`from joke_app import MyDeps, application_code, joke_agent`
`281`	`281`
`282`	`282`
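
The `requires="..."` attribute added to this fence (and to several fences in the files below) names a sibling example file that the snippet imports from. As a rough illustration of how such an info string could be read, here is a minimal sketch. `parse_fence_info` and `required_files` are hypothetical helpers, not the project's actual test harness, and treating `requires` as a comma-separated list is an assumption.

```python
import re

# Matches key="value" pairs; spaces between pairs are optional.
ATTR_RE = re.compile(r'(\w+)="([^"]*)"')


def parse_fence_info(info: str) -> tuple[str, dict[str, str]]:
    """Split an info string such as 'py {title="a.py" requires="b.py"}'
    into the language tag and a dict of attributes (hypothetical helper)."""
    lang, _, rest = info.strip().partition(' ')
    return lang, dict(ATTR_RE.findall(rest))


def required_files(info: str) -> list[str]:
    """Return the files named by `requires`, assuming a comma-separated list."""
    _, attrs = parse_fence_info(info)
    return [name.strip() for name in attrs.get('requires', '').split(',') if name.strip()]


info = 'python {title="test_joke_app.py" hl_lines="10-12" requires="joke_app.py"}'
lang, attrs = parse_fence_info(info)
print(lang, attrs['title'], required_files(info))
```

A harness built along these lines could write each required file next to the snippet before running it, which is presumably why the dependent examples now declare their prerequisites explicitly.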

`‎docs/evals.md`

Lines changed: 2 additions & 2 deletions

Original file line number	Diff line number	Diff line change
`@@ -55,7 +55,7 @@ Evaluators are the components that analyze and score the results of your task wh`
`55`	`55`
`56`	`56`	`Pydantic Evals includes several built-in evaluators and allows you to create custom evaluators:`
`57`	`57`
`58`		-```python {title="simple_eval_evaluator.py"}
	`58`	+```python {title="simple_eval_evaluator.py" requires="simple_eval_dataset.py"}
`59`	`59`	`from dataclasses import dataclass`
`60`	`60`
`61`	`61`	`from simple_eval_dataset import dataset`
@@ -616,7 +616,7 @@ _(This example is complete, it can be run "as is" — you'll need to add `asynci
`616`	`616`
`617`	`617`	`You can also write datasets as JSON files:`
`618`	`618`
`619`		-```python {title="generate_dataset_example_json.py"}
	`619`	+```python {title="generate_dataset_example_json.py" requires="generate_dataset_example.py"}
`620`	`620`	`from pathlib import Path`
`621`	`621`
`622`	`622`	`from generate_dataset_example import AnswerOutput, MetadataType, QuestionInputs`

`‎docs/graph.md`

Lines changed: 6 additions & 6 deletions

Original file line number	Diff line number	Diff line change
`@@ -167,7 +167,7 @@ _(This example is complete, it can be run "as is" with Python 3.10+)_`
`167`	`167`
`168`	`168`	`A [mermaid diagram](#mermaid-diagrams) for this graph can be generated with the following code:`
`169`	`169`
`170`		-```py {title="graph_example_diagram.py" py="3.10"}
	`170`	+```py {title="graph_example_diagram.py" py="3.10" requires="graph_example.py"}
`171`	`171`	`from graph_example import DivisibleBy5, fives_graph`
`172`	`172`
`173`	`173`	`fives_graph.mermaid_code(start_node=DivisibleBy5)`
`@@ -308,7 +308,7 @@ _(This example is complete, it can be run "as is" with Python 3.10+ — you'll n`
`308`	`308`
`309`	`309`	`A [mermaid diagram](#mermaid-diagrams) for this graph can be generated with the following code:`
`310`	`310`
`311`		-```py {title="vending_machine_diagram.py" py="3.10"}
	`311`	+```py {title="vending_machine_diagram.py" py="3.10" requires="vending_machine.py"}
`312`	`312`	`from vending_machine import InsertCoin, vending_machine_graph`
`313`	`313`
`314`	`314`	`vending_machine_graph.mermaid_code(start_node=InsertCoin)`
@@ -524,7 +524,7 @@ Alternatively, you can drive iteration manually with the [`GraphRun.next`][pydan
`524`	`524`
`525`	`525`	`Below is a contrived example that stops whenever the counter is at 2, ignoring any node runs beyond that:`
`526`	`526`
`527`		-```python {title="count_down_next.py" noqa="I001" py="3.10"}
	`527`	+```python {title="count_down_next.py" noqa="I001" py="3.10" requires="count_down.py"}
`528`	`528`	`from pydantic_graph import End, FullStatePersistence`
`529`	`529`	`from count_down import CountDown, CountDownState, count_down_graph`
`530`	`530`
@@ -593,7 +593,7 @@ We can run the `count_down_graph` from [above](#iterating-over-a-graph), using [
`593`	`593`
`594`	`594`	As you can see in this code, `run_node` requires no external application state (apart from state persistence) to be run, meaning graphs can easily be executed by distributed execution and queueing systems.
`595`	`595`
`596`		-```python {title="count_down_from_persistence.py" noqa="I001" py="3.10"}
	`596`	+```python {title="count_down_from_persistence.py" noqa="I001" py="3.10" requires="count_down.py"}
`597`	`597`	`from pathlib import Path`
`598`	`598`
`599`	`599`	`from pydantic_graph import End`
`@@ -746,7 +746,7 @@ Instead of running the entire graph in a single process invocation, we run the g`
`746`	`746`
`747`	`747`	`_(This example is complete, it can be run "as is" with Python 3.10+)_`
`748`	`748`
`749`		-```python {title="ai_q_and_a_run.py" noqa="I001" py="3.10"}
	`749`	+```python {title="ai_q_and_a_run.py" noqa="I001" py="3.10" requires="ai_q_and_a_graph.py"}
`750`	`750`	`import sys`
`751`	`751`	`from pathlib import Path`
`752`	`752`
`@@ -965,7 +965,7 @@ You can specify the direction of the state diagram using one of the following va`
`965`	`965`	-`'BT'`: Bottom to top, the diagram flows vertically from bottom to top.
`966`	`966`
`967`	`967`	`Here is an example of how to do this using 'Left to Right' (LR) instead of the default 'Top to Bottom' (TB):`
`968`		-```py {title="vending_machine_diagram.py" py="3.10"}
	`968`	+```py {title="vending_machine_diagram.py" py="3.10" requires="vending_machine.py"}
`969`	`969`	`from vending_machine import InsertCoin, vending_machine_graph`
`970`	`970`
`971`	`971`	`vending_machine_graph.mermaid_code(start_node=InsertCoin, direction='LR')`

`‎docs/input.md`

Lines changed: 4 additions & 4 deletions

Original file line number	Diff line number	Diff line change
`@@ -10,7 +10,7 @@ Some LLMs are now capable of understanding audio, video, image and document cont`
`10`	`10`
`11`	`11`	If you have a direct URL for the image, you can use [`ImageUrl`][pydantic_ai.ImageUrl]:
`12`	`12`
`13`		-```py {title="main.py" test="skip" lint="skip"}
	`13`	+```py {title="image_input.py" test="skip" lint="skip"}
`14`	`14`	`from pydantic_ai import Agent, ImageUrl`
`15`	`15`
`16`	`16`	`agent = Agent(model='openai:gpt-4o')`
`@@ -26,7 +26,7 @@ print(result.output)`
`26`	`26`
`27`	`27`	If you have the image locally, you can also use [`BinaryContent`][pydantic_ai.BinaryContent]:
`28`	`28`
`29`		-```py {title="main.py" test="skip" lint="skip"}
	`29`	+```py {title="local_image_input.py" test="skip" lint="skip"}
`30`	`30`	`import httpx`
`31`	`31`
`32`	`32`	`from pydantic_ai import Agent, BinaryContent`
@@ -69,7 +69,7 @@ You can provide document input using either [`DocumentUrl`][pydantic_ai.Document
`69`	`69`
`70`	`70`	If you have a direct URL for the document, you can use [`DocumentUrl`][pydantic_ai.DocumentUrl]:
`71`	`71`
`72`		-```py {title="main.py" test="skip" lint="skip"}
	`72`	+```py {title="document_input.py" test="skip" lint="skip"}
`73`	`73`	`from pydantic_ai import Agent, DocumentUrl`
`74`	`74`
`75`	`75`	`agent = Agent(model='anthropic:claude-3-sonnet')`
`@@ -87,7 +87,7 @@ The supported document formats vary by model.`
`87`	`87`
`88`	`88`	You can also use [`BinaryContent`][pydantic_ai.BinaryContent] to pass document data directly:
`89`	`89`
`90`		-```py {title="main.py" test="skip" lint="skip"}
	`90`	+```py {title="binary_content_input.py" test="skip" lint="skip"}
`91`	`91`	`from pathlib import Path`
`92`	`92`	`from pydantic_ai import Agent, BinaryContent`
`93`	`93`

`‎docs/mcp/client.md`

Lines changed: 110 additions & 3 deletions

Original file line number	Diff line number	Diff line change
`@@ -98,7 +98,7 @@ Will display as follows:`
`98`	`98`
`99`	`99`	`Before creating the Streamable HTTP client, we need to run a server that supports the Streamable HTTP transport.`
`100`	`100`
`101`		-```python {title="streamable_http_server.py" py="3.10" test="skip"}
	`101`	+```python {title="streamable_http_server.py" py="3.10" dunder_name="not_main"}
`102`	`102`	`from mcp.server.fastmcp import FastMCP`
`103`	`103`
`104`	`104`	`app = FastMCP()`
`@@ -107,7 +107,8 @@ app = FastMCP()`
`107`	`107`	`def add(a: int, b: int) -> int:`
`108`	`108`	`    return a + b`
`109`	`109`
`110`		`-app.run(transport='streamable-http')`
	`110`	`+if __name__ == '__main__':`
	`111`	`+    app.run(transport='streamable-http')`
`111`	`112`	```
`112`	`113`
`113`	`114`	`Then we can create the client:`
`@@ -194,7 +195,7 @@ async def process_tool_call(`
`194`	`195`	`    return await call_tool(tool_name, args, metadata={'deps': ctx.deps})`
`195`	`196`
`196`	`197`
`197`		`-server = MCPServerStdio('python', ['-m', 'tests.mcp_server'], process_tool_call=process_tool_call)`
	`198`	`+server = MCPServerStdio('python', ['mcp_server.py'], process_tool_call=process_tool_call)`
`198`	`199`	`agent = Agent(`
`199`	`200`	`    model=TestModel(call_tools=['echo_deps']),`
`200`	`201`	`    deps_type=int,`
`@@ -275,3 +276,109 @@ agent = Agent('openai:gpt-4o', mcp_servers=[python_server, js_server])`
`275`	`276`	```
`276`	`277`
`277`	`278`	`When the model interacts with these servers, it will see the prefixed tool names, but the prefixes will be automatically handled when making tool calls.`
	`279`	`+`
	`280`	`+## MCP Sampling`
	`281`	`+`
	`282`	`+!!! info "What is MCP Sampling?"`
	`283`	`+    In MCP [sampling](https://modelcontextprotocol.io/docs/concepts/sampling) is a system by which an MCP server can make LLM calls via the MCP client - effectively proxying requests to an LLM via the client over whatever transport is being used.`
	`284`	`+`
	`285`	`+Sampling is extremely useful when MCP servers need to use Gen AI but you don't want to provision them each with their own LLM credentials or when a public MCP server would like the connecting client to pay for LLM calls.`
	`286`	`+`
	`287`	`+Confusingly it has nothing to do with the concept of "sampling" in observability, or frankly the concept of "sampling" in any other domain.`
	`288`	`+`
	`289`	`+??? info "Sampling Diagram"`
	`290`	`+    Here's a mermaid diagram that may or may not make the data flow clearer:`
	`291`	`+`
	`292`	+    ```mermaid
	`293`	`+    sequenceDiagram`
	`294`	`+        participant LLM`
	`295`	`+        participant MCP_Client as MCP client`
	`296`	`+        participant MCP_Server as MCP server`
	`297`	`+`
	`298`	`+        MCP_Client->>LLM: LLM call`
	`299`	`+        LLM->>MCP_Client: LLM tool call response`
	`300`	`+`
	`301`	`+        MCP_Client->>MCP_Server: tool call`
	`302`	`+        MCP_Server->>MCP_Client: sampling "create message"`
	`303`	`+`
	`304`	`+        MCP_Client->>LLM: LLM call`
	`305`	`+        LLM->>MCP_Client: LLM text response`
	`306`	`+`
	`307`	`+        MCP_Client->>MCP_Server: sampling response`
	`308`	`+        MCP_Server->>MCP_Client: tool call response`
	`309`	+    ```
	`310`	`+`
	`311`	`+Pydantic AI supports sampling as both a client and server. See the [server](./server.md#mcp-sampling) documentation for details on how to use sampling within a server.`
	`312`	`+`
	`313`	`+Sampling is automatically supported by Pydantic AI agents when they act as a client.`
	`314`	`+`
	`315`	`+Let's say we have an MCP server that wants to use sampling (in this case to generate an SVG as per the tool arguments).`
	`316`	`+`
	`317`	`+??? example "Sampling MCP Server"`
	`318`	`+`
	`319`	+```python {title="generate_svg.py" py="3.10"}
	`320`	`+import re`
	`321`	`+from pathlib import Path`
	`322`	`+`
	`323`	`+from mcp import SamplingMessage`
	`324`	`+from mcp.server.fastmcp import Context, FastMCP`
	`325`	`+from mcp.types import TextContent`
	`326`	`+`
	`327`	`+app = FastMCP()`
	`328`	`+`
	`329`	`+`
	`330`	`+@app.tool()`
	`331`	`+async def image_generator(ctx: Context, subject: str, style: str) -> str:`
	`332`	`+    prompt = f'{subject=} {style=}'`
	`333`	+    # `ctx.session.create_message` is the sampling call
	`334`	`+    result = await ctx.session.create_message(`
	`335`	`+        [SamplingMessage(role='user', content=TextContent(type='text', text=prompt))],`
	`336`	`+        max_tokens=1_024,`
	`337`	`+        system_prompt='Generate an SVG image as per the user input',`
	`338`	`+    )`
	`339`	`+    assert isinstance(result.content, TextContent)`
	`340`	`+`
	`341`	`+    path = Path(f'{subject}_{style}.svg')`
	`342`	`+    # remove triple backticks if the svg was returned within markdown`
	`343`	+    if m := re.search(r'^```\w*$(.+?)```$', result.content.text, re.S | re.M):
	`344`	`+        path.write_text(m.group(1))`
	`345`	`+    else:`
	`346`	`+        path.write_text(result.content.text)`
	`347`	`+    return f'See {path}'`
	`348`	`+`
	`349`	`+`
	`350`	`+if __name__ == '__main__':`
	`351`	`+    # run the server via stdio`
	`352`	`+    app.run()`
	`353`	+```
	`354`	`+`
	`355`	+Using this server with an `Agent` will automatically allow sampling:
	`356`	`+`
	`357`	+```python {title="sampling_mcp_client.py" py="3.10" requires="generate_svg.py"}
	`358`	`+from pydantic_ai import Agent`
	`359`	`+from pydantic_ai.mcp import MCPServerStdio`
	`360`	`+`
	`361`	`+server = MCPServerStdio(command='python', args=['generate_svg.py'])`
	`362`	`+agent = Agent('openai:gpt-4o', mcp_servers=[server])`
	`363`	`+`
	`364`	`+`
	`365`	`+async def main():`
	`366`	`+    async with agent.run_mcp_servers():`
	`367`	`+        result = await agent.run('Create an image of a robot in a punk style.')`
	`368`	`+    print(result.output)`
	`369`	`+    #> Image file written to robot_punk.svg.`
	`370`	+```
	`371`	`+`
	`372`	`+_(This example is complete, it can be run "as is" with Python 3.10+)_`
	`373`	`+`
	`374`	+You can disallow sampling by setting [`allow_sampling=False`][pydantic_ai.mcp.MCPServerStdio.allow_sampling] when creating the server reference, e.g.:
	`375`	`+`
	`376`	+```python {title="sampling_disallowed.py" hl_lines="6" py="3.10"}
	`377`	`+from pydantic_ai.mcp import MCPServerStdio`
	`378`	`+`
	`379`	`+server = MCPServerStdio(`
	`380`	`+    command='python',`
	`381`	`+    args=['generate_svg.py'],`
	`382`	`+    allow_sampling=False,`
	`383`	`+)`
	`383`	`+)`
	`384`	+```
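
The round trip described in the new "MCP Sampling" section (the server's tool asks the client to make the LLM call, then post-processes the reply) can be sketched without any MCP machinery. The following is a minimal stdlib-only simulation, not the Pydantic AI or MCP SDK implementation: `fake_llm`, `client_call_tool`, and the `SamplingCallback` type are hypothetical stand-ins, and the `allow_sampling` flag here only mimics the idea of the real option. The fence-stripping regex is the one from `generate_svg.py` above, with an added `.strip()` for readability.

```python
import asyncio
import re
from typing import Awaitable, Callable

# Hypothetical type: the client-supplied callback a server uses to request a completion.
SamplingCallback = Callable[[str], Awaitable[str]]

# Same pattern as in generate_svg.py: re.M lets ^/$ match at line boundaries,
# re.S lets '.' span the newlines inside the fenced block.
FENCE_RE = re.compile(r'^```\w*$(.+?)```$', re.S | re.M)


def strip_markdown_fence(text: str) -> str:
    """Return the body of the first fenced block, or the text unchanged."""
    m = FENCE_RE.search(text)
    return m.group(1).strip() if m else text


async def fake_llm(prompt: str) -> str:
    """Stand-in for a real model: returns an SVG wrapped in a markdown fence."""
    return f'```svg\n<svg><title>{prompt}</title></svg>\n```'


async def image_generator(subject: str, style: str, sample: SamplingCallback) -> str:
    """Server-side tool: delegates the LLM call to the client through `sample`."""
    reply = await sample(f'{subject=} {style=}')
    return strip_markdown_fence(reply)


async def client_call_tool(subject: str, style: str, allow_sampling: bool = True) -> str:
    """Client side: calls the tool and serves (or refuses) its sampling request."""

    async def sample(prompt: str) -> str:
        if not allow_sampling:
            raise PermissionError('sampling is disabled for this server')
        return await fake_llm(prompt)

    return await image_generator(subject, style, sample)


def run_demo() -> str:
    return asyncio.run(client_call_tool('robot', 'punk'))


print(run_demo())
```

The point of the sketch is the ownership of the LLM call: the server never holds model credentials, and the client decides whether a sampling request is served at all.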

`‎docs/mcp/run-python.md`

Lines changed: 1 addition & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -122,7 +122,7 @@ As introduced in PEP 723, explained [here](https://packaging.python.org/en/lates`
`122`	`122`
`123`	`123`	`This allows use of dependencies that aren't imported in the code, and is more explicit.`
`124`	`124`
`125`		-```py {title="inline_script_metadata.py" py="3.10"}
	`125`	+```py {title="inline_script_metadata.py" py="3.10" requires="mcp_run_python.py"}
`126`	`126`	`from mcp import ClientSession`
`127`	`127`	`from mcp.client.stdio import stdio_client`
`128`	`128`

0 commit comments
