Enables support for vLLM. To use it, specify the model field in the task parameter of the pgml.transform function and add "backend": "vllm" to the task parameters. For example:

SELECT * FROM pgml.transform(
    task => '{"model": "tiiuae/falcon-7b", "backend": "vllm"}'::JSONB,
    inputs => ARRAY['hello']
);

A list of supported models for vLLM can be found here.

Only one vLLM model can be loaded per client connection process due to a limitation in vLLM. The first call to pgml.transform with a given model loads the model ("cold start"); subsequent calls reuse the cached model. If you specify a different model in the same client connection, the cached model is replaced with the new one.
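The caching behavior described above can be pictured as a single-slot cache. The following is only a minimal sketch of that idea, not the code in this pull request: SingleModelCache, fake_loader, and the loads counter are hypothetical names introduced for illustration, and the loader is a stand-in that does not call vLLM.

```python
from typing import Callable, Optional


class SingleModelCache:
    """Holds at most one loaded model at a time (hypothetical illustration)."""

    def __init__(self, loader: Callable[[str], object]) -> None:
        self._loader = loader
        self._name: Optional[str] = None
        self._model: Optional[object] = None
        self.loads = 0  # counts how many times the loader actually ran

    def get(self, name: str) -> object:
        # Load on the first call ("cold start") or when the model name changes.
        if self._name != name:
            self._model = None  # release the old model before loading the new one
            self._model = self._loader(name)
            self._name = name
            self.loads += 1
        return self._model


def fake_loader(name: str) -> dict:
    # Stand-in for an expensive model load; assumption, not a vLLM call.
    return {"model": name}


cache = SingleModelCache(fake_loader)
cache.get("tiiuae/falcon-7b")  # cold start: loader runs
cache.get("tiiuae/falcon-7b")  # cached: loader does not run again
```

Requesting a different model name through the same cache would run the loader again and discard the previous entry, which is the swap behavior the description mentions.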

kczimm marked this pull request as ready for review

October 19, 2023 20:46


levkk commented Oct 19, 2023

Rebase on master to get #1102, which should fix the tests.

kczimm added 9 commits

October 19, 2023 21:25

add vllm binding

635476c

add vllm SamplingParams

9360ef7

add test showing vllm model support check

8be0710

refactor into llm module, use PyResult

b212ee0

add vLLM to the transform API

746953e

make bindings vllm::outputs

ca7e4ad

swap out vLLM model if new

d017cd6

add vllm docs

74ce6ae

add vllm inference docs; fix logic

aca505c

kczimm force-pushed the kczimm-vllm-support branch from 5e20276 to aca505c

October 19, 2023 21:26

vllm support #1063

kczimm commented Oct 11, 2023 (edited)

3 participants
