allenai/infinigram-apiPublic

NotificationsYou must be signed in to change notification settings
Fork10
Star80

License

Apache-2.0 license

80 stars 10 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.devcontainer		.devcontainer
.github		.github
.skiff		.skiff
.vscode		.vscode
api		api
attribution_worker		attribution_worker
bin		bin
compute_stats		compute_stats
docs		docs
indexing		indexing
load-test		load-test
otel-collector		otel-collector
packages/infini-gram-processor		packages/infini-gram-processor
proxy		proxy
schema		schema
scripts		scripts
vendor		vendor
volume-claims		volume-claims
.dockerignore		.dockerignore
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
docker-compose.yaml		docker-compose.yaml
pyproject.toml		pyproject.toml
skiff.json		skiff.json
uv.lock		uv.lock

Repository files navigation

Infini-gram API

This API is a wrapper overinfini-gram to allow it to be used through an API at scale. It's a uv workspace with two applications (the API andthe worker) and one library to share code between the two (infini-gram-processor).

Reference

This application is only made possible by researchers that worked on the infini-gram paper:Liu, Jiacheng and Min, Sewon and Zettlemoyer, Luke and Choi, Yejin and Hajishirzi, Hannaneh (2024).Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens.arXiv preprint arXiv:2401.17377,

Getting Started

To develop in this repo, see thecontributing doc.

Indexes

You can find the index documentationhere.

Architecture

flowchart TB    queue@{ shape: cyl, label: "Queue"}    indexes@{ shape: lin-cyl, label: "Indexes" }    api@{ shape: rounded, label: "API" }    worker@{ shape: rounded, label: "Attribution Worker" }    proxy@{ shape: stadium, label: "Web Proxy" }    infini-gram@{ shape: subproc, label: "infini-gram" }    api <-- Add jobs, receive results --> queue    worker <-- Receive jobs, send results --> queue    api --> infini-gram    worker --> infini-gram    infini-gram --> indexes    proxy --> api

This application is deployed on Ai2'sSkiff platform. It's a wrapper over k8s designed to streamline development and deployment.

The API and worker are in different deployments and are separately scalable.

Both the API and worker access infini-gram and the associated indexes. The API will pass anyattribution requests to the queue and await the result. The worker reads requests from the queue and works them, returning the result to the queue when finished. Requests other thanattribution will be handled in the API.attribution requests are split off because they take much longer, which was causing the server to hang under load.

About

No description, website, or topics provided.

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Infini-gram API

Reference

Getting Started

Indexes

Architecture

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Contributors5

Uh oh!

Languages

Movatterモバイル変換

License

allenai/infinigram-api

Folders and files

Latest commit

History

Repository files navigation

Infini-gram API

Reference

Getting Started

Indexes

Architecture

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Contributors5

Uh oh!

Languages

Packages