PinnedLoading
- llm-compressor
llm-compressor PublicTransformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- speculators
speculators PublicA unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM
Repositories
Showing 10 of 30 repositories
- compressed-tensors Public
A safetensors extension to efficiently store sparse quantized tensors on disk
vllm-project/compressed-tensors’s past year of commit activity Uh oh!
There was an error while loading.Please reload this page.
vllm-project/ci-infra’s past year of commit activity - llm-compressor Public
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
vllm-project/llm-compressor’s past year of commit activity