High Performance and Easy Deployment of vLLM in K8S with “vLLM production-stack”
ByLMCache TeamPosted on January 21, 2025
You focus on KV cache research, we make it compatible with vLLM
ByLMCache TeamPosted on October 29, 2024
📖 Explore LMCache Documentation
ByLMCache TeamPosted on October 17, 2024
Beyond Prefix Caching! How LMCache Speeds Up RAG by 4.5x By One Line of Change
ByLMCache TeamPosted on October 9, 2024
Are you a vLLM user? Change just ONE line of code to unlock 100x more KV cache storage power!
ByLMCache TeamPosted on September 23, 2024