#
xpu
Here are 7 public repositories matching this topic...
A high-throughput and memory-efficient inference and serving engine for LLMs
amdcudainferencepytorchtransformerllamagptrocmmodel-servingtpuhpumlopsxpullminferentiallmopsllm-servingqwendeepseektrainium
- Updated
Mar 17, 2025 - Python
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
- Updated
Dec 11, 2024 - Julia
Improve this page
Add a description, image, and links to thexpu topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thexpu topic, visit your repo's landing page and select "manage topics."