- Guangzhou, China
- 03:45
(UTC +08:00) - https://github.com/xlite-dev
🏢 Groups: Owner of @xlite-dev | @vipshop | Prev. @PaddlePaddle 🏰
🛠 Creator: lite.ai.toolkit | Awesome-LLM-Inference | LeetCUDA | ffpa-attn | HGEMM | 🤗cache-dit | Awesome-DiT-Inference | torchlm
🎉 Contributor: FastDeploy | vLLM | SGLang | Many Others ⚙️
✉️ Contact: qyjdef@163.com | GitHub: DefTruth | Zhihu: DefTruth 🤖
Pinned
- xlite-dev/LeetCUDA (Public): 📚 LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners 🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA. 🎉
- xlite-dev/lite.ai.toolkit (Public): 🛠 A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc. 🎉
- xlite-dev/Awesome-LLM-Inference (Public): 📚 A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc. 🎉
- vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs.
- PaddlePaddle/FastDeploy (Public): High-performance inference and deployment toolkit for LLMs and VLMs based on PaddlePaddle.
- vipshop/cache-dit (Public): 🤗 CacheDiT: A training-free and easy-to-use cache acceleration toolbox for Diffusion Transformers. 🔥