SqueezeBits Inc.
Popular repositoriesLoading
- Torch-TRTLLM
Torch-TRTLLM PublicDitto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.
- owlite-examples
owlite-examples PublicOwLite Examples repository offers illustrative example codes to help users seamlessly compress PyTorch deep learning models and transform them into TensorRT engines.
- .github
.github Public - mlperf_inference_results_v4.0
mlperf_inference_results_v4.0 PublicC++ 1
Repositories
- Torch-TRTLLM Public
Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.
SqueezeBits/Torch-TRTLLM’s past year of commit activity - vllm-fork Public Forked fromHabanaAI/vllm-fork
A high-throughput and memory-efficient inference and serving engine for LLMs
SqueezeBits/vllm-fork’s past year of commit activity - gradio Public Forked fromgradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
SqueezeBits/gradio’s past year of commit activity - TensorRT-LLM Public Forked fromNVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
SqueezeBits/TensorRT-LLM’s past year of commit activity - owlite-examples Public
OwLite Examples repository offers illustrative example codes to help users seamlessly compress PyTorch deep learning models and transform them into TensorRT engines.
SqueezeBits/owlite-examples’s past year of commit activity - nvidia-dind Public Forked fromehfd/nvidia-dind
Isolated DinD (Docker in Docker) container for developing and deploying Docker containers using NVIDIA GPUs and the NVIDIA container toolkit. Useful for deploying the Docker engine with NVIDIA in Kubernetes.
SqueezeBits/nvidia-dind’s past year of commit activity