tensorcore
Here are 13 public repositories matching this topic...
Quantization library for PyTorch. Supports low-precision and mixed-precision quantization, with hardware implementation through TVM.
Updated May 15, 2023 - Python
An extension library for the WMMA API (Tensor Core API)
Updated Jul 12, 2024 - Cuda
FP64-equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
Updated Dec 2, 2025 - Cuda
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
Updated Feb 12, 2022 - Python
(Deprecated) SystemVerilog Implementations of CUDA/TensorCore/TPU GEMM Operations
Updated Aug 14, 2025 - Verilog
Fast SGEMM emulation on Tensor Cores
Updated Feb 16, 2025 - Cuda
An extension library for the WMMA API for single-precision matrix operations using Tensor Cores and an error correction technique
Updated Jul 22, 2021 - C++
Compares the runtimes of CNN computation on CPU and GPU
Updated May 1, 2022 - C++
Artifact for SC21: APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores.
Updated Aug 26, 2021 - Cuda
Simple examples of tools and libraries
Updated Jul 31, 2025 - Python
Experiments to accelerate GPU device for PyTorch training
Updated Dec 15, 2021 - Jupyter Notebook