
cuda-kernels

Here are 277 public repositories matching this topic...

LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

  • Updated Dec 4, 2025
  • Cuda

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

  • Updated Sep 5, 2025
  • C

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

  • Updated Dec 18, 2025
  • Python

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

  • Updated Dec 16, 2025
  • Rust

CUDA Kernel Benchmarking Library

  • Updated Dec 10, 2025
  • Cuda
kernel_tuner

Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.

  • Updated Apr 14, 2022
  • C++

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

  • Updated Dec 15, 2025
  • Cuda

An archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010

  • Updated Jun 24, 2022
  • C++

CUDA tutorials for maths and ML with examples, covering multi-GPU programming, fused attention, Winograd convolution, and reinforcement learning.

  • Updated Jun 11, 2025
  • Cuda

Amplifier lets .NET developers run computation-heavy applications on Intel CPUs/GPUs and on NVIDIA and AMD hardware without writing any additional C kernel code. Write your function in .NET and Amplifier takes care of running it on the hardware you choose.

  • Updated Apr 9, 2025
  • C#

Some CUDA design patterns and a bit of template magic for CUDA

  • Updated Jun 3, 2023
  • C++

Triton implementation of FlashAttention2 that adds support for custom masks.

  • Updated Aug 14, 2024
  • Python

Spiking Neural Networks in C++ with strong GPU acceleration through CUDA

  • Updated Jul 3, 2020
  • Cuda

Attention Kernels for Symmetric Power Transformers

  • Updated Sep 25, 2025

High-speed GEMV kernels, with up to 2.7x speedup over the PyTorch baseline.

  • Updated Jul 13, 2024
  • Cuda
