#cuda-kernels

Here are 299 public repositories matching this topic...

LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

  • Updated Feb 13, 2026
  • Cuda

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

  • Updated Jan 6, 2026
  • C

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

  • Updated Feb 13, 2026
  • Python

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

  • Updated Feb 16, 2026
  • Rust

CUDA Kernel Benchmarking Library

  • Updated Feb 19, 2026
  • Cuda

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

  • Updated Jan 8, 2026
  • Cuda

kernel_tuner

Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++.

  • Updated Apr 14, 2022
  • C++

An archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010.

  • Updated Jun 24, 2022
  • C++

Comprehensive CUDA tutorials for Maths & ML with examples.

  • Updated Jun 11, 2025
  • Cuda

Amplifier allows .NET developers to easily run complex applications with intensive mathematical computation on Intel CPUs/GPUs, NVIDIA, and AMD hardware without writing any additional C kernel code. Write your function in .NET and Amplifier will take care of running it on your favorite hardware.

  • Updated Dec 23, 2025
  • C#

Triton implementation of FlashAttention2 that adds custom masks.

  • Updated Aug 14, 2024
  • Python

Some CUDA design patterns and a bit of template magic for CUDA

  • Updated Jun 3, 2023
  • C++

Spiking Neural Networks in C++ with strong GPU acceleration through CUDA

  • Updated Jul 3, 2020
  • Cuda

Attention Kernels for Symmetric Power Transformers

  • Updated Sep 25, 2025

High-speed GEMV kernels, with up to 2.7x speedup compared to the PyTorch baseline.

  • Updated Jul 13, 2024
  • Cuda
