HROlive/Fundamentals-of-Accelerated-Computing-with-CUDA-Python
This course explores how to use Numba—the just-in-time, type-specializing Python function compiler—to accelerate Python programs to run on massively parallel NVIDIA GPUs.
You’ll learn how to:
- Use Numba to compile CUDA kernels from NumPy universal functions (ufuncs);
- Use Numba to create and launch custom CUDA kernels;
- Apply key GPU memory management techniques.

Upon completion, you’ll be able to use Numba to compile and launch CUDA kernels to accelerate your Python applications on NVIDIA GPUs.
At the conclusion of the workshop, you’ll understand the fundamental tools and techniques for GPU-accelerated Python applications with CUDA and Numba, and be able to:
- GPU-accelerate NumPy ufuncs with a few lines of code.
- Configure code parallelization using the CUDA thread hierarchy.
- Write custom CUDA device kernels for maximum performance and flexibility.
- Use memory coalescing and on-device shared memory to increase CUDA kernel bandwidth.
More detailed information and links for the course can be found on the course website.
The certificate for the course can be found below:
- "Fundamentals of Accelerated Computing with CUDA Python" - NVIDIA Deep Learning Institute (Issued On: January 2025)