Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up

A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches

License

NotificationsYou must be signed in to change notification settings

ielhajj/klap

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

KLAP is a source-to-source compiler that optimizes CUDA code which uses dynamic parallelism to implement applications with nested parallelism. KLAP aggregates dynamic launches across warps, blocks, and grids to reduce the total number of grid launches and increase their granularity.

Instructions

Refer tosrc for instructions on how to build the compiler.

Refer toinclude for instructions on how to setup the runtime.

Refer totest for instructions on how to run the benchmarks.

Citation

Please cite the following paper if you find this work useful:

  • I. El Hajj, J. Gómez-Luna, C. Li, L.-W. Chang, D. Milojicic, W.-M. Hwu.KLAP: Kernel Launch Aggregation and Promotion for Optimizing Dynamic Parallelism.InProceedings of the 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2016.

About

A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++68.7%
  • Cuda20.0%
  • C8.7%
  • Makefile1.1%
  • Objective-C1.0%
  • Shell0.4%
  • CMake0.1%

[8]ページ先頭

©2009-2025 Movatter.jp