You signed in with another tab or window.Reload to refresh your session.You signed out in another tab or window.Reload to refresh your session.You switched accounts on another tab or window.Reload to refresh your session.Dismiss alert
The project is developing a deep learning accelerator generator based on systolic array architecture. It is inspired by recent trends in machine learning accelerators for edge and mobile SoCs.
It currently supports unsigned integer and signed integer, and support run-time selection. You are also free to config which type to support at RTL code.It is planned to support floating-point in the future.
This repo contains the Chisel code for generating the RTL. We plan to write our customized PyTorch backend in the future.
At the heart of the accelerator lies a systolic array (called Tile) which performs matrix multiplications. The processing elements support weight-stationary dataflows.
Test Tile
make test-tile
Test Processing Element
make test-pe
Generate Verilog Code
make run
About
Systolic-array based Deep Learning Accelerator generator