Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a convolution calculation accelerator based on CSR coding and an acceleration method. The technical problems to be solved by the invention are realized by the following technical scheme:
in a first aspect, the present invention provides a convolution calculation accelerator based on CSR coding, comprising:
the device comprises a data preprocessing module, a CSR encoding module, a multiplication systolic calculation array, a data distribution module, a data accumulation module, a data delay module, a data arrangement module, a re-quantization module and a total control module; wherein:
the data preprocessing module is used for reading data from the outside through DMA and performing blocking processing to obtain blocked data;
the CSR encoding module is used for carrying out CSR encoding on the blocked data to obtain characteristic data and their corresponding addresses;
the multiplication systolic calculation array is used for carrying out natural-flow multiplexing calculation on the corresponding characteristic data and weights according to the addresses, and transmitting the calculation results to the data distribution module;
the data distribution module is used for dividing the input calculation results into present-window data and cross-window data, and inputting them respectively into the corresponding accumulation units in the data accumulation module for data accumulation;
the data delay module is used for feeding back a backpressure signal to the multiplication systolic calculation array when it judges that an add-write conflict has occurred in the data accumulation module, so as to pause the current work, and for restarting the current work after the delayed data have been added;
the data arrangement module is used for integrating the accumulated data output by the data accumulation module; after the re-quantization module remaps the bit width, the data are written into off-chip storage by DMA, completing the multiplexing convolution calculation of discontinuous input data;
and the total control module is used for controlling the calling and data interaction of all other modules.
In a second aspect, the present invention provides a convolutional calculation acceleration method based on CSR coding, which is applied to the convolutional calculation accelerator based on CSR coding provided in the foregoing embodiment, and includes the following steps:
external data are obtained, block preprocessing and CSR coding are carried out, and characteristic data and corresponding addresses thereof are obtained;
performing natural-flow multiplexing calculation on the corresponding characteristic data and weights through the addresses based on the systolic array, and dividing the calculation results into present-window data and cross-window data;
accumulating the present-window data and the cross-window data through adder multiplexing, suspending the current work through backpressure control when an add-write conflict is judged to have occurred, and restarting the current work after the delayed data have been added;
and integrating the accumulated data and remapping their bit width, then writing them into off-chip storage to complete the multiplexing convolution calculation of discontinuous input data.
The invention has the beneficial effects that:
1. the convolution calculation accelerator based on CSR coding provided by the invention saves the additional overhead of encoding by performing block preprocessing on the input data; the CSR encoding effectively avoids the redundant parts of neural-network convolution calculation; the systolic calculation array after encoding, together with the subsequent series of processing, completes the multiplexing of discontinuous input data; and the data delay module realizes backpressure control, solving the add-write conflicts caused by the discontinuity of the input data. Meanwhile, dividing the data into present-window and cross-window data saves the storage size required for matching the data, reduces the pressure of on-chip storage, lowers the power consumption, and makes high-parallelism convolution calculation possible;
2. the convolution calculation accelerator based on CSR coding provided by the invention packages the preprocessing, CSR encoding and caching together, and saves the time overhead of encoding by utilizing ping-pong operation;
3. the invention saves extra area overhead through data classification and module multiplexing, which not only realizes characteristic-value multiplexing and adder multiplexing in the systolic array, but also enjoys the convenience of removing redundant zero calculations brought by the encoding.
The present invention will be described in further detail with reference to the accompanying drawings and examples.
Detailed Description
The present invention will be described in further detail with reference to specific examples, but embodiments of the present invention are not limited thereto.
Example 1
Referring to fig. 1, fig. 1 is a block diagram of a convolutional calculation accelerator based on CSR coding according to an embodiment of the present invention, where the accelerator includes:
the device comprises a data preprocessing module, a CSR encoding module, a multiplication systolic calculation array, a data distribution module, a data accumulation module, a data delay module, a data arrangement module, a re-quantization module and a total control module; wherein:
the data preprocessing module is used for reading data from the outside through DMA and performing blocking processing to obtain blocked data;
the CSR encoding module is used for carrying out CSR encoding on the blocked data to obtain characteristic data and their corresponding addresses;
the multiplication systolic calculation array is used for carrying out natural-flow multiplexing calculation on the corresponding characteristic data and weights according to the addresses, and transmitting the calculation results to the data distribution module;
the data distribution module is used for dividing the input calculation results into present-window data and cross-window data, and inputting them respectively into the corresponding accumulation units in the data accumulation module for data accumulation;
the data delay module is used for feeding back a backpressure signal to the multiplication systolic calculation array when it judges that an add-write conflict has occurred in the data accumulation module, so as to pause the current work, and for restarting the current work after the delayed data have been added;
the data arrangement module is used for integrating the accumulated data output by the data accumulation module; after the re-quantization module remaps the bit width, the data are written into off-chip storage by DMA, completing the multiplexing convolution calculation of discontinuous input data;
the total control module is used for controlling the calling and data interaction of all other modules.
Each module and its data processing procedure are described in detail in turn.
Data preprocessing module:
specifically, the data preprocessing module divides an input characteristic data graph into different matrixes according to a specific size after the Padding operation, and the default part of the matrixes is filled with zeros. The blocking processing can reduce the additional overhead caused by data coding and save the consumption of computing resources. The last actually implemented method may also be a block processing on the software side before the data is input in advance.
CSR coding module:
the CSR encoding module is used for carrying out CSR encoding on the blocked input data, filtering zero data in the blocked input data and inputting new data into the multiplication ripple calculation array. Specifically, the CSR coding rule is shown in fig. 2, for example, the example illustrated in the figure is a coding window with a size of 10×4, and an english letter indicates that the data is non-zero, and a no letter indicates that the data is zero. The new DATA obtained after encoding can be divided into characteristic DATA, denoted DATA in fig. 2, and addresses, denoted COUNT and ADDR in fig. 2, which together represent that the non-zero DATA is at a specific position in the original window. The characteristic data is input into a subsequent ripple calculation array to finish calculation, and the address is used as the input of the address calculation module.
In this embodiment, the data address calculation module is spatially coincident with the multiplication systolic calculation array.
Optionally, as an implementation manner, the data preprocessing module and the CSR encoding module may be packaged together with the storage space of the data input channel and duplicated once, so as to implement ping-pong operation, thereby saving the time overhead caused by data encoding.
Multiplication systolic calculation array:
since convolution calculation can be decomposed into multiplications and additions, the multiplication systolic calculation array is used for computing the products of the encoded data and the weights; the results are then input into the data distribution module to await distribution. The multiplication systolic calculation array in the present embodiment therefore adopts a mode in which the encoded data flow transversely and the weights, after being broadcast, flow longitudinally, so as to complete data multiplexing.
It should be noted that, because the encoded data are not continuous in the original feature-map distribution, the weights need to be input by broadcast according to a specific arrangement, that is, the weights are arranged in columns according to the number of convolution kernels, so as to minimize the possibility of conflicts caused by data of the same convolution window being calculated in the same clock cycle.
Referring to fig. 3, fig. 3 is a detailed structural schematic diagram of a convolution calculation accelerator based on CSR coding according to an embodiment of the present invention. The multiplication systolic calculation array comprises a plurality of PE units, whose brief structure is shown in fig. 4; each PE unit comprises an address calculation module, a pipeline register and a multiplier, wherein:
the address calculation module calculates the position information of the corresponding coded data in the original window according to the currently received address;
the pipeline register is used for transmitting the current address and the corresponding coded data to the next PE unit;
the multiplier is used for multiplying the current non-zero data with the weight to obtain a corresponding product, and the product and the position information obtained by the address calculation module are synchronously kept to be fed into the data distribution module.
Specifically, the address calculation module f_addr calculates the position of each datum in the original window according to the COUNT and ADDR values obtained by the preceding encoding. The datum and its COUNT and ADDR values are passed to the next PE unit through the pipeline register, so that the encoded data flow transversely through the systolic array; each non-zero datum is also multiplied by the weight in the multiplier to obtain the corresponding product, and the product is sent, in synchronization with the position information obtained by the address calculation module, to the data distribution module.
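The position recovery performed by f_addr can be sketched as follows, assuming COUNT holds per-row non-zero counts and ADDR holds column indices (an assumption about the exact field semantics, which fig. 2 does not pin down):

```python
def f_addr(count, addr):
    """Recover (row, col) positions in the original window from CSR fields.

    A running index over COUNT tells which row each DATA element belongs
    to; ADDR supplies the column directly.
    """
    positions = []
    idx = 0
    for row, n in enumerate(count):
        for _ in range(n):
            positions.append((row, addr[idx]))
            idx += 1
    return positions
```

In hardware this would be a small counter per PE rather than a loop, but the mapping from (COUNT, ADDR) to window coordinates is the same.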
Data distribution module:
After the multiplication systolic calculation array finishes its calculation, the data distribution module divides the data into two data streams according to the specific situation, namely a present-window data stream and a cross-window data stream.
Referring to fig. 5-6, fig. 5 is a schematic diagram of data classification provided by an embodiment of the present invention, and fig. 6 is a schematic diagram of cross-window data interaction provided by an embodiment of the present invention, where F is the size of the convolution kernel, ROW is the number of columns of the coding window, COL is the number of rows, and the blue portion represents a convolution window of size 3×3. As shown in fig. 5, if the third row of the window holds non-zero data, multiplying that non-zero data with any value of the third row of the convolution kernel generates cross-window data; that is, the products generated by the non-zero data of the white window need to be matched with the products generated by the non-zero data of the gray window. Present-window data, by contrast, have no relation to the data of other windows, so their calculation can be completed independently. The data distribution module is responsible for dividing the data stream from the systolic array into present-window data and cross-window data: it divides the finally calculated data into two parts according to the address and the control signal, and then sends them respectively to the different parts of the data accumulation module.
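The routing rule illustrated by fig. 5 — products originating in the last F-1 rows of a coding window must be matched with the next window — can be sketched as follows. A stride of 1 and zero-based row indices are assumptions made for illustration:

```python
def classify(row, window_rows, f):
    """Route one product by the row of its source datum.

    Data in the last F-1 rows of a coding window contribute to
    convolution windows that extend into the next block, so their
    products go to the cross-window stream; all other products stay
    in the present-window stream.
    """
    if row >= window_rows - (f - 1):
        return "cross-window"
    return "present-window"
```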
Optionally, in this embodiment, the data distribution module sets the window size as a trade-off: the space needed to store cross-window data is proportional to the window size, whereas the overhead bits introduced by encoding are inversely proportional to it. A smaller window thus reduces the circuit's actual operation overhead, storage overhead and logic decisions, but increases the encoding overhead. In practical application, an optimal result is obtained by weighing these two influences; for example, the window size selected by the present invention is 10×10.
Data accumulation module:
With continued reference to fig. 3, the data accumulation module includes a first-stage adder and a second-stage adder, and the first-stage adder may be multiplexed; wherein:
the first-stage adder is used for adding the convolution window data of the same input channel and outputting the convolution result of that channel;
the second-stage adder is used for accumulating the data of the same convolution window across different convolution channels, and sending the present-window data and cross-window data together to the data arrangement module.
In this embodiment, since the data distribution module divides the data into present-window data and cross-window data, the data accumulation module is correspondingly provided with two accumulation units for accumulating the two types of data respectively. As shown in fig. 3, the first-stage adder includes a present-window accumulating unit and a cross-window accumulating unit;
the data delay module is packaged with the present-window accumulating unit and used for accumulating present-window data;
the cross-window accumulating unit is used for accumulating cross-window data.
Specifically, the first-stage adder is divided into two parts corresponding to the two different data streams, and the data delay module is packaged together with part of the adders of the first-stage adder in the accumulation module to serve as the input module for present-window data. Within the present-window data, owing to the discontinuity and randomness of the encoded data mentioned above, two or more data may need to be written into one adder in the same clock cycle; this situation is hereinafter collectively referred to as an add-write conflict. The data delay module judges whether an add-write conflict has occurred; if so, it sends a backpressure signal to the preceding pipeline stages to halt their operation, and waits until the delayed data have been added before restarting the pipeline.
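The stall-and-delay behaviour can be modelled in software as follows. This is a behavioural sketch, not the RTL; the retry policy (delayed writes get first claim on the next cycle) is an assumption:

```python
def accumulate_with_backpressure(writes):
    """Model add-write conflicts with delay and backpressure.

    `writes` is a list of per-cycle lists of (adder_id, value) pairs.
    When two values target the same adder in one cycle, the extras are
    delayed to the next cycle and a backpressure flag is raised.
    """
    acc = {}
    pending = []       # delayed writes awaiting a free cycle
    backpressure = []  # one stall flag per cycle
    for cycle_writes in writes:
        queue = pending + list(cycle_writes)
        pending = []
        seen = set()
        stall = False
        for adder_id, value in queue:
            if adder_id in seen:              # add-write conflict
                pending.append((adder_id, value))
                stall = True
            else:
                seen.add(adder_id)
                acc[adder_id] = acc.get(adder_id, 0) + value
        backpressure.append(stall)
    # drain any writes still delayed after the last cycle
    for adder_id, value in pending:
        acc[adder_id] = acc.get(adder_id, 0) + value
    return acc, backpressure
```

A raised flag corresponds to the backpressure signal that halts the preceding pipeline stages.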
It will be appreciated that, to support this backpressure control, the present embodiment also places a backpressure memory bank formed by synchronous FIFOs between the distribution module and the present-window accumulating unit. The depth requirement of this memory bank is small: it only stores the in-flight pipeline data between the first pipeline stage and the delay-module stage, so as to prevent data loss.
Further, the present embodiment adopts the conventional accumulation unit structure shown in fig. 7 to implement the two-stage accumulator, which includes two MUX data selectors, a DFF flip-flop group and an adder; for the detailed working principle, reference may be made to the prior art.
The first-stage adder of the data accumulation module provided in this embodiment can be multiplexed to save the overhead of extra flip-flops. The multiplexing function of the first-stage adder is briefly described below.
The adder stage groups the adders by addition ID according to requirements, where the number of adder groups corresponds to the number of times the convolution window slides within the coding window; for example, when the coding window is 10 and the convolution kernel size is 3, the adders are divided into eight groups. The groups are then scheduled through their IDs to complete the multiplexing. When the data flow reaches the (F+1)-th row of the original window, all data required by the first convolution window, namely all data of the first F rows, have been calculated, so the adder group that computed the first convolution window can be reused to compute the (F+1)-th convolution window. Each convolution window needs F rows of data to finish its calculation; the first window therefore needs the time of F rows to obtain its result, while each subsequent window needs only the time of one additional row, which greatly improves multiplexing efficiency and reduces the extra cost of adders. The cross-window data would also generate a backpressure requirement; however, since only F-1 rows of data are distributed into the cross-window stream, the total amount of data affected by add-write conflicts is small, so an adder tree is adopted in that module to resolve write conflicts at the cost of area, rather than generating two different backpressure signals whose interaction would disturb the control.
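The group count in the example above follows directly from the sliding-window count; a one-line sketch (stride 1 assumed, as in the example):

```python
def adder_groups(window_rows, f, stride=1):
    """Number of adder groups: one per vertical slide of the FxF kernel
    within the coding window, e.g. (10 - 3) // 1 + 1 = 8 groups."""
    return (window_rows - f) // stride + 1
```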
Furthermore, the embodiment also designs a data temporary storage module, which is connected between the cross-window accumulating unit in the first-stage adder and the second-stage adder;
the data temporary storage module is used for synchronizing the cross-window data flow so as to cooperate with the second-stage adder to accumulate data;
and the data temporary storage module is packaged with part of adders in the second-stage adders.
In particular, the data temporary storage module corresponds to the synchronous FIFO in fig. 3 connected between the cross-window accumulating unit and the second-stage adder and, like the data delay module, is packaged together with part of the adders in the second-stage adder as the input module for cross-window data. This module completes the matching of cross-window data streams: under the control of the total control module, data streams that need to wait for a subsequent window are written into the synchronous FIFO, and data streams read out of it, representing matched data, are sent together into the second-stage adder to complete the next accumulation. Because of the reasonable window partitioning, the cross-window data account for only a small fraction of the total, so the required storage space is not large. In practice, the additional storage overhead of the present invention is embodied in this data temporary storage module, which internally contains the multiplexed adders and the cross-window data.
The second-stage adder in the accumulation module is used for accumulating the data of the same convolution window across different convolution channels; the present-window data and the cross-window data are then fed together into the data arrangement module.
Data arrangement module:
The data arrangement module is responsible for integrating the two different data streams; the final output data stream is integrated into a form similar to the original input window, and is then input to the re-quantization module.
Re-quantization module:
the re-quantization module is used for remapping the data with the expanded bit width after convolution to the bit width required by the next input so as to facilitate the next convolution calculation. And finally writing the quantized data into off-chip storage through DMA (direct memory access) to complete convolution calculation.
It can be appreciated that the convolution computing accelerator designed in this embodiment further includes a total control module for controlling the data flow direction and data interaction of the whole system. The detailed control method can be realized with reference to the prior art.
The convolution calculation accelerator based on CSR coding provided by the invention saves the additional overhead of encoding by performing block preprocessing on the input data; the CSR encoding effectively avoids the redundant parts of neural-network convolution calculation; the systolic calculation array after encoding, together with the subsequent series of processing, completes the multiplexing of discontinuous input data; and the data delay module realizes backpressure control, solving the add-write conflicts caused by the discontinuity of the input data. Meanwhile, dividing the data into present-window and cross-window data saves the storage size required for matching the data, reduces the pressure of on-chip storage, lowers the power consumption, and makes high-parallelism convolution calculation possible.
In addition, the invention saves extra area overhead through data classification and module multiplexing, which not only realizes characteristic-value multiplexing and adder multiplexing in the systolic array, but also enjoys the convenience of removing redundant zero calculations brought by the encoding.
Example two
On the basis of the first embodiment, the present embodiment provides a convolution calculation acceleration method based on CSR coding. Referring to fig. 8, fig. 8 is a flow chart of a convolutional calculation acceleration method based on CSR coding according to an embodiment of the present invention, where the method includes:
step 1: external data are obtained, block preprocessing and CSR coding are carried out, and characteristic data and corresponding addresses thereof are obtained;
step 2: performing natural-flow multiplexing calculation on the corresponding characteristic data and weights through the addresses based on the systolic array, and dividing the calculation results into present-window data and cross-window data;
step 3: accumulating the present-window data and the cross-window data through adder multiplexing, suspending the current work through backpressure control when an add-write conflict is judged to have occurred, and restarting the current work after the delayed data have been added;
step 4: integrating the accumulated data and remapping their bit width, then writing them into off-chip storage to complete the multiplexing convolution calculation of discontinuous input data.
The method provided in this embodiment may be applied to the convolution calculation accelerator provided in the first embodiment, and for the detailed process reference may be made to the description of the first embodiment. Accordingly, the method can likewise reduce the pressure of on-chip storage, reduce the power consumption, and is suitable for highly parallel convolution calculation.
The foregoing is a further detailed description of the invention in connection with the preferred embodiments, and it is not intended that the invention be limited to the specific embodiments described. It will be apparent to those skilled in the art that several simple deductions or substitutions may be made without departing from the spirit of the invention, and these should be considered to be within the scope of the invention.