Processing unit, computing device, and computational graph processing method for deep learning model

Info

Publication number: CN113705799B
Authority: CN (China)
Prior art keywords: operator, attribute data, join, operators, deep learning
Legal status: Active
Application number: CN202010435630.7A
Other languages: Chinese (zh)
Other versions: CN113705799A (en)
Inventors: 董俊, 尹莉, 陈琳
Current Assignee: Pingtouge Shanghai Semiconductor Co Ltd
Original Assignee: Pingtouge Shanghai Semiconductor Co Ltd
Application filed by Pingtouge Shanghai Semiconductor Co Ltd
Priority to CN202010435630.7A
Publication of CN113705799A
Application granted
Publication of CN113705799B

Abstract

Translated from Chinese:

The present invention discloses a processing unit, a computing device, and a computational graph processing method for a deep learning model. The method includes: converting a computational graph of a first deep learning framework into an intermediate expression that conforms to what the acceleration unit follows; performing model processing on the intermediate expression; determining a first operator that is contained in the processed intermediate expression and has not been registered in the first deep learning framework, wherein the processed intermediate expression characterizes an operator by an operator identifier, attribute data, and connection relationships with other operators; and converting the processed intermediate expression back into the computational graph of the first deep learning framework, which includes: replacing the first operator with a join operator, constructing the connection relationships between the join operator and other operators according to the connection relationships between the first operator and other operators, and then replacing the operator identifier and attribute data of the join operator with the operator identifier and attribute data of the first operator. Embodiments of the present disclosure use a single join operator to solve the conversion problem of multiple operators in the computational graph that are not defined and registered under the original framework.

Description

Processing unit, computing device, and computational graph processing method for deep learning model
Technical Field
The present disclosure relates to the field of chips, and in particular to a processing unit, a computing device, and a computational graph processing method for a deep learning model.
Background
The current mainstream deep learning frameworks include the TensorFlow framework, the MXNet framework, the Caffe framework, and so on. These frameworks each define their own sets of operators, and even functionally similar operators may have different attributes in different frameworks. Operators are the fundamental units of operation in a deep learning model, for example the convolution operator. In TensorFlow, the padding attribute of the convolution operator is "VALID" or "SAME", where "VALID" means that only valid convolutions are performed and boundary data is not processed, and "SAME" means that the convolution results at the boundary are retained; in MXNet and Caffe, however, the padding attribute of the convolution operator is a two-dimensional array, and the user can specify the values to be padded in the horizontal and vertical directions. Thus, two operators with the same identifier may have different attributes, so that the two operators are not identical. An operator is defined by its identifier and its attributes.
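As an illustration of this difference (not part of the claimed method), the sketch below builds the same 3×3 convolution with TensorFlow's string-valued padding attribute and MXNet's two-dimensional pad attribute; the tensor shapes are hypothetical and the exact API signatures may vary between framework versions:

```python
import tensorflow as tf
import mxnet as mx

# TensorFlow: the padding attribute is the string "SAME" or "VALID".
x_tf = tf.random.normal([1, 32, 32, 16])      # NHWC input (hypothetical shape)
w_tf = tf.random.normal([3, 3, 16, 32])       # 3x3 kernel, 16 -> 32 channels
y_tf = tf.nn.conv2d(x_tf, w_tf, strides=1, padding="SAME")

# MXNet: the padding attribute is a per-dimension tuple; pad=(1, 1) pads one
# pixel vertically and horizontally, which mimics "SAME" for a 3x3 kernel.
x_mx = mx.sym.Variable("data")
y_mx = mx.sym.Convolution(data=x_mx, kernel=(3, 3), pad=(1, 1),
                          num_filter=32, no_bias=True)
```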
As is known to those skilled in the art, before a deep learning model can run on an acceleration unit, it must be processed by a corresponding processing unit and compiled into an acceleration-unit model that the instruction set of the acceleration unit can support. Such model processing includes operator merging, quantization, and the like. Operator merging combines operators inside the deep learning model, for example merging multiple operators into a single operator. Quantization converts parameters such as weights in the deep learning model, as well as the inputs to the deep learning model, from a high-precision data type to a low-precision data type. These model processes are all performed under a framework or instruction set that the acceleration unit can support; they only support the attributes and attribute values of each operator under the framework of the acceleration unit, and do not support the attributes and attribute values of each operator under the original deep learning framework of the deep learning model. Therefore, the deep learning model under the original deep learning framework must first be converted into an intermediate expression that the acceleration unit can support, model processing is then performed under the framework supported by the acceleration unit, and finally the model-processed intermediate expression is converted back into the computational graph of the original framework.
Converting the model-processed intermediate expression back into the computational graph of the original framework means converting each operator represented in the intermediate expression into an operator that the original framework can recognize. In the computational graph, an operator is characterized by its operator identifier, its attribute data, and its connection relationships with other operators, so the operator identifier, attribute data, and connection relationships of each converted operator must conform to the definitions of the original framework. For this reason, operators contained in the intermediate expression but not defined under the original framework would have to be defined and registered under the original framework. Defining each such operator separately under the original framework means that developers need to develop and maintain multiple new operators under the original framework, and the maintenance cost is high.
Disclosure of Invention
Based on this, an object of the present disclosure is to provide a processing unit, a computing device and a computation graph processing method of a deep learning model, so as to solve the problems in the prior art.
In a first aspect, embodiments of the present disclosure provide a processing unit, comprising:
An instruction fetch unit for retrieving computer instructions from a memory external to the processing unit;
an instruction decoding unit for decoding the retrieved computer instructions;
an instruction execution unit, configured to execute the decoded computer instruction to implement:
converting a computational graph of a deep learning model for the first deep learning framework into an intermediate representation conforming to the acceleration unit;
performing model processing on the intermediate expression;
Determining a first operator which is contained in the processed intermediate expression and is not registered in the first deep learning framework, wherein the processed intermediate expression characterizes the operator by an operator identifier, attribute data and connection relation with other operators;
converting the processed intermediate representation back to a computational graph of the first deep learning framework, comprising:
and in the processed intermediate expression, replacing the first operator by a connection operator, constructing the connection relation between the connection operator and other operators according to the connection relation between the first operator and other operators, and replacing the operator identification and attribute data of the connection operator by the operator identification and attribute data of the first operator.
Optionally, the operator identification and attribute data of the first operator are stored in the attribute data of the join operator, and the replacing the operator identification and attribute data of the join operator with the operator identification and attribute data of the first operator includes:
And reading the operator identification and the attribute data of the first operator from the attribute data of the connection operator, and respectively replacing the operator identification and the attribute data of the connection operator.
Optionally, the step of replacing the first operator with a join operator is repeated until all first operators in the processed intermediate representation are replaced, and then the step of replacing the operator identity and attribute data of the join operator with the operator identity and attribute data of the first operator is repeated until all join operators in the processed intermediate representation are replaced.
Optionally, the model processing includes at least one of operator merging, quantization, graph cutting, model pruning.
Optionally, the instruction execution unit further implements:
Converting said processed intermediate representation into a json file before said replacing said first operator with a join operator, and
After the replacing the operator identification and attribute data of the join operator with the operator identification and attribute data of the first operator, converting the json file into a format of a computational graph followed by the first deep learning framework.
Optionally, the first deep learning framework is the MXNet framework, and the join operator is an operator registered under the MXNet framework.
Optionally, the constructor of the join operator specifies other operators having a join relationship with the first operator through input and/or output parameters, so that the constructor of the join operator can construct the join relationship of the join operator with other operators according to the join relationship of the first operator with other operators.
Optionally, the constructor of the join operator further stores the input tensor number and the output tensor number of the first operator in the attribute data of the join operator.
Optionally, the converting the computational graph into the intermediate representation conforming to the acceleration unit comprises converting attribute data of at least one operator of the computational graph into attribute data of a corresponding operator defined by the intermediate representation by a mapping function.
Optionally, determining the first operator included in the processed intermediate expression and not registered in the first deep learning framework comprises comparing an operator identifier of each operator in the processed intermediate expression with an operator identifier of a registered operator under the first deep learning framework to determine the first operator.
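A minimal sketch of this check, under the assumption that the processed intermediate expression is held as a list of operator records and that the operator identifiers registered under the first framework are available as a set (all names are hypothetical and only illustrate the comparison described above):

```python
from dataclasses import dataclass, field

@dataclass
class OpNode:
    op_id: str                                   # operator identifier
    attrs: dict = field(default_factory=dict)    # attribute data
    inputs: list = field(default_factory=list)   # connection relationships

def find_unregistered(ir_ops, registered_ids):
    """Return the operators of the processed intermediate expression whose
    identifiers are not registered under the first deep learning framework."""
    return [op for op in ir_ops if op.op_id not in registered_ids]

# Operators whose identifier is absent from the framework's registry are the
# "first operators" that will later be replaced by the join operator.
first_ops = find_unregistered(
    ir_ops=[OpNode("Conv"), OpNode("FusedGelu")],       # hypothetical IR
    registered_ids={"Conv", "Pooling", "Activation"})   # hypothetical registry
```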
In a second aspect, embodiments of the present disclosure provide a computing device comprising a memory and a processing unit as described in any one of the above.
Optionally, the computing device is a server or a terminal device.
In a third aspect, an embodiment of the present disclosure provides a computation graph processing method of a deep learning model, including:
converting a computational graph of a deep learning model for the first deep learning framework into an intermediate representation conforming to the acceleration unit;
performing model processing on the intermediate expression;
Determining a first operator which is contained in the processed intermediate expression and is not registered in the first deep learning framework, wherein the processed intermediate expression characterizes the operator by an operator identifier, attribute data and connection relation with other operators;
converting the processed intermediate representation back to a computational graph of the first deep learning framework, comprising:
and in the processed intermediate expression, replacing the first operator by a connection operator, constructing the connection relation between the connection operator and other operators according to the connection relation between the first operator and other operators, and replacing the operator identification and attribute data of the connection operator by the operator identification and attribute data of the first operator.
Optionally, the operator identification and attribute data of the first operator are stored in the attribute data of the join operator, and the replacing the operator identification and attribute data of the join operator with the operator identification and attribute data of the first operator includes:
And reading the operator identification and the attribute data of the first operator from the attribute data of the connection operator, and respectively replacing the operator identification and the attribute data of the connection operator.
Optionally, the step of replacing the first operator with a join operator is repeated until all first operators in the processed intermediate representation are replaced, and then the step of replacing the operator identity and attribute data of the join operator with the operator identity and attribute data of the first operator is repeated until all join operators in the processed intermediate representation are replaced.
Optionally, the model processing includes at least one of operator merging, quantization, graph cutting, model pruning.
Optionally, the method further comprises:
Converting said processed intermediate representation into a json file before said replacing said first operator with a join operator, and
After the replacing the operator identification and attribute data of the join operator with the operator identification and attribute data of the first operator, converting the json file into a format of a computational graph followed by the first deep learning framework.
Optionally, the first deep learning framework is the MXNet framework, and the join operator is an operator registered under the MXNet framework.
Optionally, the constructor of the join operator specifies other operators having a join relationship with the first operator through input and/or output parameters, so that the constructor of the join operator can construct the join relationship of the join operator with other operators according to the join relationship of the first operator with other operators.
Optionally, the constructor of the join operator further stores the input tensor number and the output tensor number of the first operator in the attribute data of the join operator.
In a fourth aspect, embodiments of the present disclosure provide a data center including the computing device described above as a server.
Unlike the prior art, in which every operator contained in the intermediate expression but not defined under the original framework must be defined under the original framework, the embodiments of the present disclosure use only one operator, namely the join operator, to replace all operators contained in the intermediate expression that are not defined and registered under the original framework, so the maintenance cost of new operators is greatly reduced. The join operator has the same connection relationships with other operators as the operator it replaces, but its attribute data and operator identifier differ from those of the replaced operator; therefore the attribute data and operator identifier of the join operator must be replaced with the attribute data and operator identifier of the operator contained in the intermediate expression but not defined under the original framework. According to the embodiments of the present disclosure, a developer only needs to define and register one join operator to solve the conversion problem of multiple operators in the computational graph that are not defined and registered under the original framework, thereby avoiding the development and maintenance of multiple new operators under the original framework.
Drawings
The above and other objects, features and advantages of the present disclosure will become more apparent by describing embodiments thereof with reference to the following drawings in which:
FIG. 1 illustrates a hierarchical structure diagram of a data center to which one embodiment of the present disclosure is applied;
FIG. 2 is a block diagram of a data center to which one embodiment of the present disclosure is applied;
FIG. 3 is a block diagram of the internal architecture of one server in a data center of one embodiment of the present disclosure;
FIG. 4 is a diagram of the control relationship between a central processing unit (CPU) and a neural network acceleration unit inside a server according to one embodiment of the present disclosure;
FIG. 5 is an internal block diagram of an acceleration cell core according to one embodiment of the present disclosure;
FIG. 6 is a software architecture diagram of a hierarchical design;
FIG. 7 is an example diagram of a computational graph conversion;
FIG. 8 is a partial flow chart of a computational graph processing method provided by an embodiment of the present disclosure;
FIG. 9 is a partial flow chart of a computational graph processing method provided by another embodiment of the present disclosure;
FIG. 10 is a schematic diagram of operator join relationships.
Detailed Description
The present disclosure is described below based on embodiments, but the present disclosure is not limited to only these embodiments. In the following detailed description of the present disclosure, certain specific details are set forth in detail. The present disclosure may be fully understood by one skilled in the art without a description of these details. Well-known methods, procedures, and flows have not been described in detail so as not to obscure the nature of the disclosure. The figures are not necessarily drawn to scale.
The following terms are used herein.
The acceleration unit is a processing unit designed to increase data processing speed in special-purpose fields (such as image processing or the various operations of neural network processing), addressing the fact that general-purpose processors are inefficient in these fields. It is often used together with a general-purpose CPU, is controlled by the general-purpose processor to perform the special-purpose or domain-specific processing, and thereby improves the computer's processing efficiency in that special purpose or field. It may also be referred to as an AI processing unit and may include a graphics processing unit (GPU), a field programmable gate array (FPGA), an application-specific integrated circuit (ASIC), and dedicated AI acceleration hardware (such as the neural network acceleration unit described herein).
On-chip memory, which is memory that is used alone in the primary core or secondary core and cannot be shared.
The command processor serves as a command interface between the acceleration unit and the central processing unit that drives the acceleration unit to work. The command processor receives the instructions that the central processing unit sends for the acceleration unit to execute and distributes them to the cores within the acceleration unit for execution. In addition, it is responsible for synchronizing the individual cores of the acceleration unit.
Lifecycle: an operand is not involved throughout an entire instruction sequence; the portion of the instruction sequence between the instruction in which it first appears and the instruction in which it is last used is the lifecycle of the operand. That is, after its lifecycle ends, it is no longer used and need not remain in on-chip memory.
A neural network generally refers to an artificial neural network (Artificial Neural Network, ANN), an algorithmic network that simulates the behavioral characteristics of animal neural networks and performs distributed parallel information processing. A classical neural network, which is also the simplest neural network structure, comprises three layers: an input layer, an output layer, and an intermediate layer (also called a hidden layer). The input layer, the output layer, and the intermediate layer each comprise a plurality of nodes.
In a neural network, the nodes are mathematized to produce mathematical models of the nodes, and the large number of node mathematical models in the neural network together constitute the neural network model.
Deep learning model: the concept of deep learning derives from the study of neural networks, and the neural networks concerned are also referred to as deep learning networks. Thus, in this sense, the deep learning model is also a neural network model. Both deep learning models and neural network models must be produced through training. Sample data is input into a designed network structure (i.e., the network structure has been determined), feature information is extracted through the multiple intermediate layers, and the weight parameters of the neurons are continuously corrected based on the output of the output layer so that the output tends toward a preset result, until the final weight parameters are determined. The trained deep learning model can then be applied in real scenarios, while data about how the deep learning model is used in those scenarios can be collected and used in turn to optimize the model.
A node is the minimum unit of independent operation in the deep learning model; it receives input and produces output after computation with its own weight parameters or with other parameters of the model (such as hyperparameters). The deep learning model may include various specific operations such as convolution and pooling, and correspondingly various operation nodes such as convolution nodes and pooling nodes. The deep learning model has multiple layers, each layer has multiple nodes, and the output of each node is the input of a node of the next layer. Specifically, a node includes the program for its specific operation and the related data. For example, a convolution operation node includes the program code used for the convolution operation and some data used in the convolution.
An operator refers to a set of operations built in a deep learning model to implement a particular function. Each layer of the deep learning model may contain multiple such operators. An operator may be called an operation in the TensorFlow framework and a layer in the Caffe framework. An operator can be regarded as a further abstraction on the basis of nodes; one operator may correspond to one or more nodes. Thus, operators and nodes sometimes characterize the same program code.
Instruction set: the set of instructions supported internally for operation; it mainly supports operations for deep learning operators such as Convolution, Pooling, ROI, and the like.
Quantization: converting the inputs of the operation nodes, the weight parameters of the operation nodes, and other parameters in the deep learning model from a high-precision data type to a low-precision data type, thereby reducing the required data throughput and storage space.
Inverse quantization (dequantization): the inverse process of quantization, i.e., converting the inputs of the operation nodes, and the weight parameters and other parameters of the operation nodes in the deep learning model, from a low-precision data type to a high-precision data type.
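As a rough illustration of these two definitions (a minimal sketch of symmetric linear quantization, not the scheme actually used by the acceleration unit):

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Quantization: float32 values -> int8 values plus a scale factor."""
    max_abs = np.abs(x).max()
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Inverse quantization: int8 values plus scale -> approximate float32."""
    return q.astype(np.float32) * scale

weights = np.random.randn(4, 4).astype(np.float32)   # hypothetical weights
q, s = quantize_int8(weights)
restored = dequantize_int8(q, s)                      # close to the original
```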
Intermediate expression (IR): since deep learning models differ in format depending on the model framework on which they depend, they can be classified into formats such as TensorFlow, PyTorch, and MXNet, and the code expressions of these deep learning models also differ. This presents great difficulty for the generality of deep learning model quantization. An intermediate expression is an expression into which deep learning model code in various formats is converted so as to conform to a single expression followed by one or more acceleration units. The meaning of each code statement in the deep learning model is analyzed, and the statement is translated into a general expression form according to its meaning, so that code statements with the same meaning in different deep learning models have the same expression in the intermediate expression. Tool products that convert the expressions of different deep learning models into intermediate expressions already exist.
Computational graph (computation graph): current deep learning frameworks mainly support two programming modes, declarative programming and imperative programming. In declarative programming, the program code first defines a neural network model structure that describes the computational logic but is not executed immediately; it is executed only when the program code that invokes the neural network model structure runs. The neural network model structure includes multiple operators (or symbolic representations of operators) and the connections between them, and can be represented graphically, so this structure is called a static computational graph. In imperative programming, the program code directly returns the result of the operation, and the definition and execution of the neural network model structure are synchronized. In general, static graphs facilitate optimization of the overall neural network model, which is more beneficial for performance, whereas dynamic graphs are very convenient for users to debug specific programs.
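A small illustration of the two programming modes, using MXNet purely as an example (other frameworks expose analogous APIs; the layer sizes are arbitrary):

```python
import mxnet as mx

# Declarative programming: the structure (a static computational graph) is
# defined first and is executed only when it is later bound and run.
data = mx.sym.Variable("data")
fc   = mx.sym.FullyConnected(data=data, num_hidden=10)
net  = mx.sym.SoftmaxOutput(data=fc, name="softmax")   # nothing computed yet

# Imperative programming: each statement returns its result immediately.
x = mx.nd.ones((2, 3))
y = x * 2 + 1            # computed as soon as this line executes
```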
Fig. 1 illustrates a hierarchical structure diagram of a data center as one scenario to which embodiments of the present disclosure are applied.
A data center is a globally coordinated network of specific devices used to transmit, accelerate, display, compute, and store data information over the Internet network infrastructure. In future development, data centers will also become an asset in enterprise competition. With the widespread use of data centers, artificial intelligence and the like are increasingly applied in data centers. As an important artificial intelligence technology, neural networks have been widely applied in the big data analysis operations of data centers.
In a conventional large data center, the network architecture is typically a three-layer architecture as shown in FIG. 1, namely a hierarchical interconnection network model (HIERARCHICAL INTER-networking model). This model contains the following three layers:
The access layer (ACCESS LAYER) 103, sometimes referred to as an edge layer, includes an access switch 130 and servers 140 to which the access switch is connected. Each server 140 is a processing and storage entity of a data center in which the processing and storage of large amounts of data is accomplished by these servers 140. The access switch 130 is a switch used to access these servers to the data center. An access switch 130 accesses a plurality of servers 140. The access switches 130 are typically located at the Top of the Rack, so they are also referred to as Top of Rack switches, which physically connect to the servers.
The aggregation layer (Aggregation Layer), sometimes referred to as the distribution layer, includes an aggregation switch 120. Each aggregation switch 120 connects multiple access switches while providing other services such as firewall, intrusion detection, network analysis, etc.
Core Layer (Core Layer) 101 includes Core switch 110. Core switch 110 provides high speed forwarding of packets into and out of the data center and provides connectivity for multiple convergence layers. The network of the entire data center is divided into an L3 layer routing network and an L2 layer routing network, and the core switch 110 provides a flexible L3 layer routing network for the network of the entire data center in general.
Typically, the aggregation switch 120 is a demarcation point for L2 and L3 layer routing networks, below the aggregation switch 120 is an L2 network, above is an L3 network. Each group of aggregation switches manages one transport point (POD, point Of Delivery), within each of which is a separate VLAN network. The server migration within the POD does not have to modify the IP address and default gateway because one POD corresponds to one L2 broadcast domain.
Spanning Tree Protocol (STP) is typically used between the aggregation switch 120 and the access switch 130. STP makes only one aggregation layer switch 120 available for a given VLAN network, and the other aggregation layer switches 120 are used only when a failure occurs (dashed lines in the figure above). That is, at the aggregation layer, there is no horizontal expansion, since only one aggregation switch 120 is working even if several are added.
Fig. 2 illustrates the physical connection of the components in the tiered data center of fig. 1. As shown in fig. 2, one core switch 110 is connected to a plurality of aggregation switches 120, one aggregation switch 120 is connected to a plurality of access switches 130, and one access switch 130 accesses a plurality of servers 140.
Server device
Since the server 140 is the real execution entity of the data center, FIG. 3 shows a block diagram of the internal structure of the server 140. The server 140 includes a memory 210, a central processing unit (CPU) 220, and various acceleration units connected by a bus. These acceleration units include an embedded neural network processor, i.e., the acceleration unit 230, a data transmission unit (DTU) 260, a graphics processing unit (GPU, not shown), an application-specific integrated circuit (ASIC, not shown), and a field programmable gate array (FPGA, not shown).
The architecture of traditional processors devotes a large part of the design to the control unit and the storage unit, while the space devoted to computing units is insufficient, so traditional processors are very effective at logic control but not efficient at massively parallel computation. Therefore, various specialized acceleration units have been developed to process computation in different functions and different fields more efficiently and to increase computation speed. The acceleration unit proposed by the present disclosure may be any one of them; these acceleration units are described below in turn.
The acceleration unit 230 is a processing unit that uses a data-driven parallel computing architecture for processing the large number of operations (e.g., convolution, pooling, etc.) of each neural network node. Because the data and intermediate results of these operations are closely interrelated throughout the computation and are used frequently, and because the memory capacity inside a CPU core is small under existing CPU architectures, a large number of frequent accesses to memory outside the core are required, resulting in low processing efficiency. With the acceleration unit, each core has an on-chip memory with a storage capacity suited to neural network computation, which avoids frequent accesses to memory outside the core, greatly improves processing efficiency, and improves computational performance.
A Data Transmission Unit (DTU) 260 is a wireless terminal device dedicated to converting serial data into IP data or converting IP data into serial data for transmission through a wireless communication network. The main function of the DTU is to transmit data from the remote device wirelessly back to the background center. At the front end, the DTU and the customer's device are connected through an interface. The DTU is firstly registered to a mobile GPRS network after power-on operation, and then socket connection is established with a background center arranged in the DTU. The background center is used as a service end of socket connection, and the DTU is a client end of socket connection. Thus, the DTU and the background software cooperate together, and after the connection is established, the front-end device and the center of the background can perform wireless data transmission through the DTU.
A graphics processing unit (GPU) is a microprocessor that specializes in image- and graphics-related operations. The GPU addresses the shortcoming that computing units occupy too little space in the CPU: it adopts a large number of computing units dedicated to graphics computation, reduces the graphics card's dependence on the CPU, and takes over some of the computation-intensive graphics and image processing work originally borne by the CPU.
An Application Specific Integrated Circuit (ASIC) refers to an integrated circuit that is designed and manufactured to meet the needs of a particular user and a particular electronic system. Because such integrated circuits are custom-built to the requirements of the user, their structure is often tailored to the specific user requirements.
A field programmable gate array (FPGA) is a product developed further on the basis of programmable devices such as PAL and GAL. As a semi-custom circuit in the field of application-specific integrated circuits (ASICs), it not only remedies the shortcomings of fully custom circuits but also overcomes the limitation on the number of gate circuits in earlier programmable devices.
Although the acceleration unit has the advantage of significantly higher execution efficiency than a conventional processor for a particular application or field, it is also under the control of the processing unit 220. Taking an acceleration unit dedicated to deep learning models as an example, various deep learning models, including the neurons of these models, the weight data of the neurons, and so on, are stored in the memory 210. These deep learning models are deployed to the acceleration unit 230 by the processing unit 220 in FIG. 3 when needed. Specifically, the processing unit 220 may inform the acceleration unit 230, in the form of instructions, of the storage locations of the deep learning model in the memory 210. The acceleration unit 230 may then address based on these locations and store the instructions to be executed in its on-chip memory. The processing unit 220 may also send the instructions to be executed by the acceleration unit 230 to the acceleration unit 230 in the form of instructions, and the acceleration unit 230 receives the instructions and stores them in its on-chip memory. Similarly, the acceleration unit 230 may acquire input data in the above manner. The acceleration unit 230 acquires the instructions to be executed and the input data to perform inference computation. The weight parameters of the nodes may be included in the instruction sequence of the deep learning model and fetched from the memory 210 by the acceleration unit 230. Of course, the weight parameters of the nodes may also be stored independently and fetched from the memory 210 by the acceleration unit 230 when needed. The processing unit 220 is understood here to be a hardware unit with scheduling and control capabilities, and may generally be a central processing unit (CPU), a microcontroller, a microprocessor, or the like.
Internal structure of processing unit and acceleration unit 230
The following illustrates how the processing unit controls the operation of the acceleration unit in conjunction with the internal block diagram of the processing unit and the acceleration unit 230 of fig. 4.
As shown in FIG. 4, the processing unit 220 includes a plurality of processor cores 222 and a cache 221 shared by the plurality of processor cores 222. Each processor core 222 includes an instruction fetch unit 223, an instruction decode unit 224, an instruction issue unit 225, and an instruction execution unit 226.
Instruction fetch unit 223 is configured to transfer instructions to be executed from memory 210 into an instruction register (which may be one of register files 229 shown in fig. 4 for storing instructions) and to receive a next fetch address or to obtain a next fetch address based on a fetch algorithm, e.g., comprising incrementing or decrementing the address based on the instruction length.
After fetching an instruction, the processing unit 220 enters an instruction decode stage, and the instruction decode unit 224 decodes the fetched instruction according to a predetermined instruction format to obtain the operand fetch information required by the fetched instruction, in preparation for the operation of the instruction execution unit 226. Operand fetch information refers, for example, to an immediate, a register, or other software/hardware capable of providing a source operand.
An instruction issue unit 225 is located between the instruction decode unit 224 and the instruction execution unit 226 for scheduling and control of instructions to efficiently distribute individual instructions to the different instruction execution units 226, enabling parallel operation of multiple instructions.
After instruction issue unit 225 issues instructions to instruction execution unit 226, instruction execution unit 226 begins executing instructions. But if the instruction execution unit 226 determines that the instruction should be executed by an acceleration unit, it forwards it to the corresponding acceleration unit for execution. For example, if the instruction is an instruction for neural network reasoning (inference), the instruction execution unit 226 no longer executes the instruction, but instead sends the instruction over the bus to the acceleration unit 230 for execution by the acceleration unit 230.
The acceleration unit 230 includes a plurality of cores 236 within it (4 cores are shown in fig. 4, but those skilled in the art will appreciate that other numbers of cores 236 may be included in the acceleration unit 230), a command processor 237, a direct memory access mechanism 235, and a bus channel 231.
The bus channel 231 is a channel in which instructions enter and exit the acceleration unit 230 from the bus. Bus lanes 231 may include PCIE lanes 232, I2C lanes 233, JTAG lanes 234, according to different mechanisms.
PCIE, PCI-Express, is a high-speed serial computer expansion bus standard proposed by Intel in 2001 and is intended to replace the old PCI, PCI-X and AGP bus standards. PCIE belongs to high-speed serial point-to-point dual-channel high-bandwidth transmission, and connected equipment allocates exclusive channel bandwidth without sharing bus bandwidth and mainly supports functions of active power management, error reporting, end-to-end reliability transmission, hot plug, service quality and the like. Its main advantage is high data transmission speed and considerable development potential. At present, most of the PCIE buses are PCIE GEN3, but PCIE GEN4, that is, a bus channel conforming to the PCI-express4.0 standard may also be used in the embodiments of the present disclosure.
The I2C channel 233 is a simple, bi-directional two-wire synchronous serial bus channel developed by Philips corporation. It requires only two wires to transfer information between devices connected to the bus.
JTAG is an acronym for Joint test action group (Joint Test Action Group) and is a common name in IEEE Standard 1149.1, entitled Standard test Access Port and boundary Scan architecture. This standard is used to verify the functionality of the printed circuit board produced by the design and test. JTAG was formally standardized by IEEE 1149.1-1990, and in 1994, supplementary documents were added to describe the Boundary Scan Description Language (BSDL). From then on, this standard is widely adopted by electronic enterprises worldwide. Boundary scan is almost synonymous with JTAG. JTAG channels 234 are bus channels that conform to this standard.
Direct memory access (DMA, direct Memory Access) mechanism 235 is a function provided by some computer bus architecture that enables data to be written directly from an additional device (e.g., external memory) into the on-chip memory of acceleration unit 230. This greatly improves the efficiency of data access over all data transfer between devices through command processor 237. Because of such a mechanism, the core of the acceleration unit 230 can directly access the memory 210, read parameters (such as weight parameters of each node) in the deep learning model, and the like, thereby greatly improving the data access efficiency. Although the direct memory access mechanism 235 is shown as being located between the processor 237 and the bus channel 231, the design of the acceleration unit 230 is not limited thereto. In some hardware designs, each core 236 may include a direct memory access mechanism 235 such that the cores 236 do not need to read data directly from the attached device via the command processor 237 and write to the on-chip memory of the acceleration unit 230.
The command processor 237 distributes the instructions sent by the processing unit 220 to the acceleration unit 230 among the cores 236 for execution. The instruction execution unit 226 either sends the instructions that need to be executed by the acceleration unit 230 to the acceleration unit 230, or informs the acceleration unit 230 of the storage location on the memory 210 of the instructions to be executed. After the sequence of instructions to be executed has entered through the bus channel 231, it is buffered in the command processor 237, and the command processor 237 selects cores 236 and allocates the instructions to them for execution. The instructions to be executed come from a compiled deep learning model. It should be appreciated that the sequence of instructions to be executed may include both instructions to be executed in the processing unit 220 and instructions that need to be executed in the acceleration unit 230.
Acceleration cell core
Fig. 5 is an internal structural diagram of an acceleration unit core according to one embodiment of the present disclosure.
In one embodiment, as shown in FIG. 5, the core 236 includes a tensor engine 310, a pooling engine 320, a convolution processing unit 330, an activation operation unit 380, a sequencer 350, an instruction buffer 340, an on-chip memory 360, and a constant buffer 370. The tensor engine 310, the pooling engine 320, the convolution processing unit 330, and the activation operation unit 380 are all hardware execution units. A hardware execution unit is a hardware module actually used to execute various operations. Still other hardware execution units are not shown in the figure.
The instruction sequence assigned to the core 236 by the command processor 237 first enters the instruction buffer 340 for buffering. The sequencer 350 then fetches instructions from the instruction buffer 340 in first-in, first-out order and assigns them to the individual hardware execution units for execution according to the nature of each instruction. The tensor engine 310 is responsible for tensor-related operations in the deep learning model. The pooling engine 320 is responsible for pooling operations in the deep learning model. The convolution processing unit 330 is responsible for convolution operations in the deep learning model. The activation operation unit 380 performs the operations corresponding to the activation functions in the deep learning model. The sequencer 350 decides which hardware execution unit to assign each fetched instruction to according to the nature of its operation, whether it is a convolution, a matrix multiplication, or a pooling operation.
The on-chip memory 360 is an in-core memory that stores the weight parameters in the deep learning model, as well as the inputs and various intermediate results when the deep learning model is actually used. The constant buffer 370 is a buffer that stores constant parameters in the deep learning model other than the weight parameters (e.g., hyperparameters). As described above, during the process in which the processing unit 220 configures the deep learning model in the acceleration unit 230 in advance, the processing unit 220 may send the locations in the memory 210 of the parameters in the model to the acceleration unit 230 in the form of instructions. These parameters include the weights of the nodes and other parameters (e.g., hyperparameters). For the weights, the acceleration unit 230 fetches them from the corresponding locations in the memory 210 and places them in the on-chip memory 360 as needed. For the other parameters, the acceleration unit 230 fetches them from the corresponding locations in the memory 210 and places them in the constant buffer 370 as needed. In addition, when the command processor 237 distributes executable instructions to the cores 236 for execution, the input parameters in the instructions (the inputs to the deep learning model) are also stored in the on-chip memory 360. Furthermore, when the tensor engine 310 and the pooling engine 320 perform convolution or pooling operations, the various intermediate results obtained are also stored in the on-chip memory 360.
Software architecture diagram
Improvement of deep learning models requires not only the support of the hardware layer described above but also continuous improvement at the software layer and the algorithm layer. Only by combining the underlying hardware support with the deep learning algorithm structures above it can a powerful compute engine be delivered.
FIG. 6 is a software architecture diagram of a hierarchical design. Hierarchical software design is the dominant design approach for large software projects. It reduces the dependencies between layers, so that a developer can focus on a single layer of the overall structure, and the implementation of an existing layer can easily be replaced with new program code.
As shown in the figure, the software architecture diagram includes, from top to bottom, an application layer 401, a framework layer 402, and a functional layer.
The application layer 401 comprises applications of the deep learning model in specific scenarios, such as vision 405, natural language 406, and recommendation 407. These applications are built using this architecture, and the architecture can also be invoked within an application to provide a runtime interface so that the application obtains inference capability.
The framework layer 402 integrates various deep learning frameworks such as TensorFlow 408, MXNet 409, and Caffe 410, and provides operator libraries and tools so that various algorithms can continue to be optimized and improved. TensorFlow 408 is a symbolic mathematical system based on dataflow programming that is widely used in the programming implementation of various machine learning algorithms. MXNet 409 is the deep learning library chosen by Amazon. Caffe 410, in full Convolutional Architecture for Fast Feature Embedding, is a deep learning framework characterized by expressiveness, speed, and modularity.
The functional layer includes a compilation stack 403 and a runtime stack 404. The compilation stack 403 is used to transform (converter 411), quantize (quantization 412), optimize (optimization 413), and compile (compilation 414) the various models. The transformation 411 converts the internal data of a model into an intermediate representation (IR) format. Quantization 412 converts parameters such as weights in the deep learning model, as well as the inputs to the deep learning model, from a high-precision data type to a low-precision data type. Optimization 413 performs operations such as the fusion of operators inside the model and the optimization of model links. Compilation 414 optimizes the model according to the hardware (e.g., a neural network processor) and generates a binary model that the hardware can recognize. The runtime stack 404 includes a runtime API 415, an execution manager 416, a user-mode driver 417, and a kernel-mode driver 418. The execution manager 416 performs batch scheduling of the resources allocated for execution. The runtime API 415 provides various interfaces that can be invoked at runtime. The user-mode driver 417 provides hardware commands and resource scheduling for the kernel mode. The kernel-mode driver 418 provides task scheduling, hardware control, and the like in kernel mode.
Multiple mainstream deep learning models can be integrated into one open-source platform, so that a developer can develop, compile, and run multiple deep learning models on a single open-source platform and avoid having to deploy and maintain multiple model frameworks. Moreover, the open-source platform can be extended to support more deep learning models.
Computational graph conversion
The computational graph conversion may convert a computational graph of one specification to a computational graph of another specification. The difference in specifications means that different deep learning frameworks have different expressions for the computational graph. The deep learning framework is used to support an integrated environment for computational graph compilation and execution. When a computational graph of one framework needs to be put into another framework process, computational graph transformations are required.
In particular, a large number of acceleration units with different instruction set architectures are currently emerging. To be able to deploy a computational graph under a specific deep learning framework onto an acceleration unit with a specific instruction set architecture for execution, adaptation and optimization processes need to be performed on the computational graph for that acceleration unit, and intermediate expressions were created for this purpose. An intermediate expression is a computational graph expression defined in terms of the instruction set architecture of the specified acceleration unit. The computational graphs under different frameworks are first converted into intermediate expressions, and developers then focus on model processing of the intermediate expressions for the specified acceleration unit. Meanwhile, the processed intermediate expression sometimes also needs to be returned to the original framework to continue training and model improvement, so the optimized intermediate expression must be converted back into a computational graph under that framework. This process can be summarized as: computational graph of the original framework -> intermediate expression -> model processing of the intermediate expression -> conversion of the model-processed intermediate expression back into the computational graph of the original framework. This process may be performed repeatedly.
The computational graph conversion is described below based on the software architecture diagram above. The framework layer 402 may provide computational graphs of deep learning models under various frameworks to the compilation stack 403 or the application layer 401. After receiving a computational graph, the compilation stack 403 needs to convert it into an intermediate expression and then perform processing such as optimization and quantization on the intermediate expression. The intermediate expression after optimization, quantization, and other processing may be deployed to the designated acceleration unit via the runtime stack 404. In this process, all the computational graphs generated are static computational graphs.
With continued reference to FIG. 7: as shown in the figure, the various deep learning frameworks support models A to M, and a first computational graph 701 of one particular deep learning framework is converted into an intermediate expression 702. The intermediate expression 702 predefines a number of operators and their attributes. The operators and attributes of the intermediate expression 702 are defined according to the instruction set of the specified acceleration unit. Converting the first computational graph 701 into the intermediate expression 702 includes converting each operator and its attributes in the first computational graph 701 into the corresponding operator and attributes defined by the intermediate expression 702. The conversion may be accomplished by means of the transformation 411. The transformation 411 defines mapping functions that implement operator conversion. A mapping function converts between operators that are functionally identical but have different attributes. The developer knows the specific function and the attribute definitions of each operator and predefines mapping functions for operators with the same function but different attributes. Referring to the operator mapping table shown in Table 1, the left column is the operator identifier of the first operator and the right column is the name of the mapping function.
Table 1
Of course, embodiments of the present disclosure are not limited to having to use a mapping function for operator attribute conversion, but may use other methods to accomplish operator attribute conversion.
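As one possible illustration of such a mapping function (the attribute names, default kernel size, and stride handling are assumptions made for this sketch, not the table entries of the embodiment), the following converts the TensorFlow-style string padding attribute of a convolution operator into an explicit two-dimensional pad attribute:

```python
def map_conv_padding(src_attrs: dict) -> dict:
    """Mapping function for a convolution operator: convert the "SAME"/"VALID"
    padding attribute into an explicit (pad_h, pad_w) attribute."""
    kh, kw = src_attrs.get("kernel", (3, 3))
    if src_attrs["padding"] == "SAME":
        pad = (kh // 2, kw // 2)     # keeps the spatial size for stride 1
    else:                            # "VALID": boundary data is not padded
        pad = (0, 0)
    return {"kernel": (kh, kw), "pad": pad}

# e.g. {"kernel": (3, 3), "padding": "SAME"} -> {"kernel": (3, 3), "pad": (1, 1)}
```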
Model processing is then performed on the intermediate expression 702, including quantization, graph cutting, model pruning, operator merging, and the like; after the model processing is complete, the intermediate expression is compiled and a second computational graph 703 is output. The quantization process inserts quantization nodes and inverse quantization nodes into the intermediate expression 702. Operator merging combines two or more operators into one operator according to the hardware. Graph cutting divides the intermediate expression into several sub-graphs to facilitate reading and processing by the acceleration unit. Model pruning is a model compression method that introduces sparsity into the dense connections of the deep learning model and reduces the number of non-zero weights by directly zeroing out "unimportant" weights. The second computational graph 703 can be deployed for execution on the designated acceleration unit.
Finally, if desired, the second computational graph 703 is converted back into a computational graph of the particular deep learning framework; specifically, this includes converting all the operators and their attributes in the second computational graph 703 into operators and attributes defined by that deep learning framework. This conversion can also be achieved with mapping functions. Of course, this step may also be implemented in other ways; for example, the attribute data of the native attributes of each operator may be extracted from the first computational graph 701, and, for the second computational graph obtained after model processing, the current attributes of some operators may be replaced with the stored native attributes, which converts the operator attributes more conveniently.
The following is an example. The first computational graph 701 includes operators A, B, C, D, E, and F, which are converted into operators A', B', C', D', E', and F' of the intermediate expression 702, whose attributes are changed accordingly. The computational graph containing A', B', C', D', E', and F' is provided to the compiler, which performs various processes such as quantization, graph cutting, operator merging, and model pruning to obtain the second computational graph 703, which contains A', H', G'(B'C'D'), E', and F', where H' is a newly added operator that may be a quantization or dequantization operator, and G' is a merged operator of B', C', and D' that has the combined functions of B', C', and D'. When the second computational graph 703 needs to be converted back, A', E', and F' are mapped back to A, E, and F, G' is mapped to G'', and the final computational graph is A, H', G'', E, F. Typically, A', B', C', D', E', and F' have the same operator identifiers as A, B, C, D, E, and F, or satisfy some preset specification.
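The example above can be pictured with simple data structures; the sketch below is purely illustrative and only lists the operators of each graph:

```python
# First computational graph 701 under the original framework.
graph_701 = ["A", "B", "C", "D", "E", "F"]

# Intermediate expression 702: same operators, attributes rewritten to the
# definitions of the acceleration unit's instruction set.
ir_702 = ["A'", "B'", "C'", "D'", "E'", "F'"]

# Second computational graph 703 after model processing: H' is a newly
# inserted (de)quantization operator, G' merges B', C' and D'.
graph_703 = ["A'", "H'", "G'", "E'", "F'"]

# Converted back to the original framework: registered operators map back
# directly, while operators such as H' and G' that are not registered there
# are handled by the join-operator mechanism described below.
final_graph = ["A", "H'", "G''", "E", "F"]
```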
In addition, since the computation graph contains individual operators and their connection relationships, it is also necessary to ensure that the connection relationships between operators are correct after the operator conversion, or that the connection relationships between operators can be directly reconstructed.
FIG. 8 is a partial flow chart of a computational graph processing method according to an embodiment of the present disclosure. As shown in fig. 8, the computational graph processing method includes the following steps.
Step S801 sequentially reads operators from the computational graph. The computational graph in this step is the processed generic intermediate expression, i.e., the second computational graph 703 in FIG. 7.
Step S802 determines whether the specified framework has a registered operator with the same operator identifier as the current operator. If yes, step S804 is performed; if no, step S803 is performed. The operator identifier of the current operator is compared with the operator identifiers of all operators registered under the specified framework to determine whether a registered operator with the same operator identifier exists. As long as the operator identifiers are the same, regardless of whether the attribute data of the two operators are the same, it is considered that a registered operator with the same operator identifier exists under the specified framework.
Step S803 constructs a join operator and replaces the current operator with it, where the join operator has the same connection relationship as the current operator. The connection relationship of an operator can be intuitively understood through the example shown in fig. 10: as shown in the figure, the connection relationship of operator D is three inputs A-C and two outputs. When the join operator is constructed, input and output information is acquired from the current operator, and the connection relationship of the join operator is constructed according to this information. At the same time, when the join operator is constructed, the operator identifier and attribute data of the current operator are stored as the attribute data of the join operator.
Step S804 calls the mapping function to convert the attribute data of the current operator.
Step S805 determines whether all operators in the computational graph have been processed. If yes, step S806 is performed, otherwise step S801 is performed.
According to the above steps, in the case where the specified framework does not support the current operator (i.e., the specified framework does not define the current operator), step S803 is invoked to construct a join operator to replace the current operator. The join operator is an operator registered under the specified framework. In a static computational graph, each operator is a segment of character expression; the character expression specifies the operator identifier, the attribute data, and the connection relationships with other operators of that operator. In this context, when a certain operator in a computational graph is referred to, what is actually referred to is the character expression in the computational graph that characterizes that operator. Likewise, the character expression of the join operator specifies its own operator identifier, attribute data, and connection relationships. The character expression of the join operator is generated by a constructor of the join operator. The constructor of the join operator designates, through input and/or output parameters, the other operators that have a connection relationship with the current operator, so that the constructor can construct the connection relationships between the join operator and those operators according to the connection relationships between the current operator and those operators; at the same time, the constructor also stores the operator identifier and attribute data of the current operator as the attribute data of the join operator. Finally, the constructor of the join operator is called to generate the character expression of the join operator and replace the character expression of the current operator in the computational graph.
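As a rough illustration of the constructor just described, the following Python sketch builds a join-operator node from an unregistered operator; the dict-based node layout and attribute field names are assumptions for this sketch, since the real character expression is framework-specific.

```python
# Sketch of a join-operator constructor: keep the same connections, stash the
# original identifier and attributes inside the join operator's own "attrs".
def build_join_operator(current_op):
    return {
        "op": "connection",                 # operator registered under the framework
        "name": current_op["name"],
        "inputs": current_op["inputs"],     # same connection relationship
        "attrs": {
            "input number": str(len(current_op["inputs"])),
            "output number": str(current_op.get("num_outputs", 1)),
            "op name": current_op["op"],               # original operator identifier
            "op attribute": str(current_op["attrs"]),  # original attribute data
        },
    }
```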
According to the above steps, in the case where the specified framework supports the current operator, as described above, the current operator and the corresponding registered operator in the specified framework have the same operator identifier but may have different attribute data. Regardless of whether the attribute data are the same, the mapping function may be used directly to perform the operator attribute conversion, the mapping function copying the attribute data of the corresponding registered operator of the specified framework into the attribute data of the current operator. Of course, it is also possible to first determine whether the attribute data of the two are the same, and to perform the operator attribute conversion with the mapping function only when they differ.
Based on steps S801 to S805, each operator in the computational graph is processed accordingly, thereby obtaining a converted computational graph in which the operators not supported by the specified framework have been replaced with join operators and the attribute conversion of the operators supported by the specified framework has been completed. This computational graph is provided to step S806 for further processing.
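Continuing the sketches above, the first pass over the graph (steps S801 to S805) might look roughly as follows; registered_ops, MAPPING_FUNCTIONS, and build_join_operator are the illustrative names used in the earlier sketches, not framework APIs.

```python
# First pass (S801-S805): convert registered operators with their mapping
# functions, replace unregistered ones with join operators.
def first_pass(graph, registered_ops):
    for i, node in enumerate(graph["nodes"]):                # S801: read in order
        if node["op"] in registered_ops:                     # S802: identifier found
            graph["nodes"][i] = MAPPING_FUNCTIONS[node["op"]](node)   # S804
        else:
            graph["nodes"][i] = build_join_operator(node)    # S803
    return graph                                             # S805: all processed
```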
Step S806 sequentially reads the join operators in the computational graph.
Step S807, for the current join operator, reads the operator identifier and attribute data of the operator it replaced from its attribute data, and uses them to replace the operator identifier and attribute data of the join operator, respectively.
Step S808 determines whether all join operators have been processed. If so, the computational graph is output; if not, the process returns to step S806.
As described above, each join operator stores, in its own attribute data, the operator identifier and attribute data of the operator it replaced. In steps S806-S808, therefore, for each join operator in the computational graph, the operator identifier and attribute data of that original operator are read from the join operator's attribute data and used to replace the operator identifier and attribute data of the join operator, respectively. After the replacement of all join operators in the computational graph has been completed, the resulting computational graph can be compiled and run under the specified framework.
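A corresponding sketch of the second pass (steps S806 to S808), again using the illustrative dict-based nodes and the stashing convention from the constructor sketch above:

```python
# Second pass (S806-S808): rewrite every join operator back to the operator it
# replaced, using the identity and attributes stored in its own "attrs".
import ast

def second_pass(graph):
    for node in graph["nodes"]:
        if node["op"] != "connection":                       # S806: join operators only
            continue
        stash = node["attrs"]
        node["op"] = stash["op name"]                        # S807: restore identifier
        node["attrs"] = ast.literal_eval(stash["op attribute"])  # restore attributes
    return graph                                             # S808: output the graph
```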
In this embodiment, a join operator with the same connection relationship as the current operator (an operator for which no registered operator with the same operator identifier exists under the specified framework) is constructed, the operator identifier and attribute data of the current operator are stored as the attribute data of the join operator, the current operator is then replaced with the join operator, and finally the operator identifier and attribute data of the join operator are replaced with the operator identifier and attribute data of the current operator. In this way, a single join operator suffices to replace all such operators, and the developer only needs to maintain one join operator, which helps reduce maintenance cost.
As a modification of the above embodiment, for each current operator that is not supported by the specified framework, a join operator is still constructed to replace it, and the join operator still has the same connection relationship as the current operator; however, instead of storing the operator identifier and attribute data of the current operator as the attribute data of the join operator, the operator identifier and attribute data of the current operator, together with their correspondence to the join operator (the correspondence including, for example, the position information of the join operator), are stored elsewhere. After each current operator not supported by the specified framework has been replaced with a join operator, the operator identifier and attribute data of the current operator corresponding to each join operator are retrieved from the stored correspondence to replace the operator identifier and attribute data of that join operator.
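A minimal sketch of this variation, assuming the same illustrative node layout; keying the correspondence by node position is only one possible choice.

```python
# Variation: keep the original identity and attributes in a separate table
# keyed by the join operator's position instead of inside the join operator.
def replace_with_external_table(graph, registered_ops):
    correspondence = {}                        # position -> (op name, attrs)
    for i, node in enumerate(graph["nodes"]):
        if node["op"] not in registered_ops:
            correspondence[i] = (node["op"], node["attrs"])
            graph["nodes"][i] = {"op": "connection", "name": node["name"],
                                 "inputs": node["inputs"], "attrs": {}}
    for i, (op_name, attrs) in correspondence.items():
        graph["nodes"][i]["op"] = op_name      # restore from the table
        graph["nodes"][i]["attrs"] = attrs
    return graph
```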
It should be emphasized that, although the above embodiment first replaces the current operators not supported by the specified framework with join operators one by one until all such operators have been replaced, and then replaces the operator identifiers and attribute data of the join operators with those of the original operators one by one until all join operators have been restored, embodiments of the present disclosure are not limited thereto. For example, it is also possible to replace one current operator not supported by the specified framework with a join operator, immediately replace the operator identifier and attribute data of that join operator with those of the current operator, and repeat these operations until all such operators have been processed.
FIG. 9 is a partial flow chart of a computational graph processing method according to another embodiment of the present disclosure. As shown in the figure, the computational graph processing method specifically includes steps S901 to S910. Steps S901 to S905 are the same as steps S801 to S805 and will not be described here again.
Step S906 converts the computational graph into a json file.
Step S907 sequentially reads join operators from the json file.
Step S908, for the current join operator, reads the operator identifier and attribute data of the first operator from its attribute data, and uses them to replace the operator identifier and attribute data of the join operator, respectively.
Step S909 determines whether all join operators have been processed; if not, the process jumps to step S907, and if yes, step S910 is executed.
Step S910 converts the json file back into a computational graph.
JSON (JavaScript Object Notation) is a lightweight data exchange format that is easy for people to read and write and easy for machines to parse and generate. In this embodiment, the computational graph is therefore converted into a json file in order to process the join operators, and the processed json file is then reloaded into a computational graph and returned to the specified deep learning framework. It should be noted, however, that some frameworks do not support converting a computational graph to a json file, so this conversion is performed only under frameworks that support it; for example, the MxNet framework supports this conversion.
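Under MxNet, the round trip of steps S906 to S910 could be sketched as follows; Symbol.tojson and mx.sym.load_json are MxNet 1.x APIs, while rewrite_join_operators stands in for steps S907 to S909 and is an assumed helper rather than something provided by the framework.

```python
# Sketch of the json round trip (S906-S910) under a framework that supports it.
import json
import mxnet as mx

def process_via_json(sym, rewrite_join_operators):
    graph = json.loads(sym.tojson())             # S906: computational graph -> json
    rewrite_join_operators(graph["nodes"])       # S907-S909: restore join operators
    return mx.sym.load_json(json.dumps(graph))   # S910: json -> computational graph
```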
The above embodiments are further explained below using an example. With the MxNet framework as the specified framework, the json expression of one operator, BatchNorm, is composed of key-value pairs consisting of keys and values. The key "op" corresponds to the operator identifier "BatchNorm"; the key "name" corresponds to the operation name "resnetv10_stage1_batchnorm1_fwd"; the key "attrs" corresponds to the attribute data, which itself consists of several key-value pairs, namely {"axis":"1","eps":"9.999999747378752e-06","fix_gamma":"False","momentum":"0.8999999761581421","use_global_stats":"False"}; and the key "inputs" corresponds to the array [[23,0,0],[24,0,0],[25,0,0],[26,0,1],[27,0,1]], which indicates the connection relationship of the operator BatchNorm.
The join operator whose json structure is converted from BatchNorm is likewise composed of key-value pairs. The key "op" corresponds to the operator identifier "connection"; the key "name" corresponds to the operation name and can be set to "resnetv10_stage1_batchnorm_fwd"; and the key "inputs" corresponds to the array [[23,0,0],[24,0,0],[25,0,0],[26,0,1],[27,0,1]]. The array corresponding to the key "inputs" indicates the connection relationship and is identical to that of BatchNorm, which indicates that the two have the same connection relationship. That is, the join operator and the operator BatchNorm in the computational graph both have the keys "op", "name", and "inputs".
In addition, the join operator also has an "attrs", but the structure of the "attrs" is different from the "attrs" in BatchNorm, as shown in the following table.
Table 2
Attribute | Meaning | Initial value
input number | number of input tensors | 1
output number | number of output tensors | 1
op name | type of the original operator | NULL
op attribute | attribute list of the original operator | NULL
According to an embodiment of the present disclosure, the attribute data "attrs" of the join operator constructed for the operator BatchNorm is {
"input number": "3",
"output number": "2",
"op name": "BatchNorm",
"op attribute": "{"axis":"1","eps":"9.999999747378752e-06","fix_gamma":"False","momentum":"0.8999999761581421","use_global_stats":"False"}",
}.
That is, the attribute data corresponding to the key "attrs" of the original operator BatchNorm is used as the attribute value of "op attribute".
In a specific implementation, when the join operator is constructed, the operators having a connection relationship with BatchNorm are used as input parameters; the same connection relationship as BatchNorm (i.e., the "inputs" data) is then established according to the number of input tensors, the number of output tensors, and the input parameters; the operator identifier "BatchNorm" is stored as the value of "op name"; and the attribute data corresponding to "attrs" of the operator BatchNorm is stored as the value of "op attribute". During replacement, the original operator identifier is taken out of "op name" to replace the value corresponding to "op", and all data is taken out of "op attribute" to replace the attribute data of "attrs" as a whole.
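Putting the pieces of this example together, the two node entries described above might look as follows when written out as Python dicts; the values are taken from the text, while the exact layout of a real MxNet json file may differ slightly.

```python
# The original BatchNorm node and the join ("connection") node that replaces it.
batchnorm_node = {
    "op": "BatchNorm",
    "name": "resnetv10_stage1_batchnorm1_fwd",
    "attrs": {
        "axis": "1",
        "eps": "9.999999747378752e-06",
        "fix_gamma": "False",
        "momentum": "0.8999999761581421",
        "use_global_stats": "False",
    },
    "inputs": [[23, 0, 0], [24, 0, 0], [25, 0, 0], [26, 0, 1], [27, 0, 1]],
}

connection_node = {
    "op": "connection",
    "name": "resnetv10_stage1_batchnorm_fwd",
    "attrs": {
        "input number": "3",
        "output number": "2",
        "op name": "BatchNorm",
        "op attribute": str(batchnorm_node["attrs"]),   # original attributes stashed
    },
    "inputs": [[23, 0, 0], [24, 0, 0], [25, 0, 0], [26, 0, 1], [27, 0, 1]],  # same connections
}
```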
In this example, the number of input tensors and the number of output tensors of the current operator are used as attributes of the join operator, and the connection relationship of the join operator is constructed in combination with the input parameters, so that the join operator and the current operator have the same connection relationship data; the attribute data and operator identifier of the current operator are stored in the attribute data of the join operator so that the replacement can be completed in the subsequent steps. The operator types and attributes in the computational graph are modified by means of a json file, so the replacement operation for the join operators can be conveniently completed using the json file. Finally, the processed json file is reloaded into a computational graph.
The technical scheme of the embodiments of the present disclosure can be applied to most frameworks, such as the TensorFlow framework, the MxNet framework, the Caffe framework, and the like, and therefore has a certain universality.
Further, although the above description takes a server of a data center as an example of the execution subject of the present disclosure, the present disclosure is not limited thereto. In theory, the execution subject may be any computing device, including the server and the terminal device described above; for a terminal device, as long as its processor, memory, network throughput capability, and the like meet the operation requirements of the deep learning model, the deep learning model may be deployed on it to perform various kinds of computational graph processing, including the computational graph processing scheme provided by the embodiments of the present disclosure.
Commercial value of embodiments of the present disclosure
Deep learning models currently have wide and successful application scenarios, so even a minor improvement to a deep learning model becomes important, not only on a technical level but also on a business level. Taking the face recognition field as an example, video surveillance is collected through a camera, face images are recognized through a deep learning model and compared with faces stored in the cloud, so that criminals in the surveillance video can be identified. In the field of speech recognition, speech is recognized through a deep learning model to realize simultaneous interpretation. These application scenarios can bring tremendous commercial benefit.
In the engineering practice of deep learning models, the computational graphs under various frameworks need to be converted into computational graphs adapted to a specified acceleration unit, and those adapted computational graphs then need to be returned to the original framework, so as to realize the organic combination and mutual promotion of the algorithm research and the engineering application of the model. The computational graph processing method provided by the embodiments of the present disclosure helps reduce the number of mapping functions required, and therefore has application prospects and commercial value.
Those skilled in the art will appreciate that the present disclosure may be implemented as a system, a method, or a computer program product. Accordingly, the present disclosure may be embodied entirely in hardware, entirely in software (including firmware, resident software, and micro-code), or in a combination of software and hardware. Furthermore, in some embodiments, the present disclosure may also be embodied in the form of a computer program product in one or more computer-readable media having computer-readable program code embodied therein.
Any combination of one or more computer readable media may be employed. The computer readable medium may be a computer readable signal medium or a computer readable storage medium. The computer readable storage medium is, for example but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of a computer-readable storage medium include an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical memory, a magnetic memory, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with a processing unit, apparatus, or device.
The computer readable signal medium may include a propagated data signal with computer readable program code embodied therein, for example in baseband or as part of a carrier wave. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electromagnetic signals, optical signals, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., and any suitable combination of the foregoing.
Computer program code for carrying out embodiments of the present disclosure may be written in one or more programming languages or a combination thereof. The programming languages include object-oriented programming languages such as Java and C++, and may also include conventional procedural programming languages such as C. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The foregoing is merely a preferred embodiment of the present disclosure and is not intended to limit the present disclosure; various modifications and changes may be made to the present disclosure by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present disclosure should be included in the protection scope of the present disclosure.

Claims (21)



