I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on code under GPL or another license that is incompatible with OpenCV.
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There are accuracy tests, performance tests and test data in the opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

fengyuentau mentioned this pull request

Feb 4, 2023

Add data & models for GELU layer opencv/opencv_extra#1044

Merged

fengyuentau added the category: dnn label

Feb 4, 2023

fengyuentau added this to the 4.8.0 milestone

Feb 4, 2023

fengyuentau requested review from alalek and rogday

February 4, 2023 16:16

alalek reviewed

Feb 5, 2023

Member

alalek left a comment

Looks good!

modules/dnn/src/layers/elementwise_layers.cpp (outdated, resolved)

fengyuentau requested a review from alalek

February 6, 2023 01:56

alalek approved these changes

Feb 7, 2023

Member

alalek left a comment

Thank you!

add gelu and gelu approximation

d4351b1

fengyuentau force-pushed the add_gelu branch from c91f12d to d4351b1

February 9, 2023 03:33

Member, Author

fengyuentau commented Feb 9, 2023

@rogday Could you review if possible?

vpisarev removed the request for review from rogday

February 10, 2023 08:45

alalek reviewed

Feb 10, 2023

modules/dnn/src/opencl/activations.cl

Comment on lines 310 to 324

		__kernel void GeluForward(const int n, __global T* in, __global T* out)
		{
		    int index = get_global_id(0);
		    if(index < n)
		        out[index] = (T)0.5f * in[index] * ( (T)1.f + erf(in[index] * M_SQRT1_2) );
		}

		__kernel void GeluApproximationForward(const int n, __global T* in, __global T* out,
		                                       const KERNEL_ARG_DTYPE sqrt_2_pi,
		                                       const KERNEL_ARG_DTYPE coef_sqrt_2_pi)
		{
		    int index = get_global_id(0);
		    if(index < n)
		        out[index] = (T)0.5f * in[index] * ( (T)1.f + tanh(in[index] * (sqrt_2_pi + coef_sqrt_2_pi * in[index] * in[index])) );
		}

Member

alalek, Feb 10, 2023

Please use this OpenCL code:

		__kernel void GeluForward(const int n, __global T* in, __global T* out)
		{
		    int index = get_global_id(0);
		    if (index < n)
		    {
		        T x = in[index];
		        out[index] = (T)0.5f * x * ( (T)1.f + erf(x * M_SQRT1_2) );
		    }
		}

		__kernel void GeluApproximationForward(const int n, __global T* in, __global T* out)
		{
		    // see GeluApproximationConstants from .cpp
		    const T sqrt_2_pi = 0.7978845834732056f;
		    const T coef_sqrt_2_pi = 0.044714998453855515f * sqrt_2_pi;
		    int index = get_global_id(0);
		    if(index < n)
		    {
		        T x = in[index];
		        out[index] = (T)0.5f * x * ( (T)1.f + tanh(x * (sqrt_2_pi + coef_sqrt_2_pi * x * x)) );
		    }
		}

and drop the setKernelParams() method.
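For readers who want to check the two formulas on the CPU, here is a minimal C++ sketch of the same arithmetic as the kernels above. It is not code from this PR: the function names `geluExact`, `geluApprox` and `maxAbsDiff` are hypothetical, and only the constants are copied from the suggested OpenCL code. The exact form is 0.5 * x * (1 + erf(x / sqrt(2))), and the approximation replaces erf with tanh(sqrt(2/pi) * (x + 0.044715 * x^3)).

```cpp
#include <cmath>

// Exact GELU: 0.5 * x * (1 + erf(x / sqrt(2))).
// Hypothetical helper name, for illustration only.
static float geluExact(float x)
{
    const float sqrt1_2 = 0.70710678118654752440f; // 1 / sqrt(2), the value of M_SQRT1_2
    return 0.5f * x * (1.0f + std::erf(x * sqrt1_2));
}

// Tanh approximation of GELU, using the constants from the suggested kernel.
// Hypothetical helper name, for illustration only.
static float geluApprox(float x)
{
    const float sqrt_2_pi = 0.7978845834732056f;                    // sqrt(2 / pi) as float
    const float coef_sqrt_2_pi = 0.044714998453855515f * sqrt_2_pi; // 0.044715 * sqrt(2 / pi)
    return 0.5f * x * (1.0f + std::tanh(x * (sqrt_2_pi + coef_sqrt_2_pi * x * x)));
}

// Largest absolute difference between the two forms on `steps` evenly
// spaced points in [lo, hi]. With fewer than two steps only `lo` is sampled.
static float maxAbsDiff(float lo, float hi, int steps)
{
    float worst = 0.0f;
    for (int i = 0; i < steps; ++i)
    {
        const float t = (steps > 1) ? static_cast<float>(i) / static_cast<float>(steps - 1) : 0.0f;
        const float x = lo + (hi - lo) * t;
        const float d = std::fabs(geluExact(x) - geluApprox(x));
        if (d > worst)
            worst = d;
    }
    return worst;
}
```

Calling `maxAbsDiff(-3.0f, 3.0f, 601)` gives a rough idea of how closely the tanh form tracks the erf form on that interval. Reading the input once into a local `x`, as the suggested kernel does, keeps the expression short and avoids repeated loads of the same element.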

Member, Author

fengyuentau, Feb 10, 2023

Okay

drop setKernelParams

46dfb70

alalek merged commit c2b7c1f into opencv:4.x

Feb 10, 2023

a-sajjad72 pushed a commit to a-sajjad72/opencv that referenced this pull request

Mar 30, 2023

Merge pull request opencv#23219 from fengyuentau:add_gelu

cb74d64

Add GELU layer for vision transformers
* add gelu and gelu approximation
* drop setKernelParams

asmorkalov mentioned this pull request

May 31, 2023

(5.x) Merge 4.x #23718

Merged

geversonsto pushed a commit to stodev-com-br/opencv that referenced this pull request

Jun 3, 2023

Merge pull request opencv#23219 from fengyuentau:add_gelu

5cd9d50

Add GELU layer for vision transformers
* add gelu and gelu approximation
* drop setKernelParams

fengyuentau deleted the add_gelu branch

February 21, 2024 08:19

fengyuentau mentioned this pull request

Feb 21, 2024

ONNX conformance test results #21078

Open

48 tasks

Labels

category: dnn (onnx)

ONNX support issues in DNN module

category: dnn feature

Add GELU layer for vision transformers #23219

fengyuentau commented Feb 4, 2023

Pull Request Readiness Checklist

2 participants