Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Benchmark results #89

Open
Open
Labels
performanceCPU and memory usage - results and comparisons
@ggerganov

Description

@ggerganov

Encoder

Collection ofbench results for various platforms and devices.
If you want to submit info about your device, simply run thebench tool or theextra/bench-all.sh and report the results in the comments below.

Suggestions for better summary of the results are welcome

CPUOSConfigModelThLoadEnc.Commit
MacBook M1 ProMacOS 13.0.1NEON BLAStiny871102206fc93
MacBook M1 ProMacOS 13.0.1NEON BLASbase896220206fc93
MacBook M1 ProMacOS 13.0.1NEON BLASsmall8233685206fc93
MacBook M1 ProMacOS 13.0.1NEON BLASmedium86031928206fc93
MacBook M1 ProMacOS 13.0.1NEON BLASlarge811583350206fc93
---
MacBook M1 ProMacOS 13.0.1NEON BLASsmall12512605206fc93
MacBook M1 ProMacOS 13.0.1NEON BLASsmall4255884206fc93
---
Mac Mini M1MacOSNEON BLAStiny462194fcf515d
Mac Mini M1MacOSNEON BLASbase481380fcf515d
Mac Mini M1MacOSNEON BLASsmall42041249fcf515d
Mac Mini M1MacOSNEON BLASmedium48763980fcf515d
Mac Mini M1MacOSNEON BLASlarge418767979fcf515d
---
Ryzen 9 3900XUbuntu 20.04AVX2tiny8107422fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2base8137880fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2small82802874fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2medium86929610fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2large8131716917fcf515d
---
Ryzen 9 3900XUbuntu 20.04AVX2 BLAStiny4120780fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2 BLASbase41511173fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2 BLASsmall42893062fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2 BLASmedium47119175fcf515d
Ryzen 9 3900XUbuntu 20.04AVX2 BLASlarge4128216050fcf515d
---
Ryzen 9 5950XUbuntu 22.04AVX2tiny8135197fcf515d
Ryzen 9 5950XUbuntu 22.04AVX2base8176421fcf515d
Ryzen 9 5950XUbuntu 22.04AVX2small83571393fcf515d
Ryzen 9 5950XUbuntu 22.04AVX2medium88554404fcf515d
Ryzen 9 5950XUbuntu 22.04AVX2large815768118fcf515d
---
Raspberry Pi 4NEONtiny4143613839fcf515d
Raspberry Pi 4NEONbase4189430552fcf515d
---
iPhone 13 MiniiOS 16.0NEON BLASbase4971091fcf515d
---
MacBook M1 ProVivaldiWASMtiny81333785fcf515d
MacBook M1 ProVivaldiWASMbase81728253fcf515d
---
MacBook M1 ProChromeWASMtiny81343776fcf515d
MacBook M1 ProChromeWASMbase81688200fcf515d
---
MacBook M1 ProFirefoxWASMtiny81372626fcf515d
MacBook M1 ProFirefoxWASMbase81836226fcf515d

memcpy

MacBook M1 Pro

./bench -w 1 -t 1memcpy: 37.59 GB/s

Ryzen 9 5950X

./bench -w 1 -t 1memcpy: 16.74 GB/s

ggml_mul_mat

MacBook M1 Pro

./bench -w 2 -t 1ggml_mul_mat:    64 x    64: F16    330.6 GFLOPS (128 runs) / F32    466.0 GFLOPS (128 runs)ggml_mul_mat:   128 x   128: F16    737.5 GFLOPS (128 runs) / F32    838.9 GFLOPS (128 runs)ggml_mul_mat:   256 x   256: F16    938.6 GFLOPS (128 runs) / F32   1062.3 GFLOPS (128 runs)ggml_mul_mat:   512 x   512: F16   1312.5 GFLOPS (128 runs) / F32   1835.5 GFLOPS (128 runs)ggml_mul_mat:  1024 x  1024: F16   1765.1 GFLOPS (128 runs) / F32   2041.4 GFLOPS (128 runs)ggml_mul_mat:  2048 x  2048: F16   1784.3 GFLOPS (104 runs) / F32   1859.2 GFLOPS (109 runs)ggml_mul_mat:  4096 x  4096: F16   1855.1 GFLOPS ( 14 runs) / F32   1873.3 GFLOPS ( 14 runs)

Ryzen 9 5950X

WHISPER_OPENBLAS=1 make -j bench && ./bench -w 2 -t 1ggml_mul_mat:    64 x    64: F16     56.3 GFLOPS (128 runs) / F32     70.2 GFLOPS (128 runs)ggml_mul_mat:   128 x   128: F16     47.8 GFLOPS (128 runs) / F32     67.0 GFLOPS (128 runs)ggml_mul_mat:   256 x   256: F16    185.1 GFLOPS (128 runs) / F32    332.7 GFLOPS (128 runs)ggml_mul_mat:   512 x   512: F16    386.4 GFLOPS (128 runs) / F32    658.6 GFLOPS (128 runs)ggml_mul_mat:  1024 x  1024: F16    636.2 GFLOPS (128 runs) / F32   1012.0 GFLOPS (128 runs)ggml_mul_mat:  2048 x  2048: F16    950.9 GFLOPS ( 56 runs) / F32   1296.8 GFLOPS ( 76 runs)ggml_mul_mat:  4096 x  4096: F16   1168.6 GFLOPS (  9 runs) / F32   1403.1 GFLOPS ( 11 runs)

Metadata

Metadata

Assignees

No one assigned

    Labels

    performanceCPU and memory usage - results and comparisons

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions


      [8]ページ先頭

      ©2009-2025 Movatter.jp