Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

fma

Here are 36 public repositories matching this topic...

Implementations of SIMD instruction sets for systems which don't natively support them.

  • UpdatedFeb 20, 2026
  • C

Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)

  • UpdatedJul 15, 2019

Recommending Music using a Convolutional Neural Network.

  • UpdatedMay 4, 2019
  • Python

This package contains a macro for converting expressions to use muladd calls and fused-multiply-add (FMA) operations for high-performance in the SciML scientific machine learning ecosystem

  • UpdatedJan 5, 2026
  • Julia
Systolic_MAC_with_DFT

GF180 ASIC tapeout of a 2x2 MAC with DFT infrastructure

  • UpdatedJan 13, 2026
  • Verilog
transmutation

FMA Transmutation Circles

  • UpdatedFeb 13, 2019
  • TypeScript

Computing FLOPs with Intel Software Development Emulator (Intel SDE)

  • UpdatedOct 22, 2023
  • Python

Data pipeline and training pipeline for 🎵 music genre classification from FMA dataset

  • UpdatedDec 8, 2022
  • Jupyter Notebook

A collection of highly optimized, SIMD-accelerated (SSE, AVX, FMA, NEON) functions written in C

  • UpdatedOct 19, 2021
  • C

software implementation of Fused-Multiply Add for 64-bit floats

  • UpdatedAug 6, 2020
  • Go

Fast avx2/fma3 dgemm and sgemm subroutines for medium to large matrices(>2000*2000) on haswell/skylake/zen processors, with performances comparable to MKL.

  • UpdatedMar 23, 2020
  • C

X86-64 bilateral instruction tokenizer implemented in C. Supports the following processor extensions: AES, AVX, AVX2, AVX512, FMA, MMX, SSE, SSE2, SSE3, SSE4, x87(FPU), VMX. In order to ease testing, a diassembler which transforms tokens into compilable assembly (for NASM compiler) has been implemented.

  • UpdatedOct 2, 2022
  • C

VectorFFT is a vectorized, pure C FFT library optimized for x86 processors (AVX-512, AVX2, SSE2) with zero external dependencies. It implements mixed-radix algorithms for common sizes and Bluestein's method for arbitrary lengths, with OpenMP multi-threading for large transforms. Designed for both digital signal processing and financial applications

  • UpdatedNov 12, 2025
  • C

FMA convnet features

  • UpdatedMay 24, 2017

Fused Multiply Add (FMA) operation on Altera DE1-SoC Cyclone V development board.

  • UpdatedAug 7, 2017
  • Verilog

Python library to detect CPU SIMD capabilities.

  • UpdatedFeb 16, 2021
  • C
flow

offline artistic image generator

  • UpdatedApr 29, 2020
  • C

Test FMA on RaspberryPI CPU&GPU

  • UpdatedOct 16, 2021
  • C

Improve this page

Add a description, image, and links to thefma topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thefma topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp