A 32-bit MIPS / RISC-V core & SoC, 1.55 DMIPS/MHz, 2.96 CM/MHz
f32c/f32c
f32c is a retargetable, scalar, pipelined, 32-bit processor core which can execute subsets of either RISC-V or MIPS instruction sets. It is implemented in parametrized VHDL which permits synthesis with different area / speed tradeoffs, and includes a branch predictor, exception handling control block, and optional direct-mapped caches. The RTL code also includes SoC modules such as multi-port SDRAM and SRAM controllers, video framebuffers with composite (PAL), HDMI, DVI and VGA outputs with simple 2D acceleration for sprites and windows, a floating point vector processor, SPI, UART, PCM audio, GPIO, PWM outputs and a timer, as well as glue logic tailored for numerous popular FPGA development boards from various manufacturers.
In synthetic integer benchmarks the core yields 3.3 CoreMark/MHz and 1.46 DMIPS/MHz with code and data stored in on-chip block RAMs. When configured with 16 KB of instruction and 8 KB of data cache, and with code and data stored in external SDRAM, the core yields 3.13 CoreMark/MHz and 1.39 DMIPS/MHz.
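To put the two configurations side by side, the short sketch below computes how much of the block-RAM performance is retained when running from cached SDRAM. It uses only the figures quoted above; the dictionary and function names are illustrative and not part of the f32c sources.

```python
# Per-MHz figures quoted in the paragraph above.
BRAM = {"coremark": 3.3, "dmips": 1.46}    # code and data in on-chip block RAM
SDRAM = {"coremark": 3.13, "dmips": 1.39}  # 16 KB I-cache, 8 KB D-cache, external SDRAM


def retained_fraction(cached: float, on_chip: float) -> float:
    """Fraction of on-chip per-MHz performance retained with cached SDRAM."""
    return cached / on_chip


for name in ("coremark", "dmips"):
    frac = retained_fraction(SDRAM[name], BRAM[name])
    print(f"{name}: {frac:.1%} of block-RAM performance")
```

With these numbers both benchmarks retain roughly 95% of their block-RAM score, which suggests the caches hide most of the SDRAM latency for these particular workloads.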
A performance-tuned f32c SoC which includes a timer and a UART occupies only 1048 6-input LUTs, while still being able to execute gcc-generated code when synthesized in the most compact configuration which consumes just 697 (649 logic plus 48 memory) LUTs.
A floating point vector processor can optionally be synthesized. It has been tested on Xilinx Spartan-6 (xc6slx25) and 7-series (xc7a35i, xc7a102t, xc7z010), Altera Cyclone-4 (EP4CE22) and MAX-10 (10M50DAF), Lattice ECP3 (LFE3-150EA) and ECP5 (LFE5UM-85F). On Artix-7 it uses 3148 LUTs, 64K BRAM, 38 DSP multipliers (36 for the divider unit) and can provide up to 3 MFLOPs/MHz.
The Fmax depends on core configuration and FPGA silicon, and ranges from around 115 MHz for 90 nm FPGAs (such as Xilinx S3E / S3A or Lattice XP2) up to 185 MHz for the latest generations of 6-input LUT FPGAs such as Artix-7.
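As a rough illustration, a per-MHz benchmark figure can be scaled to an absolute score by multiplying it by the clock frequency. The sketch below does this for the two Fmax values above, under the assumption (not stated in this README) that the benchmarked configuration actually reaches those clocks, so the results are upper-bound estimates rather than measurements. The function name is hypothetical.

```python
def estimated_score(per_mhz: float, clock_mhz: float) -> float:
    """Scale a per-MHz benchmark figure to an absolute score at a given clock."""
    return per_mhz * clock_mhz


COREMARK_PER_MHZ = 3.3  # block-RAM figure quoted in this README

# Assumption: the benchmarked configuration closes timing at these clocks.
for label, fmax_mhz in (("90 nm FPGA", 115), ("Artix-7 class FPGA", 185)):
    score = estimated_score(COREMARK_PER_MHZ, fmax_mhz)
    print(f"{label}: roughly {score:.1f} CoreMark at {fmax_mhz} MHz")
```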
Configurable options include:
- C_arch: RISC-V or MIPS ISA
- C_big_endian: bus endianness
- C_mult_enable: synthesize multiplier unit
- C_branch_likely: support branch delay slot annulling
- C_sign_extend: support sign extension instructions
- C_movn_movz: support conditional move instructions
- C_ll_sc: support atomic read-modify-write constructs
- C_branch_prediction: synthesize branch predictor
- C_bp_global_depth: global branch history trace size
- C_result_forwarding: synthesize result bypasses
- C_load_aligner: synthesize load aligner
- C_full_shifter: pipelined instead of iterative shifter
- C_icache_size: instruction cache size (0 to 64 KB)
- C_dcache_size: data cache size (0 to 64 KB)
- C_debug: synthesize single-stepping debug module
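When experimenting with several configurations, it can help to sanity-check option values before copying them into a top-level VHDL generic map. The following is a minimal sketch of such a check for the two cache-size options, based only on the 0 to 64 KB range documented above. The helper `check_cache_sizes` and the dictionary layout are hypothetical and not part of the f32c sources, and treating a missing option as 0 is an assumption.

```python
# Hypothetical helper, not part of f32c: validates option values held in a dict.
CACHE_OPTIONS = ("C_icache_size", "C_dcache_size")  # sizes in KB, per the list above


def check_cache_sizes(config: dict) -> list:
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    for name in CACHE_OPTIONS:
        size_kb = config.get(name, 0)  # assumption: a missing option means 0 KB
        if not 0 <= size_kb <= 64:
            problems.append(
                f"{name}={size_kb} is outside the documented 0 to 64 KB range"
            )
    return problems


# Example: the cached SDRAM configuration mentioned earlier in this README.
print(check_cache_sizes({"C_icache_size": 16, "C_dcache_size": 8}))
# An out-of-range value is reported rather than silently accepted.
print(check_cache_sizes({"C_icache_size": 128, "C_dcache_size": 8}))
```

The check deliberately covers only the range stated in this README; any further constraints on legal sizes would have to be taken from the VHDL sources.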
Pre-compiled gcc-based toolchains for Windows, OS-X and Linux can be found at the FPGArduino page, together with pre-built demo bitstreams for various Xilinx, Altera and Lattice FPGAs, and with further instructions on how to compile RISC-V / MIPS executables using the Arduino IDE.
All VHDL modules are BSD licensed. The majority of software libraries are borrowed from FreeBSD, while some originate from other projects and may be subject to an MIT-style license.