Riffusion

From Wikipedia, the free encyclopedia
Music-generating machine learning model
Riffusion
Developers:
  • Seth Forsgren
  • Hayk Martiros
Initial release: December 15, 2022
Repository: github.com/hmartiro/riffusion-inference
Written in: Python
Type: Text-to-image model
License: MIT License
Website: riffusion.com
Generated spectrogram from the prompt "bossa nova with electric guitar" (top), and the resulting audio after conversion (bottom)

Riffusion is a neural network, designed by Seth Forsgren and Hayk Martiros, that generates music using images of sound rather than audio.[1]
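
The "images of sound" in question are spectrograms: time-frequency representations of audio rendered as pictures. As a rough illustration of that idea only (not Riffusion's own code; the mel-spectrogram settings and the normalization below are assumptions), a short clip can be turned into such an image with librosa:

    # Sketch: render a short audio clip as a grayscale spectrogram image
    # ("an image of sound"). Parameters are illustrative and do not
    # reproduce Riffusion's exact settings.
    import numpy as np
    import librosa
    from PIL import Image

    def audio_to_spectrogram_image(path, sr=44100, n_fft=2048,
                                   hop_length=512, n_mels=512):
        y, sr = librosa.load(path, sr=sr, mono=True)
        mel = librosa.feature.melspectrogram(y=y, sr=sr, n_fft=n_fft,
                                             hop_length=hop_length,
                                             n_mels=n_mels)
        db = librosa.power_to_db(mel, ref=np.max)            # log scale (dB)
        img = 255 * (db - db.min()) / (db.max() - db.min())  # map to 0-255
        # Flip rows so low frequencies sit at the bottom of the picture.
        return Image.fromarray(img.astype(np.uint8)[::-1])

    audio_to_spectrogram_image("clip.wav").save("spectrogram.png")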

The resulting music has been described as "de otro mundo" (otherworldly),[2] though unlikely to replace human-made music.[2] The model was made available on December 15, 2022, with the code also freely available on GitHub.[3]

The first version of Riffusion was created as a fine-tuning of Stable Diffusion, an existing open-source model for generating images from text prompts, on spectrograms,[1] resulting in a model that used text prompts to generate image files, which could then be put through an inverse Fourier transform and converted into audio files.[3] While these files were only several seconds long, the model could also use the latent space between outputs to interpolate different files together[1][4] (using the img2img capabilities of Stable Diffusion).[5] It was one of many models derived from Stable Diffusion.[5]
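
Because an amplitude spectrogram discards phase information, turning a generated image back into sound requires a phase-reconstruction step such as the Griffin-Lim algorithm (which librosa applies inside mel_to_audio) on top of the inverse transform. The sketch below illustrates only that conversion step; the pixel-to-dB mapping and mel parameters are assumptions for illustration, not Riffusion's actual pipeline, which lives in the linked repository:

    # Sketch: recover audio from a grayscale mel-spectrogram image.
    # The dB range, mel settings, and use of Griffin-Lim for phase
    # recovery are assumptions, not Riffusion's exact conversion code.
    import numpy as np
    import librosa
    import soundfile as sf
    from PIL import Image

    def spectrogram_image_to_audio(path, sr=44100, n_fft=2048,
                                   hop_length=512, top_db=80.0):
        img = np.asarray(Image.open(path).convert("L"),
                         dtype=np.float32)[::-1]      # undo the vertical flip
        db = img / 255.0 * top_db - top_db            # pixel values -> dB
        power = librosa.db_to_power(db)               # dB -> mel power
        # Invert the mel filterbank, then estimate phase with Griffin-Lim.
        return librosa.feature.inverse.mel_to_audio(power, sr=sr, n_fft=n_fft,
                                                    hop_length=hop_length)

    sf.write("out.wav", spectrogram_image_to_audio("spectrogram.png"), 44100)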

In December 2022, Mubert[6] similarly used Stable Diffusion to turn descriptive text into music loops. In January 2023, Google published a paper on their own text-to-music generator called MusicLM.[7][8]

Forsgren and Martiros formed a startup, also called Riffusion, and raised $4 million in venture capital funding in October 2023.[9][10]

References

  1. ^ a b c Coldewey, Devin (December 15, 2022). "Try 'Riffusion,' an AI model that composes music by visualizing it".
  2. ^ a b Llano, Eutropio (December 15, 2022). "El generador de imágenes AI también puede producir música (con resultados de otro mundo)" [The AI image generator can also produce music (with otherworldly results)].
  3. ^ a b Nasi, Michele (December 15, 2022). "Riffusion: creare tracce audio con l'intelligenza artificiale" [Riffusion: creating audio tracks with artificial intelligence]. IlSoftware.it.
  4. ^ "Essayez "Riffusion", un modèle d'IA qui compose de la musique en la visualisant" [Try "Riffusion", an AI model that composes music by visualizing it]. December 15, 2022.
  5. ^ a b "文章に沿った楽曲を自動生成してくれるAI「Riffusion」登場、画像生成AI「Stable Diffusion」ベースで誰でも自由に利用可能" [AI "Riffusion" appears, automatically generating music to match text; based on the image-generation AI "Stable Diffusion" and free for anyone to use]. GIGAZINE. December 16, 2022.
  6. ^ "Mubert launches Text-to-Music interface – a completely new way to generate music from a single text prompt". December 21, 2022.
  7. ^ "MusicLM: Generating Music From Text". January 26, 2023.
  8. ^ "5 Reasons Google's MusicLM AI Text-to-Music App is Different". January 27, 2023.
  9. ^ Gal, Dr. Itay (February 10, 2025). "Free A.I. music creation platform launches, competing with Suno". The Jerusalem Post. Retrieved February 16, 2025.
  10. ^ Nuñez, Michael (January 30, 2025). "Riffusion's free AI music platform could be the Spotify of the future". VentureBeat. Retrieved February 16, 2025.


This artificial neural network-related article is a stub. You can help Wikipedia by expanding it.

This scientific software article is a stub. You can help Wikipedia by expanding it.
