[inference provider] Add wavespeed.ai as an inference provider#1424


Open
arabot777 wants to merge 41 commits into huggingface:main from arabot777:feat/wavespeedai

Conversation

@arabot777 commented May 5, 2025 (edited)

What’s in this PR
WaveSpeedAI is a high-performance AI image and video generation service platform offering industry-leading generation speeds. We would now like to be listed as an Inference Provider on the Hugging Face Hub.

The JS client integration was completed based on the inference-providers documentation and passed the tests. I am submitting the PR now and look forward to further communication with you.

Test

pnpm --filter @huggingface/inference test "test/InferenceClient.spec.ts" -t "^Wavespeed AI"

> @huggingface/inference@3.11.0 test /Users/shanliu/work/huggingface.js/packages/inference
> vitest run --config vitest.config.mts "test/InferenceClient.spec.ts"

 RUN  v0.34.6 /Users/shanliu/work/huggingface.js/packages/inference

 ✓ test/InferenceClient.spec.ts (104) 198160ms
   ✓ InferenceClient (104) 198160ms
     ✓ backward compatibility (1)
       ✓ works with old HfInference name
     ↓ HF Inference (49) [skipped]
     ↓ Fal AI (4) [skipped]
     ↓ Featherless (3) [skipped]
     ↓ Replicate (10) [skipped]
     ↓ SambaNova (3) [skipped]
     ↓ Together (4) [skipped]
     ↓ Nebius (3) [skipped]
     ↓ 3rd party providers (1) [skipped]
     ↓ Fireworks (2) [skipped]
     ↓ Hyperbolic (4) [skipped]
     ↓ Novita (2) [skipped]
     ↓ Black Forest Labs (2) [skipped]
     ↓ Cohere (2) [skipped]
     ↓ Cerebras (2) [skipped]
     ↓ Nscale (3) [skipped]
     ↓ Groq (2) [skipped]
     ↓ OVHcloud (4) [skipped]
     ✓ Wavespeed AI (5) 89033ms
       ✓ textToImage - wavespeed-ai/flux-schnell 89032ms
       ✓ textToImage - wavespeed-ai/flux-dev-lora 12369ms
       ✓ textToImage - wavespeed-ai/flux-dev-lora-ultra-fast 17936ms
       ✓ textToVideo - wavespeed-ai/wan-2.1/t2v-480p 79507ms
       ✓ imageToImage - wavespeed-ai/hidream-e1-full 74481ms

 Test Files  1 passed (1)
      Tests  5 passed | 103 skipped (108)
   Start at  14:33:17
   Duration  89.62s (transform 315ms, setup 14ms, collect 368ms, tests 89.03s, environment 0ms, prepare 74ms)

@SBrandeis (Contributor) left a comment:

Hello, thank you for your contribution!
The code is of great quality overall. I left a few comments regarding our code style.
Please make sure the client can be used to query your API for all supported tasks, and that the payloads match your API.
Thanks again!

Comment on lines 101 to 102
- [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)

Contributor:

Suggested change
- [HF Inference API (serverless)](https://huggingface.co/models?inference=warm&sort=trending)

Author:

It has been deleted.

hfModelId: "wavespeed-ai/wan-2.1/i2v-480p",
providerId: "wavespeed-ai/wan-2.1/i2v-480p",
status: "live",
task: "image-to-video",
Contributor:

this task is not supported in the client code - let's remove it for now

Author:

It has been deleted.

Comment on lines 1 to 13
import { InferenceOutputError } from "../lib/InferenceOutputError";
import { ImageToImageArgs } from "../tasks";
import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";
import { delay } from "../utils/delay";
import { omit } from "../utils/omit";
import { base64FromBytes } from "../utils/base64FromBytes";
import {
TaskProviderHelper,
TextToImageTaskHelper,
TextToVideoTaskHelper,
ImageToImageTaskHelper,
} from "./providerHelper";

Contributor:

We use import type when the import is only used as a type.

Suggested change
import { InferenceOutputError } from "../lib/InferenceOutputError";
import { ImageToImageArgs } from "../tasks";
import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";
import { delay } from "../utils/delay";
import { omit } from "../utils/omit";
import { base64FromBytes } from "../utils/base64FromBytes";
import {
	TaskProviderHelper,
	TextToImageTaskHelper,
	TextToVideoTaskHelper,
	ImageToImageTaskHelper,
} from "./providerHelper";
import { InferenceOutputError } from "../lib/InferenceOutputError";
import type { ImageToImageArgs } from "../tasks";
import type { BodyParams, HeaderParams, RequestArgs, UrlParams } from "../types";
import { delay } from "../utils/delay";
import { omit } from "../utils/omit";
import { base64FromBytes } from "../utils/base64FromBytes";
import type {
	TaskProviderHelper,
	TextToImageTaskHelper,
	TextToVideoTaskHelper,
	ImageToImageTaskHelper,
} from "./providerHelper";

Author:

Modified as suggested.

};
}

type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;
Contributor:

I'm not sure this type alias is needed, can we remove it?

Suggested change
type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

WaveSpeedAICommonResponse can be renamed to WaveSpeedAIResponse.

Author:

This type is needed and is already used in two places; it may well be used again in the future.
It follows the DRY (Don't Repeat Yourself) principle, provides better type safety (through default generic parameters), and makes the code more readable and maintainable.
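For context, the pattern under discussion can be sketched as follows (field shapes are illustrative assumptions, not the exact PR types): a default generic parameter lets the common call site omit the type argument, while other payload shapes can still reuse the same envelope.

```typescript
// Illustrative sketch (assumed field shapes, not the exact PR code):
// a generic response envelope with a default type parameter.
interface WaveSpeedAICommonResponse<T> {
  code: number;
  message: string;
  data: T;
}

interface WaveSpeedAITaskResponse {
  id: string;
  status: "created" | "processing" | "completed" | "failed";
  outputs?: string[];
}

// The alias defaults T, so the common case needs no type argument.
type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

const taskRes: WaveSpeedAIResponse = {
  code: 200,
  message: "ok",
  data: { id: "abc", status: "completed", outputs: ["https://example.com/out.png"] },
};

// ...while a different payload shape can still reuse the same envelope.
const urlRes: WaveSpeedAIResponse<{ resultUrl: string }> = {
  code: 200,
  message: "ok",
  data: { resultUrl: "https://example.com/result" },
};

console.log(taskRes.data.status, urlRes.data.resultUrl);
```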

Comment on lines 124 to 133
case "completed": {
// Get the video data from the first output URL
if (!taskResult.outputs?.[0]) {
throw new InferenceOutputError("No video URL in completed response");
}
const videoResponse = await fetch(taskResult.outputs[0]);
if (!videoResponse.ok) {
throw new InferenceOutputError("Failed to fetch video data");
}
return await videoResponse.blob();
Contributor:

From what I understand, the payload can be something other than a video (e.g. an image).
Let's update the error message to reflect that.

Author:

Yes, I revised it.
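For orientation, the polling flow being reviewed in this thread can be sketched roughly as below. Status values and the outputs/error fields follow the diff hunks above; the function name `pollForBlob`, the `getStatus` callback, and the interval are hypothetical stand-ins for the PR's helper methods.

```typescript
// Hedged sketch of the provider's polling loop (assumed helper shape).
interface TaskResult {
  status: "created" | "processing" | "completed" | "failed";
  outputs?: string[];
  error?: string;
}

const delay = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

async function pollForBlob(
  getStatus: () => Promise<TaskResult>,
  intervalMs = 500
): Promise<Blob> {
  for (;;) {
    const task = await getStatus();
    switch (task.status) {
      case "completed": {
        // The first output URL may point at an image OR a video,
        // hence the reviewer's request for a media-agnostic error message.
        if (!task.outputs?.[0]) throw new Error("No output URL in completed response");
        const res = await fetch(task.outputs[0]);
        if (!res.ok) throw new Error("Failed to fetch output data");
        return await res.blob();
      }
      case "failed":
        throw new Error(task.error || "Task failed");
      default:
        // "created" / "processing": wait and poll again.
        await delay(intervalMs);
    }
  }
}
```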

Comment on lines 170 to 192
if (!args.parameters) {
return {
...args,
model: args.model,
data: args.inputs,
};
} else {
return {
...args,
inputs: base64FromBytes(
new Uint8Array(args.inputs instanceof ArrayBuffer ? args.inputs : await (args.inputs as Blob).arrayBuffer())
),
};
}
}

override preparePayload(params: BodyParams): Record<string, unknown> {
return {
...omit(params.args, ["inputs", "parameters"]),
...(params.args.parameters as Record<string, unknown>),
image: params.args.inputs,
};
}
Contributor:

I think only one of the two (preparePayload or preparePayloadAsync) should be responsible for building the payload; meaning, I'd rather move the rename of inputs to image into preparePayloadAsync and have preparePayload as dumb as possible.

cc @hanouticelina - would love your opinion on that specific point
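A minimal sketch of that division of labor (names preparePayload/preparePayloadAsync follow the PR; the argument and BodyParams shapes, and the use of Node's Buffer for base64, are assumptions for illustration):

```typescript
// Sketch of the suggested split (assumed shapes, not the actual PR code):
// preparePayloadAsync owns the transformation -- base64-encode the binary
// input and rename it to `image` -- while preparePayload stays a pass-through.
interface BodyParams {
  args: Record<string, unknown>;
}

async function preparePayloadAsync(args: {
  inputs: Blob | ArrayBuffer;
  parameters?: Record<string, unknown>;
}): Promise<Record<string, unknown>> {
  const bytes = new Uint8Array(
    args.inputs instanceof ArrayBuffer ? args.inputs : await args.inputs.arrayBuffer()
  );
  return {
    ...args.parameters,
    image: Buffer.from(bytes).toString("base64"), // the rename happens here, once
  };
}

function preparePayload(params: BodyParams): Record<string, unknown> {
  // "as dumb as possible": no renaming, no encoding
  return { ...params.args };
}

preparePayloadAsync({ inputs: new Uint8Array([1, 2, 3]).buffer }).then((payload) =>
  console.log(payload.image)
);
```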

Author:

I only kept the preparePayloadAsync function.

Contributor:

> I think only one of the two (preparePayload or preparePayloadAsync) should be responsible for building the payload; meaning, I'd rather move the rename of inputs to image into preparePayloadAsync and have preparePayload as dumb as possible.

yes agree!

Comment on lines 179 to 181
inputs: base64FromBytes(
new Uint8Array(args.inputs instanceof ArrayBuffer ? args.inputs : await (args.inputs as Blob).arrayBuffer())
),
Contributor:

Does the wavespeed API support base64-encoded images as inputs?

Author:

yes

@hanouticelina added the inference-providers (integration of a new or existing Inference Provider) label on May 20, 2025
@arabot777 requested a review from SBrandeis on May 20, 2025 at 13:44
@hanouticelina (Contributor) left a comment:

Thank you @arabot777 for the PR! I left some minor comments. I tested the 3 tasks supported by Wavespeed.ai and it works as expected with the changes I suggested.

@SBrandeis (Contributor) left a comment:

Second round of code review, thank you! We're getting there.

Note: make sure you run pnpm format and pnpm lint to conform to our code style.

Comment on lines 16 to 24
/**
* Common response structure for all WaveSpeed AI API responses
*/
interface WaveSpeedAICommonResponse<T> {
code: number;
message: string;
data: T;
}

Contributor:

This abstraction is not necessary IMO, let's remove it (see my other comment)

Suggested change
/**
 * Common response structure for all WaveSpeed AI API responses
 */
interface WaveSpeedAICommonResponse<T> {
	code: number;
	message: string;
	data: T;
}

Author:

It has been modified as suggested

Comment on lines 55 to 56
type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;

Contributor:

Following the previous comment - let's remove one level of abstraction

Suggested change
type WaveSpeedAIResponse<T = WaveSpeedAITaskResponse> = WaveSpeedAICommonResponse<T>;
interface WaveSpeedAIResponse {
	code: number;
	message: string;
	data: WaveSpeedAITaskResponse;
}

Author:

It has been modified as suggested

Comment on lines 67 to 73
preparePayload(params: BodyParams): Record<string, unknown> {
const payload: Record<string, unknown> = {
...omit(params.args, ["inputs", "parameters"]),
...(params.args.parameters as Record<string, unknown>),
prompt: params.args.inputs,
};
// Add LoRA support if adapter is specified in the mapping
Contributor:

We don't need to cast into Record<string, unknown> if the params have the proper type.
ImageToImageArgs, TextToImageArgs, and TextToVideoArgs need to be imported from "../tasks".

Suggested change
preparePayload(params: BodyParams): Record<string, unknown> {
	const payload: Record<string, unknown> = {
		...omit(params.args, ["inputs", "parameters"]),
		...(params.args.parameters as Record<string, unknown>),
		prompt: params.args.inputs,
	};
	// Add LoRA support if adapter is specified in the mapping
preparePayload(params: BodyParams<ImageToImageArgs | TextToImageArgs | TextToVideoArgs>): Record<string, unknown> {
	const payload: Record<string, unknown> = {
		...omit(params.args, ["inputs", "parameters"]),
		...params.args.parameters,
		prompt: params.args.inputs,
	};
	// Add LoRA support if adapter is specified in the mapping

Author:

It has been modified as suggested

if (params.mapping?.adapter === "lora" && params.mapping.adapterWeightsPath) {
payload.loras = [
{
path: params.mapping.adapterWeightsPath,
Contributor:

For reference, adapterWeightsPath is the path to the LoRA weights inside the associated HF repo.
E.g., for nerijs/pixel-art-xl, it will be "pixel-art-xl.safetensors".

Let's make sure that is indeed what your API is expecting when running LoRAs.

Author:

Here I see that for fal, the endpoint is concatenated with the HF path.
Can I directly set adapterWeightsPath to a LoRA HTTP address, or any other address?

Author:

In the test cases, I conducted the test in this way: the adapterWeightsPath was passed directly as the LoRA input parameter.

"wavespeed-ai/flux-dev-lora": {
	hfModelId: "wavespeed-ai/flux-dev-lora",
	providerId: "wavespeed-ai/flux-dev-lora",
	status: "live",
	task: "text-to-image",
	adapter: "lora",
	adapterWeightsPath: "https://d32s1zkpjdc4b1.cloudfront.net/predictions/599f3739f5354afc8a76a12042736bfd/1.safetensors",
},
"wavespeed-ai/flux-dev-lora-ultra-fast": {
	hfModelId: "wavespeed-ai/flux-dev-lora-ultra-fast",
	providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
	status: "live",
	task: "text-to-image",
	adapter: "lora",
	adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
},

In WaveSpeed AI the task is: (screenshots omitted)

However, I'm not sure whether the LoRA input parameter submitted by HF must be the abbreviated file path within the HF model repo, concatenated with the HF address in the code. If that is the specification, I can implement it in the same format as fal.

Contributor:

I think your API can just take the hf model id as the loras path, right?

Suggested change
path: params.mapping.adapterWeightsPath,
path: params.mapping.hfModelId,

As mentioned by @SBrandeis, this part depends on what your API is expecting as inputs when using LoRA weights.

Author:

Yes, you're correct.
In the example, linoyts/yarn_art_Flux_LoRA is the HF LoRA model address. We will automatically match and download the HF model.

Author:

I completed the modification and ran the test case successfully.

Comment on lines 85 to 92
override prepareHeaders(params: HeaderParams, isBinary: boolean): Record<string, string> {
this.accessToken = params.accessToken;
const headers: Record<string, string> = { Authorization: `Bearer ${params.accessToken}` };
if (!isBinary) {
headers["Content-Type"] = "application/json";
}
return headers;
}
Contributor:

This is the same behavior as the blanket implementation here:
https://github.com/arabot777/huggingface.js/blob/f706e02d6128f559bd5551072344ff6e31b9c4be/packages/inference/src/providers/providerHelper.ts#L114-L124

No need for an override IMO

Suggested change
override prepareHeaders(params: HeaderParams, isBinary: boolean): Record<string, string> {
	this.accessToken = params.accessToken;
	const headers: Record<string, string> = { Authorization: `Bearer ${params.accessToken}` };
	if (!isBinary) {
		headers["Content-Type"] = "application/json";
	}
	return headers;
}

@arabot777 (Author) commented May 22, 2025 (edited):

I removed this part of the logic at the beginning. However, the getResponse method of imageToImage.ts did not pass in header information. (screenshot omitted)

I had to override prepareHeaders here and assign this.accessToken = params.accessToken to ensure that the complete access-key information from the header is passed along when calling getResponse. (screenshot omitted)

Contributor:

I'd rather update imageToImage to be able to pass headers to getResponse:

export async function imageToImage(args: ImageToImageArgs, options?: Options): Promise<Blob> {
	const provider = await resolveProvider(args.provider, args.model, args.endpointUrl);
	const providerHelper = getProviderHelper(provider, "image-to-image");
	const payload = await providerHelper.preparePayloadAsync(args);
	const { data: res } = await innerRequest<Blob>(payload, providerHelper, {
		...options,
		task: "image-to-image",
	});
	const { url, info } = await makeRequestOptions(args, providerHelper, { ...options, task: "image-to-image" });
	return providerHelper.getResponse(res, url, info.headers as Record<string, string>);
}

rather than overriding prepareHeaders and doing this.accessToken = params.accessToken.

Author:

Your suggestion makes sense. Initially, this was a common/public function, so I took a minimalistic approach and didn't modify it. Now, let me try making some changes here.

Author:

I completed the modification and ran the test case successfully.

@hanouticelina (Contributor) left a comment:

Thanks @arabot777 for the iteration. I left a few comments, but we're almost at something merge-ready!

Comment on lines 85 to 92
override prepareHeaders(params: HeaderParams, isBinary: boolean): Record<string, string> {
this.accessToken = params.accessToken;
const headers: Record<string, string> = { Authorization: `Bearer ${params.accessToken}` };
if (!isBinary) {
headers["Content-Type"] = "application/json";
}
return headers;
}
Contributor:

I'd rather update imageToImage to be able to pass headers to getResponse:

export async function imageToImage(args: ImageToImageArgs, options?: Options): Promise<Blob> {
	const provider = await resolveProvider(args.provider, args.model, args.endpointUrl);
	const providerHelper = getProviderHelper(provider, "image-to-image");
	const payload = await providerHelper.preparePayloadAsync(args);
	const { data: res } = await innerRequest<Blob>(payload, providerHelper, {
		...options,
		task: "image-to-image",
	});
	const { url, info } = await makeRequestOptions(args, providerHelper, { ...options, task: "image-to-image" });
	return providerHelper.getResponse(res, url, info.headers as Record<string, string>);
}

rather than overriding prepareHeaders and doing this.accessToken = params.accessToken.

@arabot777 (Author):

@hanouticelina Thank you for your suggestion. I completed the modification and ran the test case successfully.

@arabot777 (Author) commented May 29, 2025 (edited):

Hi, the code has been updated per the review comments. I'd appreciate it if you could verify the changes and point out any remaining concerns for us to address. Thanks @SBrandeis!

@HuggingFaceDocBuilderDev:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@hanouticelina (Contributor) left a comment:

Hi @arabot777, we recently merged an improvement of error handling for inference (PR: #1504). I've added suggestions on how to incorporate it into the WaveSpeed AI inference provider implementation.
Other than that, the PR looks good to me, but let's wait for @SBrandeis' final review!

headers?: Record<string, string>
): Promise<Blob> {
if (!headers) {
throw new InferenceOutputError("Headers are required for WaveSpeed AI API calls");
Contributor:

Suggested change
throw new InferenceOutputError("Headers are required for WaveSpeed AI API calls");
throw new InferenceClientInputError("Headers are required for WaveSpeed AI API calls");

const resultResponse = await fetch(resultUrl, { headers });

if (!resultResponse.ok) {
throw new InferenceOutputError(`Failed to get result: ${resultResponse.statusText}`);
Contributor:

Suggested change
throw new InferenceOutputError(`Failed to get result: ${resultResponse.statusText}`);
throw new InferenceClientProviderApiError(
	"Failed to fetch response status from WaveSpeed AI API",
	{ url: resultUrl, method: "GET" },
	{
		requestId: resultResponse.headers.get("x-request-id") ?? "",
		body: await resultResponse.text(),
	}
);


const result: WaveSpeedAIResponse = await resultResponse.json();
if (result.code !== 200) {
throw new InferenceOutputError(`API request failed with code ${result.code}: ${result.message}`);
Contributor:

Suggested change
throw new InferenceOutputError(`API request failed with code ${result.code}: ${result.message}`);
throw new InferenceClientProviderOutputError(`API request to WaveSpeed AI API failed with code ${result.code}: ${result.message}`);

case "completed": {
// Get the media data from the first output URL
if (!taskResult.outputs?.[0]) {
throw new InferenceOutputError("No output URL in completed response");
Contributor:

Suggested change
throw new InferenceOutputError("No output URL in completed response");
throw new InferenceClientProviderOutputError("Received malformed response from WaveSpeed AI API: No output URL in completed response");

}
const mediaResponse = await fetch(taskResult.outputs[0]);
if (!mediaResponse.ok) {
throw new InferenceOutputError("Failed to fetch output data");
Contributor:

Suggested change
throw new InferenceOutputError("Failed to fetch output data");
throw new InferenceClientProviderApiError(
	"Failed to fetch response status from WaveSpeed AI API",
	{ url: taskResult.outputs[0], method: "GET" },
	{
		requestId: mediaResponse.headers.get("x-request-id") ?? "",
		body: await mediaResponse.text(),
	}
);

return await mediaResponse.blob();
}
case "failed": {
throw new InferenceOutputError(taskResult.error || "Task failed");
Contributor:

Suggested change
throw new InferenceOutputError(taskResult.error || "Task failed");
throw new InferenceClientProviderOutputError(taskResult.error || "Task failed");

continue;

default: {
throw new InferenceOutputError(`Unknown status: ${taskResult.status}`);
Contributor:

Suggested change
throw new InferenceOutputError(`Unknown status: ${taskResult.status}`);
throw new InferenceClientProviderOutputError(`Unknown status: ${taskResult.status}`);

@arabot777 (Author):

> Hi @arabot777, we recently merged an improvement of error handling for inference (PR: #1504). I've added suggestions on how to incorporate it into the WaveSpeed AI inference provider implementation. Other than that, the PR looks good to me, but let's wait for @SBrandeis' final review!

Thank you for the reminder. I have completed the new error handling.

@arabot777 (Author):

Hi @SBrandeis,

Just checking in: is there anything I can do to help move this PR forward? Let me know if you'd like any changes or have questions. Thanks for your time!

@SBrandeis (Contributor) left a comment:

Looks good - just a few minor comments to address.
Let's merge soon and proceed with the next steps: https://huggingface.co/docs/inference-providers/register-as-a-provider

Comment on lines 120 to 125
const result: WaveSpeedAIResponse = await resultResponse.json();
if (result.code !== 200) {
throw new InferenceClientProviderOutputError(
`API request to WaveSpeed AI API failed with code ${result.code}: ${result.message}`
);
}
Contributor:

Already covered by the previous check on resultResponse.

Ref: https://developer.mozilla.org/en-US/docs/Web/API/Response/ok

Suggested change
const result: WaveSpeedAIResponse = await resultResponse.json();
if (result.code !== 200) {
	throw new InferenceClientProviderOutputError(
		`API request to WaveSpeed AI API failed with code ${result.code}: ${result.message}`
	);
}
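For reference, Response.ok is true exactly when the HTTP status is in the 200-299 range, which is why a second body-level code check is redundant once ok has been verified (assuming, as here, that the API mirrors the HTTP status in its JSON body). A quick illustration:

```typescript
// Response.ok reflects the 2xx status range, so after checking
// resultResponse.ok there is no need to re-check a 200 code in the body.
const okRes = new Response("{}", { status: 201 });
const errRes = new Response("{}", { status: 500 });
console.log(okRes.ok, errRes.ok); // true for 2xx, false otherwise
```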

const mediaResponse = await fetch(taskResult.outputs[0]);
if (!mediaResponse.ok) {
throw new InferenceClientProviderApiError(
"Failed to fetch response status from WaveSpeed AI API",
Contributor:

Suggested change
"Failed to fetch response status from WaveSpeed AI API",
"Failed to fetch generation output from WaveSpeed AI API",

Comment on lines 2083 to 2118
HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {
"wavespeed-ai/flux-schnell": {
hfModelId: "wavespeed-ai/flux-schnell",
providerId: "wavespeed-ai/flux-schnell",
status: "live",
task: "text-to-image",
},
"wavespeed-ai/wan-2.1/t2v-480p": {
hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",
providerId: "wavespeed-ai/wan-2.1/t2v-480p",
status: "live",
task: "text-to-video",
},
"wavespeed-ai/hidream-e1-full": {
hfModelId: "wavespeed-ai/hidream-e1-full",
providerId: "wavespeed-ai/hidream-e1-full",
status: "live",
task: "image-to-image",
},
"openfree/flux-chatgpt-ghibli-lora": {
hfModelId: "openfree/flux-chatgpt-ghibli-lora",
providerId: "wavespeed-ai/flux-dev-lora",
status: "live",
task: "text-to-image",
adapter: "lora",
adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",
},
"linoyts/yarn_art_Flux_LoRA": {
hfModelId: "linoyts/yarn_art_Flux_LoRA",
providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
status: "live",
task: "text-to-image",
adapter: "lora",
adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
},
};
Contributor

In order to reflect how mappings will work when deployed live, you need to:

  • add a provider field to the mapping
  • use the HF model IDs as keys
Suggested change
HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {
	"wavespeed-ai/flux-schnell": {
		hfModelId: "wavespeed-ai/flux-schnell",
		providerId: "wavespeed-ai/flux-schnell",
		status: "live",
		task: "text-to-image",
	},
	"wavespeed-ai/wan-2.1/t2v-480p": {
		hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",
		providerId: "wavespeed-ai/wan-2.1/t2v-480p",
		status: "live",
		task: "text-to-video",
	},
	"wavespeed-ai/hidream-e1-full": {
		hfModelId: "wavespeed-ai/hidream-e1-full",
		providerId: "wavespeed-ai/hidream-e1-full",
		status: "live",
		task: "image-to-image",
	},
	"openfree/flux-chatgpt-ghibli-lora": {
		hfModelId: "openfree/flux-chatgpt-ghibli-lora",
		providerId: "wavespeed-ai/flux-dev-lora",
		status: "live",
		task: "text-to-image",
		adapter: "lora",
		adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",
	},
	"linoyts/yarn_art_Flux_LoRA": {
		hfModelId: "linoyts/yarn_art_Flux_LoRA",
		providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
		status: "live",
		task: "text-to-image",
		adapter: "lora",
		adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
	},
};
HARDCODED_MODEL_INFERENCE_MAPPING["wavespeed-ai"] = {
	"black-forest-labs/FLUX.1-schnell": {
		provider: "wavespeed-ai",
		hfModelId: "wavespeed-ai/flux-schnell",
		providerId: "wavespeed-ai/flux-schnell",
		status: "live",
		task: "text-to-image",
	},
	"Wan-AI/Wan2.1-T2V-14B": {
		provider: "wavespeed-ai",
		hfModelId: "wavespeed-ai/wan-2.1/t2v-480p",
		providerId: "wavespeed-ai/wan-2.1/t2v-480p",
		status: "live",
		task: "text-to-video",
	},
	"HiDream-ai/HiDream-E1-Full": {
		provider: "wavespeed-ai",
		hfModelId: "wavespeed-ai/hidream-e1-full",
		providerId: "wavespeed-ai/hidream-e1-full",
		status: "live",
		task: "image-to-image",
	},
	"openfree/flux-chatgpt-ghibli-lora": {
		provider: "wavespeed-ai",
		hfModelId: "openfree/flux-chatgpt-ghibli-lora",
		providerId: "wavespeed-ai/flux-dev-lora",
		status: "live",
		task: "text-to-image",
		adapter: "lora",
		adapterWeightsPath: "openfree/flux-chatgpt-ghibli-lora",
	},
	"linoyts/yarn_art_Flux_LoRA": {
		provider: "wavespeed-ai",
		hfModelId: "linoyts/yarn_art_Flux_LoRA",
		providerId: "wavespeed-ai/flux-dev-lora-ultra-fast",
		status: "live",
		task: "text-to-image",
		adapter: "lora",
		adapterWeightsPath: "linoyts/yarn_art_Flux_LoRA",
	},
};


it(`textToImage - wavespeed-ai/flux-schnell`, async () => {
const res = await client.textToImage({
model: "wavespeed-ai/flux-schnell",
Contributor

Following my previous comment, the model IDs used here need to match the keys in the HARDCODED_MODEL_INFERENCE_MAPPING (which are the HF model IDs).

Suggested change
model:"wavespeed-ai/flux-schnell",
model:"black-forest-labs/FLUX.1-schnell",

@arabot777
Author

@SBrandeis Thanks for your review and help with the PR! I've addressed all the comments—could you take a look and merge when you get a chance? Really appreciate your time and insights.

@arabot777
Author

@SBrandeis @hanouticelina Hello, it's been a few weeks. Could you follow up on the progress?

Reviewers

@hanouticelina left review comments

@SBrandeis approved these changes

@julien-c: awaiting requested review (code owner)

Assignees

@SBrandeis

Labels
inference-providers: integration of a new or existing Inference Provider
Projects
None yet
Milestone
No milestone
4 participants
@arabot777, @HuggingFaceDocBuilderDev, @SBrandeis, @hanouticelina
