- Notifications
You must be signed in to change notification settings - Fork539
feat(local-apps): Add Nexa SDK integration#1800
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.
Already on GitHub?Sign in to your account
Open
mengshengwu wants to merge2 commits intohuggingface:mainChoose a base branch frommengshengwu:feat/nexa-sdk
base:main
Could not load branches
Branch not found:{{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline, and old review comments may become outdated.
Uh oh!
There was an error while loading.Please reload this page.
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
…lation instructions
Uh oh!
There was an error while loading.Please reload this page.
zhiyuan8 commentedNov 1, 2025
@SBrandeis@Wauplin@julien-c@pcuenca look forward to your review feedbacks! |
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading.Please reload this page.
Introduction
Hello! Friends from Huggingface 👋
Thank you for maintaining this amazing library and ecosystem.
Following a recent discussion with Hugging Face (seehere), your CEO expressed interest in exploring collaboration to bringon-device model inference to the community. This PR proposes addingNexa SDK to the local apps section as a new runtime integration option.
About Nexa SDK
Nexa SDK is an on-device inference framework that runs any model on any device, across any backend. It runs on CPUs, GPUs, NPUs with backend support for CUDA, Metal, Vulkan, and Qualcomm / Intel / AMD NPU. It handles multiple input modalities including text 📝, image 🖼️, and audio 🎧. The SDK includes an OpenAI-compatible API server with support for JSON schema-based function calling and streaming. It supports model formats such as GGUF, MLX, Nexa AI's own
.nexaformat, enabling efficient quantized inference across diverse platforms.Example Demos
🖼️ Multi-Image Reasoning
Spot the difference across two images in multi-round dialogue.
🎤 Image + Text → Function Call
Snap a poster, add a voice note, and the AI agent creates a calendar event locally.
🎶 Multi-Audio Comparison
Identify differences between two music clips — fully offline.
Related Links
Logo File
Nexa-Logo-Black
Thank you for your time reviewing this PR!
We’re excited to explore how Nexa SDK can extend Hugging Face models to mobile and edge platforms. 🙏