Movatterモバイル変換

roboflow/inferencePublic

NotificationsYou must be signed in to change notification settings
Fork191
Star1.8k

Turn any computer or edge device into a command center for your computer vision projects.

inference.roboflow.com

License

Unknown, Apache-2.0 licenses found

Licenses found

1.8k stars 191 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 7,830 Commits
.github		.github
.release/pypi		.release/pypi
app_bundles		app_bundles
build_scripts		build_scripts
development		development
docker		docker
docs		docs
examples		examples
inference		inference
inference_cli		inference_cli
inference_experimental		inference_experimental
inference_sdk		inference_sdk
requirements		requirements
signatures/version1		signatures/version1
tests		tests
theme		theme
.actrc		.actrc
.dockerignore		.dockerignore
.gitignore		.gitignore
.isort.cfg		.isort.cfg
AGENTS.md		AGENTS.md
CITATION.cff		CITATION.cff
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
LICENSE.core		LICENSE.core
Makefile		Makefile
README.md		README.md
banner.png		banner.png
debugrun.py		debugrun.py
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
setup.py		setup.py

Repository files navigation

notebooks |supervision |autodistill |maestro

Make Any Camera an AI Camera

Inference turns any computer or edge device into a command center for your computer vision projects.

🛠️ Self-hostyour own fine-tuned models
🧠 Access the latest and greatest foundation models (likeFlorence-2,CLIP, andSAM2)
🤝 UseWorkflows to track, count, time, measure, and visualize
👁️ Combine ML with traditional CV methods (like OCR, Barcode Reading, QR, and template matching)
📈 Monitor, record, and analyze predictions
🎥Manage cameras and video streams
📬 Send notifications when events happen
🛜 Connect with external systems and APIs
🔗Extend with your own code and models
🚀 Deploy production systems at scale

SeeExample Workflows for common use-cases like detecting small objects with SAHI, multi-model consensus, active learning, reading license plates, blurring faces, background removal, and more.

time-in-zone.mp4

🔥 quickstart

Install Docker (andNVIDIA Container Toolkitfor GPU acceleration if you have a CUDA-enabled GPU). Then run

pip install inference-cli && inference server start --dev

This will pull the proper image for your machine and start it in development mode.

In development mode, a Jupyter notebook server with a quickstart guide runs onhttp://localhost:9001/notebook/start. Dive in there for a whirlwind tourof your new Inference Server's functionality!

Now you're ready to connect your camera streams andstart building & deploying Workflows in the UIorinteracting with your new servervia its API.

🛠️ build with Workflows

A key component of Inference isWorkflows, composable blocks of common functionality that give models a common interface to make chaining and experimentation easy.

With Workflows, you can:

Detect, classify, and segment objects in images using state-of-the-art models.
Use Large Multimodal Models (LMMs) to make determinations at any stage in a workflow.
Seamlessly swap out models for a given task.
Chain models together.
Track, count, time, measure, and visualize objects.
Add business logic and extend functionality to work with your external systems.

Workflows allow you to extend simple model predictions to build computer vision micro-services that fit into a larger application or fully self-contained visual agents that run on a video stream.

Learn more, readthe Workflows docs, orstart building.

	Tutorial: Build an AI-Powered Self-Serve Checkout Created: 2 Feb 2025 Make a computer vision app that identifies different pieces of hardware, calculates the total cost, and records the results to a database.
	Tutorial: Intro to Workflows Created: 6 Jan 2025 Learn how to build and deploy Workflows for common use-cases like detecting vehicles, filtering detections, visualizing results, and calculating dwell time on a live video stream.
	Tutorial: Build a Smart Parking System Created: 27 Nov 2024 Build a smart parking lot management system using Roboflow Workflows! This tutorial covers license plate detection with YOLOv8, object tracking with ByteTrack, and real-time notifications with a Telegram bot.

📟 connecting via api

Once you've installed Inference, your machine is a fully-featured CV center.You can use its API to run models and workflows on images and video streams.By default, the server is running locally onlocalhost:9001.

To interface with your server via Python, use our SDK:

pip install inference-sdk

Then runan example model comparison Workflowlike this:

frominference_sdkimportInferenceHTTPClientclient=InferenceHTTPClient(api_url="http://localhost:9001",# use local inference server# api_key="<YOUR API KEY>" # optional to access your private data and models)result=client.run_workflow(workspace_name="roboflow-docs",workflow_id="model-comparison",images={"image":"https://media.roboflow.com/workflows/examples/bleachers.jpg"    },parameters={"model1":"yolov8n-640","model2":"yolov11n-640"    })print(result)

In other languages, use the server's REST API;you can access the API docs for your server at/docs (OpenAPI format) or/redoc (Redoc Format).

Check outthe inference_sdk docsto see what else you can do with your new server.

🎥 connect to video streams

The inference server is a video processing beast. You can set it up to runWorkflows on RTSP streams, webcam devices, and more. It will handle hardwareacceleration, multiprocessing, video decoding and GPU batching to get themost out of your hardware.

This example workflowwill watch a stream for frames thatCLIP thinks match aninputted text prompt.

frominference_sdkimportInferenceHTTPClientimportatexitimporttimemax_fps=4client=InferenceHTTPClient(api_url="http://localhost:9001",# use local inference server# api_key="<YOUR API KEY>" # optional to access your private data and models)# Start a stream on an rtsp streamresult=client.start_inference_pipeline_with_workflow(video_reference=["rtsp://user:password@192.168.0.100:554/"],workspace_name="roboflow-docs",workflow_id="clip-frames",max_fps=max_fps,workflows_parameters={"prompt":"blurry",# change to look for something else"threshold":0.16    })pipeline_id=result["context"]["pipeline_id"]# Terminate the pipeline when the script exitsatexit.register(lambda:client.terminate_inference_pipeline(pipeline_id))whileTrue:result=client.consume_inference_pipeline_result(pipeline_id=pipeline_id)ifnotresult["outputs"]ornotresult["outputs"][0]:# still initializingcontinueoutput=result["outputs"][0]is_match=output.get("is_match")similarity=round(output.get("similarity")*100,1)print(f"Matches prompt?{is_match} (similarity:{similarity}%)")time.sleep(1/max_fps)

Pipeline outputs can be consumed via API for downstream processing or theWorkflow can be configured to call external services with Notification blocks(likeEmailorTwilio)or theWebhook block.For more info on video pipeline management, see theVideo Processing overview.

If you have a Roboflow account & have linked an API key, you can also remotelymonitor and manage your running streamsvia the Roboflow UI.

🔑 connect to the cloud

Without an API Key, you can access a wide range of pre-trained and foundational models and run public Workflows.

Pass an optionalRoboflow API Key to theinference_sdk or API to access additional features enhanced by Roboflow's Cloudplatform. When running with an API Key, usage is metered according toRoboflow'spricing tiers.

	Open Access	With API Key (Metered)
Pre-Trained Models	✅	✅
Foundation Models	✅	✅
Video Stream Management	✅	✅
Dynamic Python Blocks	✅	✅
Public Workflows	✅	✅
Private Workflows		✅
Fine-Tuned Models		✅
Universe Models		✅
Active Learning		✅
Serverless Hosted API		✅
Dedicated Deployments		✅
Commercial Model Licensing		Paid
Device Management		Enterprise
Model Monitoring		Enterprise

🌩️ hosted compute

If you don't want to manage your own infrastructure for self-hosting, Roboflow offers a hosted Inference Server viaone-click Dedicated Deployments (CPU and GPU machines) billed hourly, or simple models and Workflows via ourserverless Hosted API billed per API-call.

We offer agenerous free-tier to get started.

🖥️ run on-prem or self-hosted

Inference is designed to run on a wide range of hardware from beefy cloud servers to tiny edge devices. This lets you easily develop against your local machine or our cloud infrastructure and then seamlessly switch to another device for production deployment.

inference server start attempts to automatically choose the optimal container to optimize performance on your machine (including with GPU acceleration via NVIDIA CUDA when available). Special installation notes and performance tips by device are listed below:

⭐️ New: Enterprise Hardware

For manufacturing and logistics use-cases Roboflow now offersthe NVIDIA Jetson-based Flowbox, a ruggedized CV center pre-configured with Inference and optimized for running in secure networks. It has integrated support for machine vision cameras like Basler and Lucid over GigE, supports interfacing with PLCs and HMIs via OPC or MQTT, enables enterprise device management through a DMZ, and comes with the support of our team of computer vision experts to ensure your project is a success.

📚 documentation

Visit ourdocumentation to explore comprehensive guides, detailed API references, and a wide array of tutorials designed to help you harness the full potential of the Inference package.

© license

The core of Inference is licensed under Apache 2.0.

Models are subject to licensing which respects the underlying architecture. These licenses are listed ininference/models. Paid Roboflow accounts include a commercial license for some models (seeroboflow.com/licensing for details).

Cloud connected functionality (like our model and Workflows registries, dataset management, model monitoring, device management, and managed infrastructure) requires a Roboflow account and API key & is metered based on usage.

Enterprise functionality is source-available ininference/enterprise under anenterprise license and usage in production requires an active Enterprise contract in good standing.

See the "Self Hosting and Edge Deployment" section of theRoboflow Licensing documentation for more information on how Roboflow Inference is licensed.