DEV Community
Mayank Laddha
My take on the Agentic Object Detection

Here are the steps:

  • Segmenting everything with SAM: We segment every object in the image first and worry about filtering later.

  • Filtering with CLIP: Once everything is segmented, we don't want all of the segments. We use CLIP to filter out the noise and keep only the relevant objects.

  • Adding reasoning with a model like GPT-4o: With the objects segmented and filtered, we still need to finalise the detections and make sense of them. That's where a strong LLM like GPT-4o comes in.
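
The filtering step above can be sketched roughly as follows. This is a minimal illustration, not the code from the linked repository: it assumes each SAM segment has already been cropped and embedded (for example with CLIP's image encoder) and that the text query has been embedded the same way. The `Segment` class, `filter_segments`, and the `threshold` value are hypothetical names and numbers chosen for the example.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class Segment:
    # Hypothetical container: a bounding box (x, y, w, h) for one segment
    # plus an embedding of its crop (assumed to come from an image encoder).
    box: Tuple[int, int, int, int]
    embedding: Sequence[float]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_segments(
    segments: List[Segment],
    text_embedding: Sequence[float],
    threshold: float = 0.25,  # assumed cut-off; tune it on your own data
) -> List[Tuple[float, Segment]]:
    """Keep segments whose similarity to the text query meets the threshold,
    returned as (score, segment) pairs sorted from best to worst match."""
    scored = [(cosine(s.embedding, text_embedding), s) for s in segments]
    kept = [(score, s) for score, s in scored if score >= threshold]
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return kept
```

Segmenting everything first and filtering afterwards keeps the two concerns separate: the segmenter never needs to know the query, and the threshold can be adjusted without re-running segmentation.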

Here is what I did with SAM and CLIP. The next step is to put a good LLM on top to add some reasoning.
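
One possible shape for that reasoning step is sketched below. It is an assumption about how an LLM could be layered on top, not something taken from the repository: the filtered candidates are described in a text prompt, a caller-supplied `ask_llm` function (hypothetical, standing in for a call to a model such as GPT-4o) returns a reply, and the reply is parsed into candidate indices. A real setup would more likely send the image crops to a multimodal model; text descriptions are used here only to keep the sketch self-contained.

```python
from typing import Callable, List, Tuple

# A candidate is (box as (x, y, w, h), similarity score from the filter step).
Candidate = Tuple[Tuple[int, int, int, int], float]


def build_prompt(query: str, candidates: List[Candidate]) -> str:
    """Describe the filtered candidates so the model can pick among them."""
    lines = [f"Query: {query}", "Candidates (index: box, score):"]
    for i, (box, score) in enumerate(candidates):
        lines.append(f"{i}: box={box} score={score:.2f}")
    lines.append(
        "Reply with the indices of the candidates that match the query, "
        "comma-separated, or 'none'."
    )
    return "\n".join(lines)


def parse_indices(reply: str, n: int) -> List[int]:
    """Extract valid, de-duplicated indices in [0, n) from the reply text."""
    picked: List[int] = []
    for token in reply.replace(",", " ").split():
        if token.isdecimal():
            index = int(token)
            if index < n and index not in picked:
                picked.append(index)
    return picked


def select_with_llm(
    query: str,
    candidates: List[Candidate],
    ask_llm: Callable[[str], str],  # hypothetical: wraps your LLM client
) -> List[Candidate]:
    """Ask the model which candidates match and return only those."""
    reply = ask_llm(build_prompt(query, candidates))
    return [candidates[i] for i in parse_indices(reply, len(candidates))]
```

Passing `ask_llm` in as a plain callable keeps the sketch independent of any particular client library and makes the selection logic easy to test with a stub.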

Agentic Object Detection demo

Code: https://github.com/maylad31/agentic-object-detection
