My take on the Agentic Object Detection

Here are the steps:

  • Segmenting Everything with SAM : We detect everything and worry about filtering later.

  • Filtering with CLIP: Once we have all the segmented objects, we don’t want all of them. We need to filter out the noise and keep only the relevant objects.

  • Adding Reasoning with a model like GPT-4o: Okay, so we’ve segmented and filtered. But what about finalising, understanding? That’s where a strong LLM like GPT-4o comes in.

Here is what I did with SAM and clip, we now need to use a good LLM on top and add some reasoning..

code: https://github.com/maylad31/agentic-object-detection

原文链接:My take on the Agentic Object Detection

© 版权声明
THE END
喜欢就支持一下吧
点赞10 分享
Every day is beautiful if you choose to see it.
如果你愿意去发现,其实每一天都很美
评论 抢沙发

请登录后发表评论

    暂无评论内容