Identify objects in images using prompts
Complex text label dection using SAM3 with VLM-FO1
VLM-FO1-3B-Demo