🤖 Model 🎥
bytedance/sa2va-26b-video
Segment objects in video from a natural-language instruction. Provide a video and a text prompt and receive a masked vid...
Found 3 models (showing 1-3)
Segment objects in video from a natural-language instruction. Provide a video and a text prompt and receive a masked vid...
Segment objects in videos from natural-language instructions. Accepts a video and a text instruction (referring expressi...
Segment objects in images using natural language prompts. Accepts an input image and a text prompt describing the target...