bytedance/sa2va-26b-video
Segment objects in video from a natural-language instruction. Provide a video and a text prompt and receive a masked vid...
Found 7 models (showing 1-7)
Segment objects in video from a natural-language instruction. Provide a video and a text prompt and receive a masked vid...
Segment objects in videos from natural-language instructions. Takes a video and a text instruction (referring expression...
Segment objects in videos from natural-language instructions. Accepts a video and a text instruction (referring expressi...
Segment objects in videos from interactive point prompts, labels, and object IDs, outputting per-frame masks as a video...
Remove backgrounds from videos of people, outputting a green-screen video, an alpha matte, or a foreground mask. Takes a...
Segment objects across a video from a first-frame mask or a SAM point, and optionally remove them via video inpainting w...
Segment objects in videos from a text prompt. Accepts a video plus a text prompt for the target; optionally add visual p...