π€ Model π₯

bytedance/sa2va-8b-video
Segment objects in video using natural-language instructions. Accepts a video and a text prompt (e.g., βthe person weari...
Found 2 models (showing 1-2)
Segment objects in video using natural-language instructions. Accepts a video and a text prompt (e.g., βthe person weari...
Segment objects in video from a natural-language instruction. Accepts a video and a text prompt describing the target ob...