chenxwh/cogvlm2-video
Generate text descriptions and answers from a video input. Accepts a video and an optional prompt to perform video capti...
Found 7 models (showing 1-7)
Generate text descriptions and answers from a video input. Accepts a video and an optional prompt to perform video capti...
Answer questions and generate detailed descriptions from a video input. Provide a video and a text prompt to get caption...
Generate text descriptions and answers from a video input. Accepts a video and a natural-language prompt to perform vide...
Caption videos and answer open-ended questions about their content. Accepts one or more video inputs plus a list of natur...
Caption and answer questions about videos. Takes a video and a text prompt and returns text, enabling detailed descripti...
Answer questions about videos and generate detailed captions from a video input. Accepts a video and a natural-language...
Answer questions about video content in a multi-turn chat. Takes a video and a chat message history as input and returns a...