🤖 Model 📝 → 📝

camenduru/minigpt4-video
Analyze and understand video content by generating textual descriptions from video inputs. This model uses interleaved v...
Found 2 models (showing 1-2)
Analyze and understand video content by generating textual descriptions from video inputs. This model uses interleaved v...
Generate textual captions for video content using GPT-5, focusing on visual elements without audio captioning.