🤖 Model 📝 → 📝

camenduru/minigpt4-video
Analyze and understand video content by generating textual descriptions from video inputs. This model uses interleaved v...
Found 4 models (showing 1-4)
Analyze and understand video content by generating textual descriptions from video inputs. This model uses interleaved v...
Generate realistic audio from video and text descriptions for professional-grade sound effect creation. Supports high-fi...
Classify text prompts, model responses, and multiple images for safety and policy compliance. Accepts text and a list of...
Run ComfyUI workflows to generate images or video from a ComfyUI API JSON and optional input media. Accept a workflow JS...