lucataco/apollo-7b
Analyze videos and generate text outputs, including detailed captions, summaries, and answers to questions about the con...
Found 22 models (showing 1-20)
Analyze videos and generate text outputs, including detailed captions, summaries, and answers to questions about the con...
Analyze videos and return text responses to prompts. Takes a video and a text prompt as input and outputs text for video...
Answer questions and generate descriptions from video input. Provide one or more videos and a text prompt, and receive t...
Auto-caption videos with TikTok-style on-screen subtitles. Transcribe speech using Whisper large-v3 with automatic langu...
Generate text descriptions and answers about video content from a video input. Accept a video plus an optional prompt (d...
Generate text from a video input. Take a video and an instruction prompt, and return captions, summaries, or answers to...
Caption and answer questions about videos. Accepts a video and an optional text prompt (instruction or question) and ret...
Answer questions about a video and generate detailed descriptions from a video input. Takes a video and a natural-langua...
Transcribe speech from online videos into timestamped text. Accepts a video URL (YouTube and other supported sites) and...
Transcribe speech to text with optional word-level timestamps. Accept audio files and HLS m3u8 streams, with start_time...
Answer questions about videos and generate detailed descriptions from a video input and a text prompt. Handle long-form...
Add karaoke-style captions to a video. Input a video (optionally a transcript JSON) and get a captioned video plus an ed...
Caption and answer questions about videos. Accepts a video and a text prompt and returns text outputs such as descriptio...
Edit videos by editing the transcript. Input a video and either transcribe it to text, or supply a desired transcript to...
Generate subtitles from audio or video input. Transcribe speech to text and return a JSON transcript with segment start/...
Create training-ready video datasets with automatic captions from YouTube links or uploaded video files. Extract and seg...
Transcribe or translate speech from audio files and videos to text. Accept audio or video input and return a transcript...
Transcribe audio or video to text. Accepts an audio or video input and returns a JSON transcript or ASS subtitles, lever...
Add autogenerated, stylized subtitles to a video. Input a video (optional: background music and/or a wordβlevel transcri...
Transcribe speech from silent or muted videos into text using visual speech recognition (lip reading). Accepts a video c...