aodianyun/minicpm-v-26-int4 🖼️📝 → 📝

▶️ 12 runs 📅 Sep 2024 ⚙️ Cog 0.9.21
image-to-text video-to-text

Example Output

Prompt:

"Describe the video in great detail."

Output

The video shows the process of cooking using a specialized kitchen appliance, which appears to be a deep fryer or wok with an integrated steam vent. Here's a detailed step-by-step description:

  1. Preparation: The video starts with the appliance on a stovetop, and someone is holding a ladle filled with what looks like chopped vegetables or meat.

  2. Frying: The person pours the contents into the frying section of the appliance. This section has a round opening through which food items are added for frying.

  3. Steaming: After adding the ingredients, the lid is placed on top of the appliance. There is a steam vent visible at the top, indicating that it can also function as a steamer.

  4. Cooking: Once the lid is sealed, the appliance seems to heat up, causing the steam to build up inside. The lid has perforations that allow steam to escape while keeping the food contained.

  5. Monitoring: Throughout the cooking process, the person monitoring the appliance adjusts the position of the lid slightly to ensure proper ventilation and even cooking.

  6. Final Cooking: The final frame shows the completed dish being removed from the appliance, ready to serve.

  7. Branding: The last two frames show the branding or contact information for the manufacturer or distributor of this appliance.

Overall, the video demonstrates the versatility of the appliance in both frying and steaming functions, showcasing its efficiency and ease of use in a commercial kitchen setting.

Performance Metrics

40.62s Prediction Time
40.65s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/LYg9UV9q67McivPuwvpQPjClOZ7KqaOHH5F2DgbfCZspIhOQ/input.mp4",
  "prompt": "Describe the video in great detail."
}
Input Parameters
image (required) Type: string
Input image or video
prompt Type: stringDefault:
Prompt
Output Schema

Output

Type: arrayItems Type: string

Example Execution Logs
video_duration 20.099
frame_rate 0.49753719090502013
Version Details
Version ID
bd591744b2f826dec0724825d05ef68ce1115571acee1a62de7e9dc681deaf1f
Version Created
September 13, 2024
Run on Replicate →