pranavsekhar/nano-banana-slideshow

▶️ 91 runs 📅 Sep 2025 ⚙️ Cog 0.14.0
image-to-video slideshow text-to-video

Example Output

Prompt:

"a red corvette sitting at the base of a mountain on a misty day, realistic day"

Output

{"video":"https://replicate.delivery/xezq/US5fuyIdtUVlFKccqEuE7g5iFCIbmAbrYT66Tf8SgO1Zz5VVA/variations_video.mp4","variations_used":"1. a red corvette sitting at the base of a mountain on a misty day, realistic day (base image)
2. Add a hiker with a backpack standing near the car looking up at the mountain trail
3. Place a vintage camping lantern and folded map on the corvette's hood
4. Add a golden retriever sitting beside the driver's door
5. Include scattered autumn leaves around the car's base and on the windshield
6. Place a pair of hiking boots and a water bottle on the ground near the passenger side
7. Add a small wooden trail marker sign pointing toward a mountain path
8. Include morning dew droplets covering the car's surface and nearby grass
9. Place a thermos of coffee and binoculars on the corvette's roof
10. Add a red-tailed hawk perched on a nearby weathered fence post"}

Performance Metrics

105.64s Prediction Time
106.11s Total Time
All Input Parameters
{
  "prompt": "a red corvette sitting at the base of a mountain on a misty day, realistic day",
  "num_variations": 10,
  "seconds_per_image": 0.3
}
Input Parameters
prompt (required) Type: string
Main prompt for the base image (e.g., 'a nice garden with fairy lights and a house in the background')
input_image Type: string
Optional starting image to use as base (if not provided, will generate from prompt)
num_variations Type: integer, Default: 20, Range: 1 - 50
Number of foreground variations to generate
seconds_per_image Type: number, Default: 0.2, Range: 0.1 - 2
Duration each image should show in the video (seconds)
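The parameter ranges above can be checked on the client before submitting a prediction. The following is a minimal sketch, not part of the model itself: the helper names build_input and estimated_duration are hypothetical, and the duration estimate assumes each image is shown exactly once and that num_variations counts the base image (as the example output, with 10 listed items for num_variations 10, suggests).

```python
from typing import Optional


def build_input(prompt: str,
                num_variations: int = 20,
                seconds_per_image: float = 0.2,
                input_image: Optional[str] = None) -> dict:
    """Validate values against the documented ranges and return an input dict."""
    if not prompt:
        raise ValueError("prompt is required")
    if not 1 <= num_variations <= 50:
        raise ValueError("num_variations must be between 1 and 50")
    if not 0.1 <= seconds_per_image <= 2:
        raise ValueError("seconds_per_image must be between 0.1 and 2")
    payload = {
        "prompt": prompt,
        "num_variations": num_variations,
        "seconds_per_image": seconds_per_image,
    }
    # input_image is optional; omit it so the model generates the base image.
    if input_image is not None:
        payload["input_image"] = input_image
    return payload


def estimated_duration(payload: dict) -> float:
    """Rough video length in seconds (assumption: one slot per image)."""
    return payload["num_variations"] * payload["seconds_per_image"]
```

With the example inputs (10 variations at 0.3 seconds each) the estimate is about 3 seconds, which is close to the final timestamp reported in the example logs.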
Output Schema
video Type: string, Format: uri
Video
variations_used Type: string
Variations Used
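Because variations_used is returned as a single string rather than a list, a caller may want to split it into individual prompts. This sketch assumes the format seen in the example output (one numbered item per line, "N. text"); the function name parse_variations is hypothetical, and the model does not document this format as stable.

```python
import re

# Matches lines such as "2. Add a hiker with a backpack ..."
_ITEM = re.compile(r"^\s*(\d+)\.\s+(.*\S)\s*$")


def parse_variations(variations_used: str) -> list[str]:
    """Return the text of each numbered line, in order of appearance."""
    items = []
    for line in variations_used.splitlines():
        match = _ITEM.match(line)
        if match:
            items.append(match.group(2))
    return items
```

Lines that do not start with a number and a period are skipped, so blank lines or stray text in the string are ignored rather than raising an error.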
Example Execution Logs
Generating base image...
/app/.venv/lib/python3.13/site-packages/coglet/scope.py:36: ExperimentalFeatureWarning: current_scope is an experimental internal function. It may change or be removed without warning.
  warnings.warn(
Generating variation 2/10: Add a hiker with a backpack standing near the car looking up at the mountain trail
Generating variation 3/10: Place a vintage camping lantern and folded map on the corvette's hood
Generating variation 4/10: Add a golden retriever sitting beside the driver's door
Generating variation 5/10: Include scattered autumn leaves around the car's base and on the windshield
Generating variation 6/10: Place a pair of hiking boots and a water bottle on the ground near the passenger side
Generating variation 7/10: Add a small wooden trail marker sign pointing toward a mountain path
Generating variation 8/10: Include morning dew droplets covering the car's surface and nearby grass
Generating variation 9/10: Place a thermos of coffee and binoculars on the corvette's roof
Generating variation 10/10: Add a red-tailed hawk perched on a nearby weathered fence post
ffmpeg version 7.1.1 Copyright (c) 2000-2025 the FFmpeg developers
  built with gcc 13.2.1 (Alpine 13.2.1_git20240309) 20240309
  configuration: --pkg-config-flags=--static --extra-cflags=-fopenmp --extra-ldflags='-fopenmp -Wl,--allow-multiple-definition -Wl,-z,stack-size=2097152' --toolchain=hardened --disable-debug --disable-shared --disable-ffplay --enable-static --enable-gpl --enable-version3 --enable-fontconfig --enable-gray --enable-iconv --enable-lcms2 --enable-libaom --enable-libaribb24 --enable-libass --enable-libbluray --enable-libdav1d --enable-libdavs2 --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libharfbuzz --enable-libjxl --enable-libkvazaar --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-libopus --enable-librabbitmq --enable-librav1e --enable-librsvg --enable-librtmp --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libuavs3d --enable-libvidstab --enable-libvmaf --enable-libvo-amrwbenc --enable-libvorbis --enable-libvpl --enable-libvpx --enable-libvvenc --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxevd --enable-libxeve --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-openssl
  libavutil      59. 39.100 / 59. 39.100
  libavcodec     61. 19.101 / 61. 19.101
  libavformat    61.  7.100 / 61.  7.100
  libavdevice    61.  3.100 / 61.  3.100
  libavfilter    10.  4.100 / 10.  4.100
  libswscale      8.  3.100 /  8.  3.100
  libswresample   5.  3.100 /  5.  3.100
  libpostproc    58.  3.100 / 58.  3.100
Input #0, concat, from '/tmp/ffmpeg_input.txt':
  Duration: N/A, start: 0.000000, bitrate: N/A
  Stream #0:0: Video: mjpeg (Baseline), yuvj420p(pc, bt470bg/unknown/unknown), 1024x1024 [SAR 1:1 DAR 1:1], 25 fps, 25 tbr, 25 tbn
Stream mapping:
  Stream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264))
Press [q] to stop, [?] for help
[swscaler @ 0x7b7c6a137040] deprecated pixel format used, make sure you did set range correctly
[libx264 @ 0x7b7c6a9e4400] using SAR=1/1
[libx264 @ 0x7b7c6a9e4400] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512
[libx264 @ 0x7b7c6a9e4400] profile High, level 3.2, 4:2:0, 8-bit
[libx264 @ 0x7b7c6a9e4400] 264 - core 164 r3108 31e19f9 - H.264/MPEG-4 AVC codec - Copyleft 2003-2023 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=1 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00
Output #0, mp4, to '/tmp/variations_video.mp4':
  Metadata:
    encoder         : Lavf61.7.100
  Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(pc, bt470bg/unknown/unknown, progressive), 1024x1024 [SAR 1:1 DAR 1:1], q=2-31, 30 fps, 15360 tbn
      Metadata:
        encoder         : Lavc61.19.101 libx264
      Side data:
        cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A
frame=   16 fps=0.0 q=29.0 size=       0KiB time=00:00:00.46 bitrate=   0.8kbits/s dup=64 drop=0 speed=0.933x    
frame=   34 fps= 34 q=29.0 size=     256KiB time=00:00:01.06 bitrate=1966.4kbits/s dup=80 drop=0 speed=1.07x    
frame=   60 fps= 40 q=29.0 size=     512KiB time=00:00:01.93 bitrate=2169.7kbits/s dup=86 drop=0 speed=1.29x    
frame=   85 fps= 42 q=29.0 size=     768KiB time=00:00:02.76 bitrate=2274.2kbits/s dup=86 drop=0 speed=1.38x    
[out#0/mp4 @ 0x7b7c6a9f8580] video:834KiB audio:0KiB subtitle:0KiB other streams:0KiB global headers:0KiB muxing overhead: 0.239171%
frame=   97 fps= 46 q=29.0 Lsize=     836KiB time=00:00:03.16 bitrate=2163.1kbits/s dup=86 drop=0 speed= 1.5x    
[libx264 @ 0x7b7c6a9e4400] frame I:1     Avg QP:20.38  size: 95047
[libx264 @ 0x7b7c6a9e4400] frame P:24    Avg QP:20.76  size: 31266
[libx264 @ 0x7b7c6a9e4400] frame B:72    Avg QP:21.46  size:   112
[libx264 @ 0x7b7c6a9e4400] consecutive B-frames:  1.0%  0.0%  0.0% 99.0%
[libx264 @ 0x7b7c6a9e4400] mb I  I16..4:  4.5% 94.5%  1.1%
[libx264 @ 0x7b7c6a9e4400] mb P  I16..4:  0.9% 21.1%  0.2%  P16..4:  9.0%  4.5%  2.4%  0.0%  0.0%    skip:61.9%
[libx264 @ 0x7b7c6a9e4400] mb B  I16..4:  0.0%  0.0%  0.0%  B16..8:  1.8%  0.0%  0.0%  direct: 0.0%  skip:98.2%  L0:36.3% L1:60.6% BI: 3.1%
[libx264 @ 0x7b7c6a9e4400] 8x8 transform intra:95.0% inter:77.0%
[libx264 @ 0x7b7c6a9e4400] coded y,uvDC,uvAC intra: 87.9% 43.0% 6.2% inter: 2.6% 2.1% 0.1%
[libx264 @ 0x7b7c6a9e4400] i16 v,h,dc,p:  6% 32%  7% 55%
[libx264 @ 0x7b7c6a9e4400] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 15% 27% 26%  4%  5%  5%  6%  4%  8%
[libx264 @ 0x7b7c6a9e4400] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 15% 28% 10%  5% 11%  7% 11%  4%  8%
[libx264 @ 0x7b7c6a9e4400] i8c dc,h,v,p: 54% 29% 16%  2%
[libx264 @ 0x7b7c6a9e4400] Weighted P-Frames: Y:4.2% UV:0.0%
[libx264 @ 0x7b7c6a9e4400] ref P L0: 68.7% 21.1%  9.1%  0.9%  0.1%
[libx264 @ 0x7b7c6a9e4400] ref B L0: 67.6% 31.2%  1.2%
[libx264 @ 0x7b7c6a9e4400] ref B L1: 96.3%  3.7%
[libx264 @ 0x7b7c6a9e4400] kb/s:2111.78
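The logs above show ffmpeg reading its input through the concat demuxer from /tmp/ffmpeg_input.txt. The actual contents of that file are not shown, so the following is only a sketch of how such a list could be written for a fixed per-image duration, using the concat demuxer's documented file and duration directives. The function name build_concat_list and the file names in the test are hypothetical; repeating the last file is a commonly used workaround so the final image keeps its duration, not something confirmed for this model.

```python
def build_concat_list(image_paths, seconds_per_image):
    """Build the text of an ffmpeg concat-demuxer list for a slideshow."""
    def quote(path):
        # Inside single quotes, a literal quote is written as '\''
        return "'" + path.replace("'", "'\\''") + "'"

    lines = []
    for path in image_paths:
        lines.append(f"file {quote(path)}")
        lines.append(f"duration {seconds_per_image}")
    if image_paths:
        # Assumption: repeat the last entry so its duration is honored.
        lines.append(f"file {quote(image_paths[-1])}")
    return "\n".join(lines) + "\n" if lines else ""
```

A list like this would typically be passed to ffmpeg with the -f concat input format; the exact command used by the model is not visible in the logs.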
Version Details
Version ID
6a9501a453eb7ec93984708a927ec196a16075ff36eb8aa9dcc9dc5cae7b4f34
Version Created
September 16, 2025