pranavsekhar/nano-banana-slideshow 📝🖼️🔢 → ❓
About
Example Output
Prompt:
"a red corvette sitting at the base of a mountain on a misty day, realistic day"
Output
{"video":"https://replicate.delivery/xezq/US5fuyIdtUVlFKccqEuE7g5iFCIbmAbrYT66Tf8SgO1Zz5VVA/variations_video.mp4","variations_used":"1. a red corvette sitting at the base of a mountain on a misty day, realistic day (base image)
2. Add a hiker with a backpack standing near the car looking up at the mountain trail
3. Place a vintage camping lantern and folded map on the corvette's hood
4. Add a golden retriever sitting beside the driver's door
5. Include scattered autumn leaves around the car's base and on the windshield
6. Place a pair of hiking boots and a water bottle on the ground near the passenger side
7. Add a small wooden trail marker sign pointing toward a mountain path
8. Include morning dew droplets covering the car's surface and nearby grass
9. Place a thermos of coffee and binoculars on the corvette's roof
10. Add a red-tailed hawk perched on a nearby weathered fence post"}
2. Add a hiker with a backpack standing near the car looking up at the mountain trail
3. Place a vintage camping lantern and folded map on the corvette's hood
4. Add a golden retriever sitting beside the driver's door
5. Include scattered autumn leaves around the car's base and on the windshield
6. Place a pair of hiking boots and a water bottle on the ground near the passenger side
7. Add a small wooden trail marker sign pointing toward a mountain path
8. Include morning dew droplets covering the car's surface and nearby grass
9. Place a thermos of coffee and binoculars on the corvette's roof
10. Add a red-tailed hawk perched on a nearby weathered fence post"}
Performance Metrics
105.64s
Prediction Time
106.11s
Total Time
All Input Parameters
{ "prompt": "a red corvette sitting at the base of a mountain on a misty day, realistic day", "num_variations": 10, "seconds_per_image": 0.3 }
Input Parameters
- prompt (required)
- Main prompt for the base image (e.g., 'a nice garden with fairy lights and a house in the background')
- input_image
- Optional starting image to use as base (if not provided, will generate from prompt)
- num_variations
- Number of foreground variations to generate
- seconds_per_image
- Duration each image should show in the video (seconds)
Output Schema
- video
- Video
- variations_used
- Variations Used
Example Execution Logs
Generating base image... /app/.venv/lib/python3.13/site-packages/coglet/scope.py:36: ExperimentalFeatureWarning: current_scope is an experimental internal function. It may change or be removed without warning. warnings.warn( Generating variation 2/10: Add a hiker with a backpack standing near the car looking up at the mountain trail Generating variation 3/10: Place a vintage camping lantern and folded map on the corvette's hood Generating variation 4/10: Add a golden retriever sitting beside the driver's door Generating variation 5/10: Include scattered autumn leaves around the car's base and on the windshield Generating variation 6/10: Place a pair of hiking boots and a water bottle on the ground near the passenger side Generating variation 7/10: Add a small wooden trail marker sign pointing toward a mountain path Generating variation 8/10: Include morning dew droplets covering the car's surface and nearby grass Generating variation 9/10: Place a thermos of coffee and binoculars on the corvette's roof Generating variation 10/10: Add a red-tailed hawk perched on a nearby weathered fence post ffmpeg version 7.1.1 Copyright (c) 2000-2025 the FFmpeg developers built with gcc 13.2.1 (Alpine 13.2.1_git20240309) 20240309 configuration: --pkg-config-flags=--static --extra-cflags=-fopenmp --extra-ldflags='-fopenmp -Wl,--allow-multiple-definition -Wl,-z,stack-size=2097152' --toolchain=hardened --disable-debug --disable-shared --disable-ffplay --enable-static --enable-gpl --enable-version3 --enable-fontconfig --enable-gray --enable-iconv --enable-lcms2 --enable-libaom --enable-libaribb24 --enable-libass --enable-libbluray --enable-libdav1d --enable-libdavs2 --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libharfbuzz --enable-libjxl --enable-libkvazaar --enable-libmodplug --enable-libmp3lame --enable-libmysofa --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-libopus --enable-librabbitmq --enable-librav1e --enable-librsvg --enable-librtmp --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libuavs3d --enable-libvidstab --enable-libvmaf --enable-libvo-amrwbenc --enable-libvorbis --enable-libvpl --enable-libvpx --enable-libvvenc --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxavs2 --enable-libxevd --enable-libxeve --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-openssl libavutil 59. 39.100 / 59. 39.100 libavcodec 61. 19.101 / 61. 19.101 libavformat 61. 7.100 / 61. 7.100 libavdevice 61. 3.100 / 61. 3.100 libavfilter 10. 4.100 / 10. 4.100 libswscale 8. 3.100 / 8. 3.100 libswresample 5. 3.100 / 5. 3.100 libpostproc 58. 3.100 / 58. 3.100 Input #0, concat, from '/tmp/ffmpeg_input.txt': Duration: N/A, start: 0.000000, bitrate: N/A Stream #0:0: Video: mjpeg (Baseline), yuvj420p(pc, bt470bg/unknown/unknown), 1024x1024 [SAR 1:1 DAR 1:1], 25 fps, 25 tbr, 25 tbn Stream mapping: Stream #0:0 -> #0:0 (mjpeg (native) -> h264 (libx264)) Press [q] to stop, [?] for help [swscaler @ 0x7b7c6a137040] deprecated pixel format used, make sure you did set range correctly [libx264 @ 0x7b7c6a9e4400] using SAR=1/1 [libx264 @ 0x7b7c6a9e4400] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 AVX512 [libx264 @ 0x7b7c6a9e4400] profile High, level 3.2, 4:2:0, 8-bit [libx264 @ 0x7b7c6a9e4400] 264 - core 164 r3108 31e19f9 - H.264/MPEG-4 AVC codec - Copyleft 2003-2023 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=1 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/variations_video.mp4': Metadata: encoder : Lavf61.7.100 Stream #0:0: Video: h264 (avc1 / 0x31637661), yuv420p(pc, bt470bg/unknown/unknown, progressive), 1024x1024 [SAR 1:1 DAR 1:1], q=2-31, 30 fps, 15360 tbn Metadata: encoder : Lavc61.19.101 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A frame= 16 fps=0.0 q=29.0 size= 0KiB time=00:00:00.46 bitrate= 0.8kbits/s dup=64 drop=0 speed=0.933x frame= 34 fps= 34 q=29.0 size= 256KiB time=00:00:01.06 bitrate=1966.4kbits/s dup=80 drop=0 speed=1.07x frame= 60 fps= 40 q=29.0 size= 512KiB time=00:00:01.93 bitrate=2169.7kbits/s dup=86 drop=0 speed=1.29x frame= 85 fps= 42 q=29.0 size= 768KiB time=00:00:02.76 bitrate=2274.2kbits/s dup=86 drop=0 speed=1.38x [out#0/mp4 @ 0x7b7c6a9f8580] video:834KiB audio:0KiB subtitle:0KiB other streams:0KiB global headers:0KiB muxing overhead: 0.239171% frame= 97 fps= 46 q=29.0 Lsize= 836KiB time=00:00:03.16 bitrate=2163.1kbits/s dup=86 drop=0 speed= 1.5x [libx264 @ 0x7b7c6a9e4400] frame I:1 Avg QP:20.38 size: 95047 [libx264 @ 0x7b7c6a9e4400] frame P:24 Avg QP:20.76 size: 31266 [libx264 @ 0x7b7c6a9e4400] frame B:72 Avg QP:21.46 size: 112 [libx264 @ 0x7b7c6a9e4400] consecutive B-frames: 1.0% 0.0% 0.0% 99.0% [libx264 @ 0x7b7c6a9e4400] mb I I16..4: 4.5% 94.5% 1.1% [libx264 @ 0x7b7c6a9e4400] mb P I16..4: 0.9% 21.1% 0.2% P16..4: 9.0% 4.5% 2.4% 0.0% 0.0% skip:61.9% [libx264 @ 0x7b7c6a9e4400] mb B I16..4: 0.0% 0.0% 0.0% B16..8: 1.8% 0.0% 0.0% direct: 0.0% skip:98.2% L0:36.3% L1:60.6% BI: 3.1% [libx264 @ 0x7b7c6a9e4400] 8x8 transform intra:95.0% inter:77.0% [libx264 @ 0x7b7c6a9e4400] coded y,uvDC,uvAC intra: 87.9% 43.0% 6.2% inter: 2.6% 2.1% 0.1% [libx264 @ 0x7b7c6a9e4400] i16 v,h,dc,p: 6% 32% 7% 55% [libx264 @ 0x7b7c6a9e4400] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 15% 27% 26% 4% 5% 5% 6% 4% 8% [libx264 @ 0x7b7c6a9e4400] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 15% 28% 10% 5% 11% 7% 11% 4% 8% [libx264 @ 0x7b7c6a9e4400] i8c dc,h,v,p: 54% 29% 16% 2% [libx264 @ 0x7b7c6a9e4400] Weighted P-Frames: Y:4.2% UV:0.0% [libx264 @ 0x7b7c6a9e4400] ref P L0: 68.7% 21.1% 9.1% 0.9% 0.1% [libx264 @ 0x7b7c6a9e4400] ref B L0: 67.6% 31.2% 1.2% [libx264 @ 0x7b7c6a9e4400] ref B L1: 96.3% 3.7% [libx264 @ 0x7b7c6a9e4400] kb/s:2111.78
Version Details
- Version ID
6a9501a453eb7ec93984708a927ec196a16075ff36eb8aa9dcc9dc5cae7b4f34
- Version Created
- September 16, 2025