aramintak/frosting-land_wan2 ðĒðžïļâð â ðžïļ
About
Example Output
Prompt:
"a kaiju monster walking through a city illustration style"
Output
Performance Metrics
876.87s
Prediction Time
925.71s
Total Time
All Input Parameters
{
"image": "https://replicate.delivery/pbxt/Mqg8hCbYcvoUxOZ8gV7vUjMaeh9KX5dzp3BHLTFxzlxvtJGt/filename_00162_.png",
"frames": 81,
"prompt": "a kaiju monster walking through a city illustration style",
"fast_mode": "Balanced",
"resolution": "720p",
"aspect_ratio": "16:9",
"sample_shift": 7,
"sample_steps": 30,
"negative_prompt": "ugly, messy, grain, film grain, degradation, messed up",
"lora_strength_clip": 1,
"sample_guide_scale": 4,
"lora_strength_model": 0.9
}
Input Parameters
- seed
- Set a seed for reproducibility. Random by default.
- image
- Image to use as a starting frame for image to video generation.
- frames
- The number of frames to generate (1 to 5 seconds)
- prompt (required)
- Text prompt for video generation
- fast_mode
- Speed up generation with different levels of acceleration. Faster modes may degrade quality somewhat. The speedup is dependent on the content, so different videos may see different speedups.
- resolution
- The resolution of the video. 720p is not supported for 1.3b.
- aspect_ratio
- The aspect ratio of the video. 16:9, 9:16, 1:1, etc.
- sample_shift
- Sample shift factor
- sample_steps
- Number of generation steps. Fewer steps means faster generation, at the expensive of output quality. 30 steps is sufficient for most prompts
- negative_prompt
- Things you do not want to see in your video
- replicate_weights
- Replicate LoRA weights to use. Leave blank to use the default weights.
- lora_strength_clip
- Strength of the LORA applied to the CLIP model. 0.0 is no LORA.
- sample_guide_scale
- Higher guide scale makes prompt adherence better, but can reduce variation
- lora_strength_model
- Strength of the LORA applied to the model. 0.0 is no LORA.
Output Schema
Output
Example Execution Logs
Random seed set to: 853381153
2025-04-16T20:12:58Z | INFO | [ Initiating ] chunk_size=150M dest=/tmp/tmp69kkmk5l/weights url=https://replicate.delivery/xezq/UOfH00s5GetS2U9Z2elO0d8IhRo2neOc57g8WyuYGDIDbx5RB/trained_model.tar
2025-04-16T20:12:58Z | INFO | [ Cache Service ] enabled=true scheme=http target=hermes.services.svc.cluster.local
2025-04-16T20:13:01Z | INFO | [ Complete ] dest=/tmp/tmp69kkmk5l/weights size="307 MB" total_elapsed=3.159s url=https://replicate.delivery/xezq/UOfH00s5GetS2U9Z2elO0d8IhRo2neOc57g8WyuYGDIDbx5RB/trained_model.tar
Checking inputs
â
/tmp/inputs/image.png
====================================
Checking weights
â
wan_2.1_vae.safetensors exists in ComfyUI/models/vae
â
14b_49b48d33a84a18885dc3ebeddac8b186.safetensors exists in loras directory
â
umt5_xxl_fp16.safetensors exists in ComfyUI/models/text_encoders
â
clip_vision_h.safetensors exists in ComfyUI/models/clip_vision
âģ Downloading wan2.1_i2v_720p_14B_bf16.safetensors to ComfyUI/models/diffusion_models
â
wan2.1_i2v_720p_14B_bf16.safetensors downloaded to ComfyUI/models/diffusion_models in 29.46s, size: 31270.88MB
====================================
Running workflow
[ComfyUI] got prompt
Executing node 39, title: Load VAE, class type: VAELoader
[ComfyUI] Using pytorch attention in VAE
[ComfyUI] Using pytorch attention in VAE
[ComfyUI] VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
Executing node 55, title: Load Image, class type: LoadImage
Executing node 56, title: Width and height for scaling image to ideal resolution ðŠī, class type: Width and height for scaling image to ideal resolution ðŠī
Executing node 57, title: ð§ Image Resize, class type: ImageResize+
Executing node 60, title: Load CLIP Vision, class type: CLIPVisionLoader
Executing node 59, title: CLIP Vision Encode, class type: CLIPVisionEncode
[ComfyUI] Requested to load CLIPVisionModelProjection
[ComfyUI] loaded completely 141327.4875 1208.09814453125 True
Executing node 38, title: Load CLIP, class type: CLIPLoader
[ComfyUI] CLIP/text encoder model load device: cuda:0, offload device: cpu, current: cpu, dtype: torch.float16
[ComfyUI] Requested to load WanTEModel
Executing node 7, title: CLIP Text Encode (Negative Prompt), class type: CLIPTextEncode
[ComfyUI] loaded completely 139855.3869140625 10835.4765625 True
Executing node 37, title: Load Diffusion Model, class type: UNETLoader
[ComfyUI] model weight dtype torch.float16, manual cast: None
[ComfyUI] model_type FLOW
Executing node 54, title: WanVideo Tea Cache (native), class type: WanVideoTeaCacheKJ
Executing node 49, title: Load LoRA, class type: LoraLoader
[ComfyUI] Requested to load WanTEModel
Executing node 6, title: CLIP Text Encode (Positive Prompt), class type: CLIPTextEncode
[ComfyUI] loaded completely 139853.3869140625 10835.4765625 True
Executing node 58, title: WanImageToVideo, class type: WanImageToVideo
[ComfyUI] Requested to load WanVAE
[ComfyUI] loaded completely 118209.1806602478 242.02829551696777 True
Executing node 48, title: ModelSamplingSD3, class type: ModelSamplingSD3
Executing node 53, title: WanVideo Enhance A Video (native), class type: WanVideoEnhanceAVideoKJ
Executing node 3, title: KSampler, class type: KSampler
[ComfyUI] Requested to load WAN21
[ComfyUI] loaded completely 112525.20673397521 31269.802368164062 True
[ComfyUI]
[ComfyUI] 0%| | 0/30 [00:00<?, ?it/s]
[ComfyUI] 3%|â | 1/30 [00:37<18:17, 37.85s/it]
[ComfyUI] 7%|â | 2/30 [01:21<19:15, 41.28s/it]
[ComfyUI] 10%|â | 3/30 [02:06<19:23, 43.09s/it]
[ComfyUI] TeaCache: Initialized
[ComfyUI]
[ComfyUI] 13%|ââ | 4/30 [02:58<20:12, 46.62s/it]
[ComfyUI] 17%|ââ | 5/30 [03:44<19:13, 46.13s/it]
[ComfyUI] 20%|ââ | 6/30 [03:44<12:12, 30.50s/it]
[ComfyUI] 23%|âââ | 7/30 [04:29<13:32, 35.31s/it]
[ComfyUI] 27%|âââ | 8/30 [04:29<08:50, 24.13s/it]
[ComfyUI] 30%|âââ | 9/30 [05:15<10:47, 30.81s/it]
[ComfyUI] 33%|ââââ | 10/30 [05:15<07:07, 21.35s/it]
[ComfyUI] 37%|ââââ | 11/30 [06:00<09:04, 28.68s/it]
[ComfyUI] 40%|ââââ | 12/30 [06:00<06:00, 20.00s/it]
[ComfyUI] 43%|âââââ | 13/30 [06:45<07:49, 27.64s/it]
[ComfyUI] 47%|âââââ | 14/30 [06:46<05:09, 19.34s/it]
[ComfyUI] 50%|âââââ | 15/30 [07:31<06:48, 27.23s/it]
[ComfyUI] 53%|ââââââ | 16/30 [07:31<04:27, 19.09s/it]
[ComfyUI] 57%|ââââââ | 17/30 [08:17<05:50, 26.96s/it]
[ComfyUI] 60%|ââââââ | 18/30 [08:17<03:46, 18.91s/it]
[ComfyUI] 63%|âââââââ | 19/30 [09:02<04:55, 26.84s/it]
[ComfyUI] 67%|âââââââ | 20/30 [09:02<03:08, 18.83s/it]
[ComfyUI] 70%|âââââââ | 21/30 [09:02<01:59, 13.24s/it]
[ComfyUI] 73%|ââââââââ | 22/30 [09:48<03:03, 22.91s/it]
[ComfyUI] 77%|ââââââââ | 23/30 [09:48<01:52, 16.09s/it]
[ComfyUI] 80%|ââââââââ | 24/30 [10:33<02:29, 24.83s/it]
[ComfyUI] 83%|âââââââââ | 25/30 [10:33<01:27, 17.43s/it]
[ComfyUI] 87%|âââââââââ | 26/30 [11:19<01:43, 25.76s/it]
[ComfyUI] 90%|âââââââââ | 27/30 [11:19<00:54, 18.08s/it]
[ComfyUI] 93%|ââââââââââ| 28/30 [12:04<00:52, 26.22s/it]
[ComfyUI] 97%|ââââââââââ| 29/30 [12:49<00:31, 31.93s/it]
[ComfyUI] 100%|ââââââââââ| 30/30 [13:35<00:00, 35.93s/it]
Executing node 8, title: VAE Decode, class type: VAEDecode
Executing node 50, title: Video Combine ðĨð
Ĩð
ð
Ē, class type: VHS_VideoCombine
[ComfyUI] 100%|ââââââââââ| 30/30 [13:35<00:00, 27.17s/it]
[ComfyUI] Prompt executed in 843.77 seconds
outputs: {'50': {'gifs': [{'filename': 'R8_Wan_00001.mp4', 'subfolder': '', 'type': 'output', 'format': 'video/h264-mp4', 'frame_rate': 16.0, 'workflow': 'R8_Wan_00001.png', 'fullpath': '/tmp/outputs/R8_Wan_00001.mp4'}]}}
====================================
R8_Wan_00001.png
R8_Wan_00001.mp4
Version Details
- Version ID
c2d730536e34c591843983323351e09d1012c14d7596df1f640b1da8dbd102e6- Version Created
- April 1, 2025