camenduru/streaming-t2v 🔢📝✓ → 🖼️
About
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Example Output
Prompt:
"Experience the dance of jellyfish: float through mesmerizing swarms of jellyfish, pulsating with otherworldly grace and beauty."
Output
Performance Metrics
561.64s
Prediction Time
561.65s
Total Time
All Input Parameters
{ "seed": 33, "chunk": 24, "prompt": "Experience the dance of jellyfish: float through mesmerizing swarms of jellyfish, pulsating with otherworldly grace and beauty.", "enhance": true, "overlap": 8, "num_steps": 50, "num_frames": 120, "image_guidance": 9, "negative_prompt": "" }
Input Parameters
- seed
- chunk
- prompt
- enhance
- overlap
- num_steps
- num_frames
- image_guidance
- negative_prompt
Output Schema
Output
Example Execution Logs
LOCAL_RANK: 0 - CUDA_VISIBLE_DEVICES: [0] /usr/local/lib/python3.10/site-packages/pytorch_lightning/trainer/connectors/data_connector.py:442: PossibleUserWarning: The dataloader, predict_dataloader, does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument` (try 48 which is the number of cpus on this machine) in the `DataLoader` init to improve performance. rank_zero_warn( INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} INFERENCE PARAMS = {'concat_video': True, 'conditioning_from_all_past': False, 'conditioning_type': 'fixed', 'eta': 1.0, 'eval_loss_metrics': False, 'frame_rate': 8, 'guidance_scale': 7.5, 'height': 256, 'mode': 'long_video', 'n_autoregressive_generations': 4, 'negative_prompt': '', 'num_inference_steps': 50, 'result_formats': ['eval_mp4'], 'scheduler_cls': '', 'seed': 33, 'start_from_real_input': False, 'use_dec_scaling': True, 'validation_samples': 80, 'video_length': 16, 'width': 256} Predicting ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/1 0:06:16 • 0:00:00 0.00it/s /usr/local/lib/python3.10/site-packages/torchsde/_brownian/brownian_interval.py:608: UserWarning: Should have tb<=t1 but got tb=4.164773464202881 and t1=4.164773. warnings.warn(f"Should have {tb_name}<=t1 but got {tb_name}={tb} and t1={self._end}.")
Version Details
- Version ID
1fe245aad4bb7f209074a231142ac3eceb3b1f2adc9cf77b46e8ffa2662323cf
- Version Created
- April 10, 2024