zelenioncode/realvisxl4

4.8K runs, Feb 2024, Cog 0.8.6

Tags: photo-realistic, photorealistic, text-to-image

Performance

7.0s Typical run time

4.8K Total runs

About

Photorealistic image generation with RealVisXL v4.0 (Realistic Vision with Stable Diffusion XL).

Example Output

Prompt:

"RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3"

Output: one 768x1024 RGB image

Performance Metrics

7.03s Prediction Time

7.07s Total Time

All Input Parameters

{
  "seed": 42,
  "width": 768,
  "height": 1024,
  "prompt": "RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3",
  "scheduler": "DDIM",
  "guidance_scale": 7,
  "number_picture": 1,
  "negative_prompt": "(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck",
  "num_inference_steps": 30
}
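The input above could be submitted through the Replicate Python client roughly as follows. This is a minimal sketch, not the author's code: it assumes the `replicate` package is installed and `REPLICATE_API_TOKEN` is set in the environment. `build_input` and `run_prediction` are hypothetical helper names, and the version hash is the one listed under Version Details. The negative prompt is left out on the assumption that omitted fields fall back to the model defaults.

```python
MODEL = "zelenioncode/realvisxl4"
VERSION = "194f6c32973b10beafa727f3b3a5a9e9336f0656ea6d1a0e25586f4a6865124d"

PROMPT = (
    "RAW photo, a portrait photo of a woman in casual clothes, natural skin, "
    "tailored black three-piece suit,skin, 8k uhd, high quality, film grain, "
    "Fujifilm XT3"
)


def build_input(prompt=PROMPT, seed=42, width=768, height=1024, steps=30):
    """Return the example payload; negative_prompt is omitted (assumed to default)."""
    return {
        "seed": seed,
        "width": width,
        "height": height,
        "prompt": prompt,
        "scheduler": "DDIM",
        "guidance_scale": 7,
        "number_picture": 1,
        "num_inference_steps": steps,
    }


def run_prediction(payload):
    """Send the payload to Replicate; needs network access and an API token."""
    import replicate  # imported lazily so build_input works without the package

    return replicate.run(f"{MODEL}:{VERSION}", input=payload)
```

Calling `run_prediction(build_input())` should return one item per generated picture. Depending on the client version, each item may be a URL string or a file-like object, so treat the exact return type as something to check against the client you have installed.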

Input Parameters

seed (integer, default 42): Seed for the random number generator.
width (integer, default 512): Image width in pixels.
height (integer, default 768): Image height in pixels.
prompt (string): Text prompt. Default: RAW photo, a portrait photo of a latina woman in casual clothes, natural skin, 8k uhd, high quality, film grain, Fujifilm XT3
scheduler (default DDIM): Scheduler to use.
guidance_scale (integer, default 7): Guidance scale.
number_picture (integer, default 1, range 1 to 4): Number of pictures to generate.
negative_prompt (string): Negative prompt. Default: (deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck
num_inference_steps (integer, default 20): Number of inference steps.
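The defaults and the one documented range can be checked client-side before a request is sent. The sketch below is illustrative only: `DEFAULTS`, `INTEGER_FIELDS`, and `resolve_input` are hypothetical names, the long prompt defaults are left out for brevity, and the actual server-side validation may differ.

```python
# Non-prompt defaults as listed in the parameter list above.
DEFAULTS = {
    "seed": 42,
    "width": 512,
    "height": 768,
    "scheduler": "DDIM",
    "guidance_scale": 7,
    "number_picture": 1,
    "num_inference_steps": 20,
}

# Fields the page documents as integers (guidance_scale included).
INTEGER_FIELDS = (
    "seed", "width", "height",
    "guidance_scale", "number_picture", "num_inference_steps",
)

# String fields whose defaults are omitted here.
PROMPT_FIELDS = {"prompt", "negative_prompt"}


def resolve_input(overrides=None):
    """Merge overrides onto the defaults and apply the documented constraints."""
    merged = {**DEFAULTS, **(overrides or {})}
    unknown = set(merged) - set(DEFAULTS) - PROMPT_FIELDS
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    for name in INTEGER_FIELDS:
        value = merged[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            raise TypeError(f"{name} must be an integer, got {value!r}")
    if not 1 <= merged["number_picture"] <= 4:
        raise ValueError("number_picture must be between 1 and 4")
    return merged
```

For example, `resolve_input({"width": 768, "height": 1024, "num_inference_steps": 30})` yields the non-default values used in the example run, while `resolve_input({"number_picture": 5})` raises a `ValueError`.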

Output Schema

Output

Type: array • Items Type: string • Items Format: uri
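Because the output is an array of URI strings, saving the results means fetching each URI in turn. A minimal sketch using only the standard library follows. `target_paths` and `download_all` are hypothetical helpers, the `outputs` directory name is arbitrary, and the `.png` fallback extension is an assumption, since the page does not state the image format.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlopen


def target_paths(uris, out_dir="outputs"):
    """Map each output URI to a local file path, keeping the URI's extension."""
    paths = []
    for index, uri in enumerate(uris):
        # Assumption: fall back to .png when the URI path has no extension.
        suffix = Path(urlparse(str(uri)).path).suffix or ".png"
        paths.append(Path(out_dir) / f"image_{index}{suffix}")
    return paths


def download_all(uris, out_dir="outputs"):
    """Fetch every URI and write it to disk; requires network access."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    saved = []
    for uri, path in zip(uris, target_paths(uris, out_dir)):
        with urlopen(str(uri)) as response:
            path.write_bytes(response.read())
        saved.append(path)
    return saved
```

With `number_picture` set to 1 the array holds a single URI, so `download_all(output)` would write one file such as `outputs/image_0.png`.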

Example Execution Logs

{'prompt': ['RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3'], 'negative_prompt': ['(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck'], 'width': 768, 'height': 1024, 'guidance_scale': 7, 'num_inference_steps': 30, 'generator': <torch._C.Generator object at 0x7f1d3ae82cb0>}
  0%|          | 0/30 [00:00<?, ?it/s]
  3%|▎         | 1/30 [00:00<00:04,  6.68it/s]
  7%|▋         | 2/30 [00:00<00:04,  6.62it/s]
 10%|█         | 3/30 [00:00<00:04,  6.59it/s]
 13%|█▎        | 4/30 [00:00<00:03,  6.56it/s]
 17%|█▋        | 5/30 [00:00<00:03,  6.55it/s]
 20%|██        | 6/30 [00:00<00:03,  6.54it/s]
 23%|██▎       | 7/30 [00:01<00:03,  6.54it/s]
 27%|██▋       | 8/30 [00:01<00:03,  6.53it/s]
 30%|███       | 9/30 [00:01<00:03,  6.53it/s]
 33%|███▎      | 10/30 [00:01<00:03,  6.53it/s]
 37%|███▋      | 11/30 [00:01<00:02,  6.53it/s]
 40%|████      | 12/30 [00:01<00:02,  6.53it/s]
 43%|████▎     | 13/30 [00:01<00:02,  6.53it/s]
 47%|████▋     | 14/30 [00:02<00:02,  6.51it/s]
 50%|█████     | 15/30 [00:02<00:02,  6.51it/s]
 53%|█████▎    | 16/30 [00:02<00:02,  6.52it/s]
 57%|█████▋    | 17/30 [00:02<00:01,  6.53it/s]
 60%|██████    | 18/30 [00:02<00:01,  6.54it/s]
 63%|██████▎   | 19/30 [00:02<00:01,  6.55it/s]
 67%|██████▋   | 20/30 [00:03<00:01,  6.54it/s]
 70%|███████   | 21/30 [00:03<00:01,  6.55it/s]
 73%|███████▎  | 22/30 [00:03<00:01,  6.55it/s]
 77%|███████▋  | 23/30 [00:03<00:01,  6.55it/s]
 80%|████████  | 24/30 [00:03<00:00,  6.55it/s]
 83%|████████▎ | 25/30 [00:03<00:00,  6.55it/s]
 87%|████████▋ | 26/30 [00:03<00:00,  6.56it/s]
 90%|█████████ | 27/30 [00:04<00:00,  6.55it/s]
 93%|█████████▎| 28/30 [00:04<00:00,  6.54it/s]
 97%|█████████▋| 29/30 [00:04<00:00,  6.54it/s]
100%|██████████| 30/30 [00:04<00:00,  6.54it/s]
StableDiffusionXLPipelineOutput(images=[<PIL.Image.Image image mode=RGB size=768x1024 at 0x7F1D543C4760>])
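The log and the reported timings allow a rough breakdown of where the run time goes. The arithmetic below is a back-of-the-envelope sketch: it treats the final progress-bar rate of 6.54 it/s as the average over all 30 steps, and the variable names are illustrative. What fills the remaining prediction time is not stated on the page; text encoding, image decoding, and upload are plausible candidates.

```python
STEPS = 30                    # num_inference_steps in the example run
ITERATIONS_PER_SECOND = 6.54  # final rate reported by the progress bar
PREDICTION_TIME_S = 7.03      # "Prediction Time" from Performance Metrics
TOTAL_TIME_S = 7.07           # "Total Time" from Performance Metrics

# Time spent in the denoising loop, assuming a constant step rate.
denoising_s = STEPS / ITERATIONS_PER_SECOND

# Prediction time not accounted for by the denoising loop.
other_prediction_s = PREDICTION_TIME_S - denoising_s

# Time outside the prediction itself (for example queueing).
outside_prediction_s = TOTAL_TIME_S - PREDICTION_TIME_S

print(f"denoising loop:     {denoising_s:.2f} s")
print(f"rest of prediction: {other_prediction_s:.2f} s")
print(f"outside prediction: {outside_prediction_s:.2f} s")
```

Under these assumptions the denoising loop takes about 4.59 s, the rest of the prediction about 2.44 s, and about 0.04 s falls outside the prediction.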

Version Details

Version ID: 194f6c32973b10beafa727f3b3a5a9e9336f0656ea6d1a0e25586f4a6865124d
Version Created: February 20, 2024
