zelenioncode/realvisxl4 🔢📝❓ → 🖼️

▶️ 4.7K runs 📅 Feb 2024 ⚙️ Cog 0.8.6
photorealistic text-to-image

About

Photorealistic image generation with RealVisXL v4.0 (Realistic Vision with Stable Diffusion XL).

Example Output

Prompt:

"RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3"

Output

[Image: example output]

Performance Metrics

Prediction Time: 7.03s
Total Time: 7.07s

All Input Parameters
{
  "seed": 42,
  "width": 768,
  "height": 1024,
  "prompt": "RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3",
  "scheduler": "DDIM",
  "guidance_scale": 7,
  "number_picture": 1,
  "negative_prompt": "(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck",
  "num_inference_steps": 30
}
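The parameters above could be sent to the hosted model with the Replicate Python client roughly as follows. This is a minimal sketch, assuming the `replicate` package is installed and the `REPLICATE_API_TOKEN` environment variable is set. `build_input` and `run_example` are hypothetical helper names, and pinning the version ID listed under Version Details is an assumption about which version produced this example. The negative prompt is left out because the example value matches the documented default.

```python
# Model reference in "owner/name:version" form; the version pin is an assumption.
MODEL = (
    "zelenioncode/realvisxl4:"
    "194f6c32973b10beafa727f3b3a5a9e9336f0656ea6d1a0e25586f4a6865124d"
)

PROMPT = (
    "RAW photo, a portrait photo of a woman in casual clothes, natural skin, "
    "tailored black three-piece suit,skin, 8k uhd, high quality, film grain, "
    "Fujifilm XT3"
)


def build_input(prompt=PROMPT, seed=42):
    """Return the example input payload (hypothetical helper)."""
    return {
        "seed": seed,
        "width": 768,
        "height": 1024,
        "prompt": prompt,
        "scheduler": "DDIM",
        "guidance_scale": 7,
        "number_picture": 1,
        "num_inference_steps": 30,
        # negative_prompt omitted: the example value equals the documented default
    }


def run_example():
    """Call the hosted model; needs network access and an API token."""
    import replicate  # imported lazily so build_input works without the package

    return replicate.run(MODEL, input=build_input())
```

Calling `run_example()` should return one item per generated picture. Depending on the client version, the items may be URL strings or file-like objects that wrap a URL.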
Input Parameters

seed (Type: integer, Default: 42)
Enter a seed

width (Type: integer, Default: 512)
Enter a width

height (Type: integer, Default: 768)
Enter a height

prompt (Type: string, Default: RAW photo, a portrait photo of a latina woman in casual clothes, natural skin, 8k uhd, high quality, film grain, Fujifilm XT3)
Enter a prompt

scheduler (Default: DDIM)
Scheduler to use

guidance_scale (Type: integer, Default: 7)
Enter a guidance scale

number_picture (Type: integer, Default: 1, Range: 1 - 4)
Enter the number of pictures to generate

negative_prompt (Type: string, Default: (deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck)
Enter a negative prompt

num_inference_steps (Type: integer, Default: 20)
Enter a number of inference steps
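Before sending a request, the documented types, defaults, and the 1 to 4 range on `number_picture` can be checked locally. The sketch below is one way to do that; `DEFAULTS`, `INTEGER_FIELDS`, and `validate_input` are hypothetical names, not part of the model or of any client library. The long `prompt` and `negative_prompt` defaults are not repeated here, on the assumption that the server applies them when the fields are omitted.

```python
# Documented defaults for the non-text parameters.
DEFAULTS = {
    "seed": 42,
    "width": 512,
    "height": 768,
    "scheduler": "DDIM",
    "guidance_scale": 7,
    "number_picture": 1,
    "num_inference_steps": 20,
}

# Parameters documented as "Type: integer".
INTEGER_FIELDS = (
    "seed",
    "width",
    "height",
    "guidance_scale",
    "number_picture",
    "num_inference_steps",
)

TEXT_FIELDS = ("prompt", "negative_prompt")


def validate_input(overrides):
    """Merge overrides into the defaults and check them against the schema."""
    unknown = set(overrides) - set(DEFAULTS) - set(TEXT_FIELDS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")

    merged = {**DEFAULTS, **overrides}

    for name in INTEGER_FIELDS:
        value = merged[name]
        # bool is a subclass of int in Python, so reject it explicitly
        if isinstance(value, bool) or not isinstance(value, int):
            raise TypeError(f"{name} must be an integer, got {value!r}")

    for name in TEXT_FIELDS:
        if name in merged and not isinstance(merged[name], str):
            raise TypeError(f"{name} must be a string")

    if not 1 <= merged["number_picture"] <= 4:
        raise ValueError("number_picture must be between 1 and 4")

    return merged
```

For example, `validate_input({"width": 768, "height": 1024})` keeps the remaining defaults, while `validate_input({"guidance_scale": 7.5})` raises `TypeError` because the schema lists `guidance_scale` as an integer.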
Output Schema

Output

Type: array, Items Type: string, Items Format: uri
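Since the output is an array of URI strings, a caller usually needs to download each item to get the image files. The following is a minimal sketch using only the standard library; `local_name` and `download_all` are hypothetical helpers, and the `.png` fallback extension is an assumption, since the schema does not state the image format.

```python
import os
from urllib.parse import urlparse
from urllib.request import urlretrieve


def local_name(uri, index):
    """Pick a file name from the URI path, falling back to a numbered name."""
    name = os.path.basename(urlparse(uri).path)
    return name or f"output_{index}.png"  # extension is an assumption


def download_all(uris, directory="."):
    """Download every URI in the output array and return the local paths."""
    paths = []
    for index, uri in enumerate(uris):
        path = os.path.join(directory, local_name(uri, index))
        urlretrieve(uri, path)  # performs the HTTP request
        paths.append(path)
    return paths
```

With `number_picture` set above 1, the array should hold one URI per picture, so `download_all` would write several files. If two URIs end in the same file name, the later download overwrites the earlier one.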

Example Execution Logs
{'prompt': ['RAW photo, a portrait photo of a woman in casual clothes, natural skin, tailored black three-piece suit,skin, 8k uhd, high quality, film grain, Fujifilm XT3'], 'negative_prompt': ['(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck'], 'width': 768, 'height': 1024, 'guidance_scale': 7, 'num_inference_steps': 30, 'generator': <torch._C.Generator object at 0x7f1d3ae82cb0>}
  0%|          | 0/30 [00:00<?, ?it/s]
  3%|▎         | 1/30 [00:00<00:04,  6.68it/s]
  7%|▋         | 2/30 [00:00<00:04,  6.62it/s]
 10%|█         | 3/30 [00:00<00:04,  6.59it/s]
 13%|█▎        | 4/30 [00:00<00:03,  6.56it/s]
 17%|█▋        | 5/30 [00:00<00:03,  6.55it/s]
 20%|██        | 6/30 [00:00<00:03,  6.54it/s]
 23%|██▎       | 7/30 [00:01<00:03,  6.54it/s]
 27%|██▋       | 8/30 [00:01<00:03,  6.53it/s]
 30%|███       | 9/30 [00:01<00:03,  6.53it/s]
 33%|███▎      | 10/30 [00:01<00:03,  6.53it/s]
 37%|███▋      | 11/30 [00:01<00:02,  6.53it/s]
 40%|████      | 12/30 [00:01<00:02,  6.53it/s]
 43%|████▎     | 13/30 [00:01<00:02,  6.53it/s]
 47%|████▋     | 14/30 [00:02<00:02,  6.51it/s]
 50%|█████     | 15/30 [00:02<00:02,  6.51it/s]
 53%|█████▎    | 16/30 [00:02<00:02,  6.52it/s]
 57%|█████▋    | 17/30 [00:02<00:01,  6.53it/s]
 60%|██████    | 18/30 [00:02<00:01,  6.54it/s]
 63%|██████▎   | 19/30 [00:02<00:01,  6.55it/s]
 67%|██████▋   | 20/30 [00:03<00:01,  6.54it/s]
 70%|███████   | 21/30 [00:03<00:01,  6.55it/s]
 73%|███████▎  | 22/30 [00:03<00:01,  6.55it/s]
 77%|███████▋  | 23/30 [00:03<00:01,  6.55it/s]
 80%|████████  | 24/30 [00:03<00:00,  6.55it/s]
 83%|████████▎ | 25/30 [00:03<00:00,  6.55it/s]
 87%|████████▋ | 26/30 [00:03<00:00,  6.56it/s]
 90%|█████████ | 27/30 [00:04<00:00,  6.55it/s]
 93%|█████████▎| 28/30 [00:04<00:00,  6.54it/s]
 97%|█████████▋| 29/30 [00:04<00:00,  6.54it/s]
100%|██████████| 30/30 [00:04<00:00,  6.54it/s]
StableDiffusionXLPipelineOutput(images=[<PIL.Image.Image image mode=RGB size=768x1024 at 0x7F1D543C4760>])
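The progress bar and the performance metrics reported earlier can be combined to estimate where the time goes. The arithmetic below uses only the figures shown on this page (30 steps at about 6.54 it/s, 7.03 s prediction time, 7.07 s total time). The interpretation of the remainder is an assumption: it presumably covers work outside the denoising loop, such as text encoding, image decoding, and uploading the result.

```python
steps = 30                 # num_inference_steps from the example
rate_it_per_s = 6.54       # steady-state rate shown by the progress bar
prediction_time_s = 7.03   # reported prediction time
total_time_s = 7.07        # reported total time

# Time spent in the denoising loop: 30 / 6.54 ≈ 4.59 s
denoising_s = steps / rate_it_per_s

# Rest of the prediction, outside the loop: 7.03 - 4.59 ≈ 2.44 s
other_s = prediction_time_s - denoising_s

# Time outside the prediction itself: 7.07 - 7.03 = 0.04 s
outside_prediction_s = total_time_s - prediction_time_s

print(f"denoising loop:      {denoising_s:.2f} s")
print(f"rest of prediction:  {other_s:.2f} s")
print(f"outside prediction:  {outside_prediction_s:.2f} s")
```

On these numbers the denoising loop accounts for roughly two thirds of the prediction time, so halving the step count would be expected to save a little over 2 s rather than halve the total.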
Version Details
Version ID
194f6c32973b10beafa727f3b3a5a9e9336f0656ea6d1a0e25586f4a6865124d
Version Created
February 20, 2024