mbukerepo/photomaker 🔢📝❓🖼️✓ → 🖼️

▶️ 5.0K runs 📅 Jan 2024 ⚙️ Cog 0.8.6 📄 Paper ⚖️ License
image-consistent-character-generation image-to-image

About

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Example Output

Prompt:

"sci-fi, closeup portrait photo of a man img wearing the sunglasses in Iron man suit, face, slim body, high quality, film grain"

Output

Example output

Performance Metrics

11.24s Prediction Time
66.09s Total Time
All Input Parameters
{
  "image": "https://replicate.delivery/pbxt/KEoNTjDyiTvEKYmQF6d26yqIl7smGBQvrEKXva1qZg0DzV3j/newton_0.jpg",
  "prompt": "sci-fi, closeup portrait photo of a man img wearing the sunglasses in Iron man suit, face, slim body, high quality, film grain",
  "num_outputs": 1,
  "negative_prompt": "(asymmetry, worst quality, low quality, illustration, 3d, 2d, painting, cartoons, sketch), open mouth",
  "num_inference_steps": 40
}
Input Parameters
seed Type: integerRange: 0 - 2147483647
Seed. Leave blank to use a random number
prompt Type: stringDefault: A photo of a person img
Prompt. Example: 'a photo of a man/woman img'. The phrase 'img' is the trigger word.
num_steps Type: integerDefault: 20Range: 1 - 100
Number of sample steps
style_name Default: Photographic (Default)
Style template. The style template will add a style-specific prompt and negative prompt to the user's prompt.
first_image (required) Type: string
The input image, for example a photo of your face.
num_outputs Type: integerDefault: 1Range: 1 - 4
Number of output images
third_image Type: string
Additional input image (optional)
fourth_image Type: string
Additional input image (optional)
second_image Type: string
Additional input image (optional)
guidance_scale Type: numberDefault: 5Range: 1 - 10
Guidance scale. A guidance scale of 1 corresponds to doing no classifier free guidance.
negative_prompt Type: stringDefault: nsfw, lowres, bad anatomy, bad hands, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blurry
Negative Prompt. The negative prompt should NOT contain the trigger word.
style_strength_ratio Type: numberDefault: 20Range: 15 - 50
Style strength (%)
disable_safety_checker Type: booleanDefault: false
Disable safety checker for generated images.
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
2024-01-16 19:44:30,358 INFO predict Using seed: 318775383
  0%|          | 0/40 [00:00<?, ?it/s]
  2%|▎         | 1/40 [00:00<00:09,  4.33it/s]
  5%|▌         | 2/40 [00:00<00:08,  4.51it/s]
  8%|▊         | 3/40 [00:00<00:08,  4.57it/s]
 10%|█         | 4/40 [00:00<00:07,  4.60it/s]
 12%|█▎        | 5/40 [00:01<00:07,  4.61it/s]
 15%|█▌        | 6/40 [00:01<00:07,  4.62it/s]
 18%|█▊        | 7/40 [00:01<00:07,  4.62it/s]
 20%|██        | 8/40 [00:01<00:06,  4.62it/s]
 22%|██▎       | 9/40 [00:01<00:06,  4.62it/s]
 25%|██▌       | 10/40 [00:02<00:06,  4.63it/s]
 28%|██▊       | 11/40 [00:02<00:06,  4.61it/s]
 30%|███       | 12/40 [00:02<00:06,  4.62it/s]
 32%|███▎      | 13/40 [00:02<00:05,  4.63it/s]
 35%|███▌      | 14/40 [00:03<00:05,  4.63it/s]
 38%|███▊      | 15/40 [00:03<00:05,  4.63it/s]
 40%|████      | 16/40 [00:03<00:05,  4.64it/s]
 42%|████▎     | 17/40 [00:03<00:04,  4.64it/s]
 45%|████▌     | 18/40 [00:03<00:04,  4.64it/s]
 48%|████▊     | 19/40 [00:04<00:04,  4.63it/s]
 50%|█████     | 20/40 [00:04<00:04,  4.63it/s]
 52%|█████▎    | 21/40 [00:04<00:04,  4.63it/s]
 55%|█████▌    | 22/40 [00:04<00:03,  4.63it/s]
 57%|█████▊    | 23/40 [00:04<00:03,  4.63it/s]
 60%|██████    | 24/40 [00:05<00:03,  4.62it/s]
 62%|██████▎   | 25/40 [00:05<00:03,  4.62it/s]
 65%|██████▌   | 26/40 [00:05<00:03,  4.63it/s]
 68%|██████▊   | 27/40 [00:05<00:02,  4.63it/s]
 70%|███████   | 28/40 [00:06<00:02,  4.62it/s]
 72%|███████▎  | 29/40 [00:06<00:02,  4.63it/s]
 75%|███████▌  | 30/40 [00:06<00:02,  4.62it/s]
 78%|███████▊  | 31/40 [00:06<00:01,  4.62it/s]
 80%|████████  | 32/40 [00:06<00:01,  4.63it/s]
 82%|████████▎ | 33/40 [00:07<00:01,  4.63it/s]
 85%|████████▌ | 34/40 [00:07<00:01,  4.63it/s]
 88%|████████▊ | 35/40 [00:07<00:01,  4.63it/s]
 90%|█████████ | 36/40 [00:07<00:00,  4.63it/s]
 92%|█████████▎| 37/40 [00:08<00:00,  4.63it/s]
 95%|█████████▌| 38/40 [00:08<00:00,  4.63it/s]
 98%|█████████▊| 39/40 [00:08<00:00,  4.62it/s]
100%|██████████| 40/40 [00:08<00:00,  4.63it/s]
100%|██████████| 40/40 [00:08<00:00,  4.62it/s]
Version Details
Version ID
2cef1f7ff2e6fa2ba24569ddc467fe8e706db0f96e4663f4b5df213469c8e3a2
Version Created
January 19, 2024
Run on Replicate →