bytedance/flux-pulid 🔢📝❓🖼️ → 🖼️

▶️ 2.4M runs 📅 Sep 2024 ⚙️ Cog 0.9.20 🔗 GitHub 📄 Paper ⚖️ License
image-consistent-character-generation image-to-image

About

⚡️FLUX PuLID: FLUX-dev based Pure and Lightning ID Customization via Contrastive Alignment🎭

Example Output

Prompt:

"a woman holding sign with glowing green text "PuLID for FLUX""

Output

Example outputExample outputExample outputExample output

Performance Metrics

40.06s Prediction Time
62.28s Total Time
All Input Parameters
{
  "width": 896,
  "height": 1152,
  "prompt": "a woman holding sign with glowing green text \"PuLID for FLUX\"",
  "true_cfg": 1,
  "id_weight": 1,
  "num_steps": 20,
  "start_step": 4,
  "num_outputs": 4,
  "output_format": "webp",
  "guidance_scale": 4,
  "output_quality": 80,
  "main_face_image": "https://replicate.delivery/pbxt/MayZhLThspqLqRhirB8VEjqbEX5B0PWqjd9xe3PkTv2wLMcr/woman.png",
  "negative_prompt": "bad quality, worst quality, text, signature, watermark, extra limbs",
  "max_sequence_length": 128
}
Input Parameters
seed Type: integer
Set a random seed for generation (leave blank or -1 for random)
width Type: integerDefault: 896Range: 256 - 1536
Set the width of the generated image (256-1536 pixels)
height Type: integerDefault: 1152Range: 256 - 1536
Set the height of the generated image (256-1536 pixels)
prompt Type: stringDefault: portrait, color, cinematic
Enter a text prompt to guide image generation
true_cfg Type: numberDefault: 1Range: 1 - 10
Set the Classifier-Free Guidance (CFG) scale. 1.0 uses standard CFG, while values >1.0 enable True CFG for more precise control over generation. Higher values increase adherence to the prompt at the cost of image quality.
id_weight Type: numberDefault: 1Range: 0 - 3
Set the weight of the ID image influence (0.0-3.0)
num_steps Type: integerDefault: 20Range: 1 - 20
Set the number of denoising steps (1-20)
start_step Type: integerDefault: 0Range: 0 - 10
Set the timestep to start inserting ID (0-4 recommended, 0 for highest fidelity, 4 for more editability)
num_outputs Type: integerDefault: 1Range: 1 - 4
Set the number of images to generate (1-4)
output_format Default: webp
Choose the format of the output image
guidance_scale Type: numberDefault: 4Range: 1 - 10
Set the guidance scale for text prompt influence (1.0-10.0)
output_quality Type: integerDefault: 80Range: 1 - 100
Set the quality of the output image for jpg and webp (1-100)
main_face_image (required) Type: string
Upload an ID image for face generation
negative_prompt Type: stringDefault: bad quality, worst quality, text, signature, watermark, extra limbs, low resolution, partially rendered objects, deformed or partially rendered eyes, deformed, deformed eyeballs, cross-eyed, blurry
Enter a negative prompt to specify what to avoid in the image
max_sequence_length Type: integerDefault: 128Range: 128 - 512
Set the max sequence length for prompt (T5), smaller is faster (128-512)
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Using seeds: [3902735353, 3906845023, 3960091009, 3285723470]
Generating 'a woman holding sign with glowing green text "PuLID for FLUX"' with seeds [3902735353, 3906845023, 3960091009, 3285723470]
Denoising time: 32.90 seconds
Decoding time: 0.69 seconds
Total generate_image time: 39.42 seconds
Image 1 generated with seed: 3902735353
Image 2 generated with seed: 3906845023
Image 3 generated with seed: 3960091009
Image 4 generated with seed: 3285723470
Total prediction time: 39.72 seconds
Version Details
Version ID
8baa7ef2255075b46f4d91cd238c21d31181b3e6a864463f967960bb0112525b
Version Created
September 16, 2024
Run on Replicate →