prunaai/z-image-turbo 🔢📝❓ → 🖼️

⭐ Official ▶️ 9.0M runs 📅 Nov 2025 ⚙️ Cog 0.16.9
photorealistic text-in-image text-rendering text-to-image

About

Z-Image Turbo is a fast 6B-parameter text-to-image model developed by Tongyi-MAI.

Example Output

Prompt:

"A hyper-realistic, close-up portrait of a tribal elder from the Omo Valley, painted with intricate white chalk patterns and adorned with a headdress made of dried flowers, seed pods, and rusted bottle caps. The focus is razor-sharp on the texture of the skin, showing every pore, wrinkle, and scar that tells a story of survival. The background is a blurred, smoky hut interior, with the warm glow of a cooking fire reflecting in the subject's dark, soulful eyes. Shot on a Leica M6 with Kodak Portra 400 film grain aesthetic."

Output

[Image: example output]

Performance Metrics

1.43s Prediction Time
1.45s Total Time
All Input Parameters
{
  "width": 1024,
  "height": 768,
  "prompt": "A hyper-realistic, close-up portrait of a tribal elder from the Omo Valley, painted with intricate white chalk patterns and adorned with a headdress made of dried flowers, seed pods, and rusted bottle caps. The focus is razor-sharp on the texture of the skin, showing every pore, wrinkle, and scar that tells a story of survival. The background is a blurred, smoky hut interior, with the warm glow of a cooking fire reflecting in the subject's dark, soulful eyes. Shot on a Leica M6 with Kodak Portra 400 film grain aesthetic.",
  "output_format": "jpg",
  "guidance_scale": 0,
  "output_quality": 80,
  "num_inference_steps": 8
}
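The payload above can be sent to the model from Python. The following is a minimal sketch, assuming the `replicate` client package is installed and a `REPLICATE_API_TOKEN` environment variable is set; the helper names `build_input` and `run_prediction` are hypothetical and not part of this page.

```python
def build_input(prompt, width=1024, height=768, seed=None):
    """Assemble an input payload using the values from the example above."""
    payload = {
        "prompt": prompt,
        "width": width,
        "height": height,
        "output_format": "jpg",
        "guidance_scale": 0,  # the page recommends 0 for Turbo models
        "output_quality": 80,
        "num_inference_steps": 8,
    }
    if seed is not None:
        # seed is optional; set it for reproducible generation
        payload["seed"] = seed
    return payload


def run_prediction(payload):
    """Send the payload to the model (needs network access and an API token)."""
    # Assumption: `pip install replicate` has been run and
    # REPLICATE_API_TOKEN is set in the environment.
    import replicate

    return replicate.run("prunaai/z-image-turbo", input=payload)
```

A call would look like `output = run_prediction(build_input("a red bicycle", seed=42))`. The output schema on this page describes a URI string; depending on the client version, the returned object may wrap that URI rather than be a plain string.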
Input Parameters
seed Type: integer
Random seed. Set for reproducible generation
width Type: integer, Default: 1024, Range: 64 - 1440
Width of the generated image
height Type: integer, Default: 1024, Range: 64 - 1440
Height of the generated image
prompt (required) Type: string
Text prompt for image generation
output_format Default: jpg
Format of the output images
guidance_scale Type: number, Default: 0, Range: 0 - 20
Guidance scale. Should be 0 for Turbo models
output_quality Type: integer, Default: 80, Range: 0 - 100
Quality when saving the output images, from 0 to 100. 100 is best quality, 0 is lowest quality. Not relevant for .png outputs
num_inference_steps Type: integer, Default: 8, Range: 1 - 50
Number of inference steps. This actually results in (num_inference_steps - 1) DiT forwards
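The documented types and ranges can be checked on the client before a request is sent. This is a sketch only: the ranges are transcribed from the parameter list above, `validate_input` is a hypothetical helper, and the service may apply its own (possibly different) validation.

```python
# Ranges transcribed from the parameter list above.
INT_RANGES = {
    "width": (64, 1440),
    "height": (64, 1440),
    "output_quality": (0, 100),
    "num_inference_steps": (1, 50),
}
GUIDANCE_RANGE = (0, 20)


def validate_input(params):
    """Return a list of problems found in params (empty list means OK)."""
    errors = []

    # prompt is the only required parameter
    if not isinstance(params.get("prompt"), str):
        errors.append("prompt is required and must be a string")

    for name, (low, high) in INT_RANGES.items():
        if name not in params:
            continue  # optional: the documented default applies
        value = params[name]
        if isinstance(value, bool) or not isinstance(value, int):
            errors.append(f"{name} must be an integer")
        elif not low <= value <= high:
            errors.append(f"{name} must be between {low} and {high}")

    if "guidance_scale" in params:
        value = params["guidance_scale"]
        low, high = GUIDANCE_RANGE
        if isinstance(value, bool) or not isinstance(value, (int, float)):
            errors.append("guidance_scale must be a number")
        elif not low <= value <= high:
            errors.append(f"guidance_scale must be between {low} and {high}")

    if "seed" in params:
        value = params["seed"]
        if isinstance(value, bool) or not isinstance(value, int):
            errors.append("seed must be an integer")

    return errors
```

For example, `validate_input({"prompt": "a cat", "width": 2000})` reports a single problem for `width`, while a payload containing only a prompt passes because every other parameter has a default.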
Output Schema

Output

Type: string, Format: uri
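Since the output is a URI string, a client typically has to fetch the image itself. The sketch below uses only the Python standard library; `filename_from_uri` and `download` are hypothetical helpers, and the fallback file name assumes the default `jpg` output format.

```python
import os
import posixpath
import shutil
import urllib.parse
import urllib.request


def filename_from_uri(uri, default="output.jpg"):
    """Derive a local file name from the last path segment of the URI."""
    path = urllib.parse.urlparse(uri).path
    name = posixpath.basename(path)
    return name or default


def download(uri, dest_dir="."):
    """Fetch the image at uri into dest_dir and return the local path."""
    dest = os.path.join(dest_dir, filename_from_uri(uri))
    with urllib.request.urlopen(uri) as response, open(dest, "wb") as fh:
        shutil.copyfileobj(response, fh)
    return dest
```

If the URI has no usable file name in its path, the file is saved under the fallback name instead.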

Example Execution Logs
0%|          | 0/9 [00:00<?, ?it/s]
 11%|█         | 1/9 [00:00<00:01,  6.72it/s]
 33%|███▎      | 3/9 [00:00<00:00,  7.89it/s]
 44%|████▍     | 4/9 [00:00<00:00,  7.51it/s]
 56%|█████▌    | 5/9 [00:00<00:00,  7.28it/s]
 67%|██████▋   | 6/9 [00:00<00:00,  7.14it/s]
 78%|███████▊  | 7/9 [00:00<00:00,  7.05it/s]
 89%|████████▉ | 8/9 [00:01<00:00,  6.99it/s]
100%|██████████| 9/9 [00:01<00:00,  6.95it/s]
100%|██████████| 9/9 [00:01<00:00,  7.14it/s]
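The progress lines above are standard tqdm output, so the sampling-loop duration can be estimated from the final line. The following sketch parses the iteration counts and rate; the function names are hypothetical, and the regular expression only targets the line shape shown in these logs.

```python
import re

# Matches e.g. "9/9 [00:01<00:00,  7.14it/s]"; lines whose rate is "?" are skipped.
TQDM_LINE = re.compile(r"(\d+)/(\d+) \[\d+:\d+<[^,]*,\s*([\d.]+)it/s\]")

SAMPLE_LOG = """\
  0%|          | 0/9 [00:00<?, ?it/s]
 89%|████████▉ | 8/9 [00:01<00:00,  6.99it/s]
100%|██████████| 9/9 [00:01<00:00,  7.14it/s]
"""


def parse_tqdm(log):
    """Return (done, total, iterations_per_second) for each parsable line."""
    records = []
    for line in log.splitlines():
        match = TQDM_LINE.search(line)
        if match:
            done, total, rate = match.groups()
            records.append((int(done), int(total), float(rate)))
    return records


def estimate_loop_seconds(log):
    """Estimate loop duration as total iterations / final reported rate."""
    records = parse_tqdm(log)
    if not records:
        return None
    _, total, rate = records[-1]
    return total / rate
```

Applied to the logs above, 9 iterations at 7.14 it/s gives roughly 1.26 s, which suggests that most of the 1.43 s prediction time reported under Performance Metrics is spent in the sampling loop and the remainder outside it.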
Version Details
Version ID
5fd5416a46178ec2eb14ba9e3418c5c5410f85f34494b3618f29c953041e074b
Version Created
December 26, 2025