ndreca/hunyuan3d-2.1

13.1K runs | Jun 2025 | Cog 0.16.8 | GitHub | Paper | License
game-asset-generation image-to-3d mesh-generation single-view-3d-reconstruction

About

[Quality Mode] Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Example Output

Performance Metrics

110.91s Prediction Time
266.02s Total Time
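
The gap between the two figures is time that was not spent in prediction. A minimal sketch of the arithmetic, using only the two numbers reported above; attributing the remainder to queueing and model start-up is an assumption about how the platform accounts for time, not something this page states.

```python
# Figures copied from the Performance Metrics above.
prediction_s = 110.91
total_s = 266.02

# Time outside prediction (assumed here to be queueing and start-up).
overhead_s = round(total_s - prediction_s, 2)

# Fraction of the total wall-clock time spent in prediction itself.
prediction_share = prediction_s / total_s

print(f"overhead: {overhead_s:.2f}s")
print(f"prediction share: {prediction_share:.1%}")
```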
All Input Parameters
{
  "seed": 1234,
  "image": "https://replicate.delivery/pbxt/MVC2B2XKgv4X13qIpW6t2m59EVfY2CqaS9e2CSsWNHPJjQAd/image.png",
  "steps": 50,
  "num_chunks": 8000,
  "max_facenum": 20000,
  "guidance_scale": 7.5,
  "generate_texture": true,
  "octree_resolution": 256,
  "remove_background": false
}
Input Parameters
seed Type: integer, Default: 1234
Random seed for generation
image (required) Type: string
Input image for generating 3D shape
steps Type: integer, Default: 50, Range: 5 - 50
Number of inference steps
num_chunks Type: integer, Default: 8000, Range: 1000 - 200000
Number of chunks for mesh generation
max_facenum Type: integer, Default: 20000, Range: 10000 - 200000
Maximum number of faces for mesh generation
guidance_scale Type: number, Default: 7.5, Range: 1 - 20
Guidance scale for generation
generate_texture Type: boolean, Default: true
Whether to generate PBR textures
octree_resolution Default: 256
Octree resolution for mesh generation
remove_background Type: boolean, Default: true
Whether to remove background from input image
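
The defaults and ranges listed above can be checked on the client before a request is sent. The following is a minimal sketch under stated assumptions: `build_input`, `DEFAULTS`, and `RANGES` are hypothetical helper names introduced for illustration, the example image URL is a placeholder, and the values are copied from the parameter list on this page. The model may enforce its limits differently on the server.

```python
# Defaults as documented in the Input Parameters list (image has none).
DEFAULTS = {
    "seed": 1234,
    "steps": 50,
    "num_chunks": 8000,
    "max_facenum": 20000,
    "guidance_scale": 7.5,
    "generate_texture": True,
    "octree_resolution": 256,
    "remove_background": True,
}

# Inclusive ranges as documented; parameters without a listed range are omitted.
RANGES = {
    "steps": (5, 50),
    "num_chunks": (1000, 200000),
    "max_facenum": (10000, 200000),
    "guidance_scale": (1, 20),
}


def build_input(image, **overrides):
    """Return an input dict, rejecting unknown names and out-of-range values."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    params = {**DEFAULTS, **overrides}
    for name, (low, high) in RANGES.items():
        if not low <= params[name] <= high:
            raise ValueError(f"{name}={params[name]} is outside {low} - {high}")
    return {"image": image, **params}


# Placeholder URL, for illustration only.
payload = build_input("https://example.com/input.png", steps=30, remove_background=False)
print(payload)
```

A dictionary of this shape is what the `input` argument of `replicate.run` in the Replicate Python client takes; the version ID listed under Version Details can be appended to the model name to pin a specific build.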
Output Schema
mesh Type: string, Format: uri
Mesh
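
Once the file behind the `mesh` URI has been downloaded, its header can be checked before it is handed to a 3D tool. This sketch assumes the URI points at a binary glTF (GLB) file, which the `textured_mesh.glb` name in the example logs suggests but the schema does not guarantee. `read_glb_header` is a hypothetical helper name; the 12-byte layout (magic `glTF`, version, total length, all little-endian) comes from the glTF 2.0 binary container format.

```python
import struct


def read_glb_header(data: bytes) -> dict:
    """Parse the 12-byte GLB header and return its version and declared length."""
    if len(data) < 12:
        raise ValueError("too short to be a GLB file")
    # Little-endian: 4-byte magic, uint32 version, uint32 total file length.
    magic, version, length = struct.unpack("<4sII", data[:12])
    if magic != b"glTF":
        raise ValueError("missing glTF magic; not a GLB file")
    return {"version": version, "declared_length": length}


# Synthetic 12-byte header used in place of a real download.
sample = b"glTF" + struct.pack("<II", 2, 12)
header = read_glb_header(sample)
print(header)
```

For a real file, comparing `declared_length` with the number of bytes actually received is a cheap way to detect a truncated download.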
Example Execution Logs
Diffusion Sampling::   0%|          | 0/50 [00:00<?, ?it/s]
Diffusion Sampling::   2%|▏         | 1/50 [00:00<00:12,  3.93it/s]
Diffusion Sampling::   4%|▍         | 2/50 [00:00<00:11,  4.14it/s]
Diffusion Sampling::   6%|▌         | 3/50 [00:00<00:11,  4.17it/s]
Diffusion Sampling::   8%|▊         | 4/50 [00:00<00:10,  4.19it/s]
Diffusion Sampling::  10%|█         | 5/50 [00:01<00:10,  4.19it/s]
Diffusion Sampling::  12%|█▏        | 6/50 [00:01<00:10,  4.19it/s]
Diffusion Sampling::  14%|█▍        | 7/50 [00:01<00:10,  4.20it/s]
Diffusion Sampling::  16%|█▌        | 8/50 [00:01<00:10,  4.20it/s]
Diffusion Sampling::  18%|█▊        | 9/50 [00:02<00:09,  4.19it/s]
Diffusion Sampling::  20%|██        | 10/50 [00:02<00:09,  4.19it/s]
Diffusion Sampling::  22%|██▏       | 11/50 [00:02<00:09,  4.18it/s]
Diffusion Sampling::  24%|██▍       | 12/50 [00:02<00:09,  4.18it/s]
Diffusion Sampling::  26%|██▌       | 13/50 [00:03<00:08,  4.19it/s]
Diffusion Sampling::  28%|██▊       | 14/50 [00:03<00:08,  4.19it/s]
Diffusion Sampling::  30%|███       | 15/50 [00:03<00:08,  4.19it/s]
Diffusion Sampling::  32%|███▏      | 16/50 [00:03<00:08,  4.19it/s]
Diffusion Sampling::  34%|███▍      | 17/50 [00:04<00:07,  4.16it/s]
Diffusion Sampling::  36%|███▌      | 18/50 [00:04<00:07,  4.17it/s]
Diffusion Sampling::  38%|███▊      | 19/50 [00:04<00:07,  4.17it/s]
Diffusion Sampling::  40%|████      | 20/50 [00:04<00:07,  4.16it/s]
Diffusion Sampling::  42%|████▏     | 21/50 [00:05<00:06,  4.16it/s]
Diffusion Sampling::  44%|████▍     | 22/50 [00:05<00:06,  4.17it/s]
Diffusion Sampling::  46%|████▌     | 23/50 [00:05<00:06,  4.16it/s]
Diffusion Sampling::  48%|████▊     | 24/50 [00:05<00:06,  4.16it/s]
Diffusion Sampling::  50%|█████     | 25/50 [00:05<00:05,  4.17it/s]
Diffusion Sampling::  52%|█████▏    | 26/50 [00:06<00:05,  4.16it/s]
Diffusion Sampling::  54%|█████▍    | 27/50 [00:06<00:05,  4.15it/s]
Diffusion Sampling::  56%|█████▌    | 28/50 [00:06<00:05,  4.16it/s]
Diffusion Sampling::  58%|█████▊    | 29/50 [00:06<00:05,  4.15it/s]
Diffusion Sampling::  60%|██████    | 30/50 [00:07<00:04,  4.14it/s]
Diffusion Sampling::  62%|██████▏   | 31/50 [00:07<00:04,  4.14it/s]
Diffusion Sampling::  64%|██████▍   | 32/50 [00:07<00:04,  4.14it/s]
Diffusion Sampling::  66%|██████▌   | 33/50 [00:07<00:04,  4.14it/s]
Diffusion Sampling::  68%|██████▊   | 34/50 [00:08<00:03,  4.13it/s]
Diffusion Sampling::  70%|███████   | 35/50 [00:08<00:03,  4.12it/s]
Diffusion Sampling::  72%|███████▏  | 36/50 [00:08<00:03,  4.12it/s]
Diffusion Sampling::  74%|███████▍  | 37/50 [00:08<00:03,  4.12it/s]
Diffusion Sampling::  76%|███████▌  | 38/50 [00:09<00:02,  4.11it/s]
Diffusion Sampling::  78%|███████▊  | 39/50 [00:09<00:02,  4.11it/s]
Diffusion Sampling::  80%|████████  | 40/50 [00:09<00:02,  4.10it/s]
Diffusion Sampling::  82%|████████▏ | 41/50 [00:09<00:02,  4.11it/s]
Diffusion Sampling::  84%|████████▍ | 42/50 [00:10<00:01,  4.10it/s]
Diffusion Sampling::  86%|████████▌ | 43/50 [00:10<00:01,  4.10it/s]
Diffusion Sampling::  88%|████████▊ | 44/50 [00:10<00:01,  4.10it/s]
Diffusion Sampling::  90%|█████████ | 45/50 [00:10<00:01,  4.09it/s]
Diffusion Sampling::  92%|█████████▏| 46/50 [00:11<00:00,  4.09it/s]
Diffusion Sampling::  94%|█████████▍| 47/50 [00:11<00:00,  4.08it/s]
Diffusion Sampling::  96%|█████████▌| 48/50 [00:11<00:00,  4.07it/s]
Diffusion Sampling::  98%|█████████▊| 49/50 [00:11<00:00,  4.06it/s]
Diffusion Sampling:: 100%|██████████| 50/50 [00:12<00:00,  4.08it/s]
Diffusion Sampling:: 100%|██████████| 50/50 [00:12<00:00,  4.14it/s]
Volume Decoding:   0%|          | 0/2122 [00:00<?, ?it/s]
Volume Decoding:   4%|▍         | 89/2122 [00:00<00:02, 871.18it/s]
Volume Decoding:   8%|▊         | 177/2122 [00:00<00:03, 529.24it/s]
Volume Decoding:  11%|█         | 238/2122 [00:00<00:03, 479.84it/s]
Volume Decoding:  14%|█▎        | 290/2122 [00:00<00:03, 460.16it/s]
Volume Decoding:  16%|█▌        | 338/2122 [00:00<00:04, 445.52it/s]
Volume Decoding:  18%|█▊        | 384/2122 [00:00<00:03, 436.87it/s]
Volume Decoding:  20%|██        | 429/2122 [00:00<00:03, 432.22it/s]
Volume Decoding:  22%|██▏       | 473/2122 [00:01<00:03, 427.71it/s]
Volume Decoding:  24%|██▍       | 516/2122 [00:01<00:03, 422.04it/s]
Volume Decoding:  26%|██▋       | 559/2122 [00:01<00:03, 420.55it/s]
Volume Decoding:  28%|██▊       | 602/2122 [00:01<00:03, 420.98it/s]
Volume Decoding:  30%|███       | 645/2122 [00:01<00:03, 419.77it/s]
Volume Decoding:  32%|███▏      | 687/2122 [00:01<00:03, 417.90it/s]
Volume Decoding:  34%|███▍      | 729/2122 [00:01<00:03, 416.67it/s]
Volume Decoding:  36%|███▋      | 771/2122 [00:01<00:03, 417.22it/s]
Volume Decoding:  38%|███▊      | 814/2122 [00:01<00:03, 418.33it/s]
Volume Decoding:  40%|████      | 856/2122 [00:01<00:03, 415.13it/s]
Volume Decoding:  42%|████▏     | 898/2122 [00:02<00:02, 415.48it/s]
Volume Decoding:  44%|████▍     | 940/2122 [00:02<00:02, 416.80it/s]
Volume Decoding:  46%|████▋     | 982/2122 [00:02<00:02, 414.44it/s]
Volume Decoding:  48%|████▊     | 1024/2122 [00:02<00:02, 415.93it/s]
Volume Decoding:  50%|█████     | 1066/2122 [00:02<00:02, 414.35it/s]
Volume Decoding:  52%|█████▏    | 1109/2122 [00:02<00:02, 416.73it/s]
Volume Decoding:  54%|█████▍    | 1152/2122 [00:02<00:02, 418.41it/s]
Volume Decoding:  56%|█████▋    | 1194/2122 [00:02<00:02, 418.66it/s]
Volume Decoding:  58%|█████▊    | 1236/2122 [00:02<00:02, 418.96it/s]
Volume Decoding:  60%|██████    | 1278/2122 [00:02<00:02, 419.17it/s]
Volume Decoding:  62%|██████▏   | 1320/2122 [00:03<00:01, 416.59it/s]
Volume Decoding:  64%|██████▍   | 1362/2122 [00:03<00:01, 416.97it/s]
Volume Decoding:  66%|██████▌   | 1404/2122 [00:03<00:01, 416.98it/s]
Volume Decoding:  68%|██████▊   | 1446/2122 [00:03<00:01, 414.29it/s]
Volume Decoding:  70%|███████   | 1489/2122 [00:03<00:01, 416.49it/s]
Volume Decoding:  72%|███████▏  | 1532/2122 [00:03<00:01, 417.90it/s]
Volume Decoding:  74%|███████▍  | 1575/2122 [00:03<00:01, 418.64it/s]
Volume Decoding:  76%|███████▌  | 1618/2122 [00:03<00:01, 419.67it/s]
Volume Decoding:  78%|███████▊  | 1661/2122 [00:03<00:01, 419.88it/s]
Volume Decoding:  80%|████████  | 1703/2122 [00:03<00:01, 416.75it/s]
Volume Decoding:  82%|████████▏ | 1745/2122 [00:04<00:00, 415.47it/s]
Volume Decoding:  84%|████████▍ | 1787/2122 [00:04<00:00, 416.18it/s]
Volume Decoding:  86%|████████▌ | 1829/2122 [00:04<00:00, 414.41it/s]
Volume Decoding:  88%|████████▊ | 1871/2122 [00:04<00:00, 414.92it/s]
Volume Decoding:  90%|█████████ | 1913/2122 [00:04<00:00, 415.30it/s]
Volume Decoding:  92%|█████████▏| 1955/2122 [00:04<00:00, 415.87it/s]
Volume Decoding:  94%|█████████▍| 1997/2122 [00:04<00:00, 414.88it/s]
Volume Decoding:  96%|█████████▌| 2040/2122 [00:04<00:00, 416.43it/s]
Volume Decoding:  98%|█████████▊| 2082/2122 [00:04<00:00, 415.61it/s]
Volume Decoding: 100%|██████████| 2122/2122 [00:04<00:00, 425.66it/s]
PBR GLB file saved: output/textured_mesh.glb
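
The final line of each progress bar carries the elapsed time for that phase, so the logs can be summarized programmatically. This is a minimal sketch: the regular expression and the helper names `to_seconds` and `parse_final_line` are illustrative assumptions, and the two sample lines are the closing progress lines of the log above. The elapsed field is printed in whole seconds, so the totals are approximate.

```python
import re

# Matches a tqdm-style line: label, percent, bar, done/total, [elapsed<eta, rate].
TQDM_LINE = re.compile(
    r"^(?P<label>.+?):+\s+(?P<pct>\d+)%\|.*\|\s+"
    r"(?P<done>\d+)/(?P<total>\d+) "
    r"\[(?P<elapsed>[\d:]+)<(?P<eta>[^,]+),\s+(?P<rate>[\d.]+)it/s\]$"
)


def to_seconds(clock: str) -> int:
    """Convert 'MM:SS' or 'HH:MM:SS' into whole seconds."""
    seconds = 0
    for part in clock.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def parse_final_line(line: str) -> dict:
    """Extract label, step counts, elapsed seconds, and rate from one line."""
    match = TQDM_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"not a progress line: {line!r}")
    return {
        "label": match["label"],
        "done": int(match["done"]),
        "total": int(match["total"]),
        "elapsed_s": to_seconds(match["elapsed"]),
        "rate": float(match["rate"]),
    }


lines = [
    "Diffusion Sampling:: 100%|██████████| 50/50 [00:12<00:00,  4.14it/s]",
    "Volume Decoding: 100%|██████████| 2122/2122 [00:04<00:00, 425.66it/s]",
]
phases = {p["label"]: p for p in map(parse_final_line, lines)}
logged_s = sum(p["elapsed_s"] for p in phases.values())
print(phases)
print(f"time covered by progress bars: {logged_s}s")
```

The two bars cover roughly 16 s of the 110.91 s prediction time. The log does not itemize the rest; texture generation is a plausible contributor given that `generate_texture` was enabled, but that is an inference rather than something the log records.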
Version Details
Version ID
895e514f953d39e8b5bfb859df9313481ad3fa3a8631e5c54c7e5c9c85a6aa9f
Version Created
October 12, 2025