vivalapanda/conceptual-image-to-image 🔢📝🖼️❓ → 🖼️

▶️ 3.0K runs 📅 Nov 2022 ⚙️ Cog 0.4.4 🔗 GitHub
image-to-image stable-diffusion-2

About

Conceptual image-to-image model for Stable Diffusion 2.0

Example Output

Prompt:

"A psychedelic being living in an extradimensional reality, in the style of wlop, illustration, epic, fantasy, hyper detailed, smooth, unreal engine, sharp focus, ray tracing, physically based rendering, renderman, beautiful"

Output

Example outputExample outputExample outputExample output

Performance Metrics

23.09s Prediction Time
23.14s Total Time
All Input Parameters
{
  "prompt": "A psychedelic being living in an extradimensional reality, in the style of wlop, illustration, epic, fantasy, hyper detailed, smooth, unreal engine, sharp focus, ray tracing, physically based rendering, renderman, beautiful",
  "init_image": "https://replicate.delivery/pbxt/HrIfTF8vV2r9cCYnpU4bqlZQcFaW6USaRJI5tFyVBh87XQfy/Screen%20Shot%202022-11-26%20at%208.51.29%20PM.png",
  "captioning_model": "blip",
  "conceptual_image_strength": 0.44,
  "structural_image_strength": 0.09
}
Input Parameters
seed Type: integer
Random seed. Leave blank to randomize the seed
prompt Type: stringDefault:
Input prompt
init_image Type: string
Inital image to provide structural or conceptual guidance
captioning_model Default: blip
Captioning model to use. One of 'blip' or 'clip-interrogator-v1'
conceptual_image_strength Type: numberDefault: 0.4
Conceptual image strength. 0.0 doesn't use the image conceptually at all, 1.0 only uses the image concept and ignores the prompt.
structural_image_strength Type: numberDefault: 0.15
Structural (standard) image strength. 0.0 corresponds to full destruction of information, and does not use the initial image for structure.
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Using seed: 25141
Interrogating with ViT-L/14...
a picture of a cat in a space suit, a stock photo by Master MS, featured on reddit, space art, futuristic, sci-fi, stock photo
Captioning using blip
Image prompt: a picture of a cat in a space suit
0it [00:00, ?it/s]
2it [00:00,  8.24it/s]
3it [00:00,  7.06it/s]
4it [00:00,  6.57it/s]
5it [00:00,  6.32it/s]
6it [00:00,  6.16it/s]
7it [00:01,  6.08it/s]
8it [00:01,  6.00it/s]
9it [00:01,  5.98it/s]
10it [00:01,  5.90it/s]
11it [00:01,  5.94it/s]
12it [00:01,  5.93it/s]
13it [00:02,  5.91it/s]
14it [00:02,  5.89it/s]
15it [00:02,  5.90it/s]
16it [00:02,  5.90it/s]
17it [00:02,  5.89it/s]
18it [00:02,  5.89it/s]
19it [00:03,  5.89it/s]
20it [00:03,  5.88it/s]
21it [00:03,  5.89it/s]
22it [00:03,  5.83it/s]
23it [00:03,  5.87it/s]
24it [00:03,  5.88it/s]
25it [00:04,  5.88it/s]
26it [00:04,  5.88it/s]
27it [00:04,  5.88it/s]
28it [00:04,  5.88it/s]
29it [00:04,  5.88it/s]
30it [00:05,  5.88it/s]
31it [00:05,  5.89it/s]
32it [00:05,  5.89it/s]
33it [00:05,  5.88it/s]
34it [00:05,  5.88it/s]
35it [00:05,  5.88it/s]
36it [00:06,  5.88it/s]
37it [00:06,  5.88it/s]
38it [00:06,  5.88it/s]
39it [00:06,  5.88it/s]
40it [00:06,  5.88it/s]
41it [00:06,  5.88it/s]
42it [00:07,  5.88it/s]
43it [00:07,  5.88it/s]
44it [00:07,  5.88it/s]
45it [00:07,  5.88it/s]
46it [00:07,  5.89it/s]
47it [00:07,  5.89it/s]
48it [00:08,  5.88it/s]
49it [00:08,  5.89it/s]
50it [00:08,  5.89it/s]
51it [00:08,  5.89it/s]
52it [00:08,  5.89it/s]
53it [00:08,  5.87it/s]
54it [00:09,  5.89it/s]
54it [00:09,  5.94it/s]
Version Details
Version ID
f4563afd20429700e91f9717d9f4104b80987be25a2ad7304cf0f196804f3bef
Version Created
November 27, 2022
Run on Replicate →