vivalapanda/conceptual-image-to-image-1.5 🔢📝🖼️❓ → 🖼️

▶️ 1.1K runs 📅 Nov 2022 ⚙️ Cog 0.4.4 🔗 GitHub
image-style-transfer image-to-image stable-diffusion-1-5

About

Conceptual image-to-image model for Stable Diffusion 1.5

Example Output

Prompt:

"A psychedelic being living in an extradimensional reality, in the style of wlop, illustration, epic, fantasy, hyper detailed, smooth, unreal engine, sharp focus, ray tracing, physically based rendering, renderman, beautiful"

Output

Example outputExample outputExample outputExample output

Performance Metrics

21.83s Prediction Time
21.87s Total Time
All Input Parameters
{
  "prompt": "A psychedelic being living in an extradimensional reality, in the style of wlop, illustration, epic, fantasy, hyper detailed, smooth, unreal engine, sharp focus, ray tracing, physically based rendering, renderman, beautiful",
  "init_image": "https://replicate.delivery/pbxt/HrK08RvhHUeIwNLivLe6qUZUdH0UkuMWrJzJcNJnsLrgHcjA/qit.png",
  "captioning_model": "blip",
  "conceptual_image_strength": 0.47,
  "structural_image_strength": 0.16
}
Input Parameters
seed Type: integer
Random seed. Leave blank to randomize the seed
prompt Type: stringDefault:
Input prompt
init_image Type: string
Inital image to provide structural or conceptual guidance
captioning_model Default: blip
Captioning model to use. One of 'blip' or 'clip-interrogator-v1'
conceptual_image_strength Type: numberDefault: 0.4
Conceptual image strength. 0.0 doesn't use the image conceptually at all, 1.0 only uses the image concept and ignores the prompt.
structural_image_strength Type: numberDefault: 0.15
Structural (standard) image strength. 0.0 corresponds to full destruction of information, and does not use the initial image for structure.
Output Schema

Output

Type: arrayItems Type: stringItems Format: uri

Example Execution Logs
Using seed: 48525
Interrogating with ViT-L/14...
a picture of a cat in a space suit, a stock photo by Master MS, featured on reddit, space art, futuristic, sci-fi, stock photo
Captioning using blip
Image prompt: a picture of a cat in a space suit
0it [00:00, ?it/s]
2it [00:00,  8.19it/s]
3it [00:00,  7.03it/s]
4it [00:00,  6.57it/s]
5it [00:00,  6.32it/s]
6it [00:00,  6.17it/s]
7it [00:01,  6.07it/s]
8it [00:01,  6.02it/s]
9it [00:01,  5.97it/s]
10it [00:01,  5.95it/s]
11it [00:01,  5.94it/s]
12it [00:01,  5.93it/s]
13it [00:02,  5.92it/s]
14it [00:02,  5.91it/s]
15it [00:02,  5.90it/s]
16it [00:02,  5.89it/s]
17it [00:02,  5.89it/s]
18it [00:02,  5.87it/s]
19it [00:03,  5.90it/s]
20it [00:03,  5.90it/s]
21it [00:03,  5.90it/s]
22it [00:03,  5.90it/s]
23it [00:03,  5.88it/s]
24it [00:03,  5.89it/s]
25it [00:04,  5.89it/s]
26it [00:04,  5.89it/s]
27it [00:04,  5.89it/s]
28it [00:04,  5.90it/s]
29it [00:04,  5.89it/s]
30it [00:04,  5.89it/s]
31it [00:05,  5.89it/s]
32it [00:05,  5.88it/s]
33it [00:05,  5.89it/s]
34it [00:05,  5.89it/s]
35it [00:05,  5.89it/s]
36it [00:06,  5.89it/s]
37it [00:06,  5.87it/s]
38it [00:06,  5.89it/s]
39it [00:06,  5.89it/s]
40it [00:06,  5.89it/s]
41it [00:06,  5.89it/s]
42it [00:07,  5.90it/s]
43it [00:07,  5.87it/s]
44it [00:07,  5.90it/s]
45it [00:07,  5.90it/s]
46it [00:07,  5.90it/s]
47it [00:07,  5.90it/s]
48it [00:08,  5.90it/s]
49it [00:08,  5.88it/s]
50it [00:08,  5.89it/s]
51it [00:08,  5.89it/s]
51it [00:08,  5.96it/s]
Version Details
Version ID
738154b934ddc51f3828f9ef34b500e40f4122018e669d95d25a2b26574fd206
Version Created
November 27, 2022
Run on Replicate →