camenduru/ml-mgie 🔢📝🖼️ → ❓

▶️ 6.9K runs 📅 Feb 2024 ⚙️ Cog 0.9.4 🔗 GitHub 📄 Paper ⚖️ License

image-editing large-language-model multimodal text-guided-editing

Performance

12.3sTypical run time

~87sCold start (first call)

6.9KTotal runs

About

Guiding Instruction-based Image Editing via Multimodal Large Language Models

Example Output

Prompt:

"make the frame red"

Output

{"path":"https://replicate.delivery/pbxt/LkDA2ecYabWSdSnQjomeAewvXaPjB5jZMFXIsOZnffshAVpSC/image.png","text":"If the frame of the glasses in the image were made red, the overall appearance of the scene would change significantly.The red frame would draw more attention to the glass and create a stronger contrast with the black frame."}

Performance Metrics

12.28s Prediction Time

87.24s Total Time

All Input Parameters

{
  "seed": 13331,
  "prompt": "make the frame red",
  "text_cfg": 7.5,
  "image_cfg": 1.5,
  "input_image": "https://replicate.delivery/pbxt/KNSKXP6DiykiZn7bEsZoZiaxGmE5o90BSUbDr67KrbOZcAvc/_input_0.jpg"
}

Input Parameters

seed Type: integerDefault: 13331
prompt Type: stringDefault: make the frame red
text_cfg Type: numberDefault: 7.5
image_cfg Type: numberDefault: 1.5
input_image (required) Type: string: Input Image

Output Schema

path Type: stringFormat: uri: Path
text Type: string: Text

Example Execution Logs

/usr/local/lib/python3.10/site-packages/transformers/generation/utils.py:1211: UserWarning: You have modified the pretrained model configuration to control generation. This is a deprecated strategy to control generation and will be removed soon, in a future version. Please use a generation configuration file (see https://huggingface.co/docs/transformers/main_classes/text_generation)
warnings.warn(

Version Details

Version ID: cd6688b06dcdcf8b6c614abe400d37d40d85b9e07e438396582a1721686667b7
Version Created: February 10, 2024

Run on Replicate →