🤖 Model 🖼️ → 📝

w95/tinyclick
Automate GUI actions from a screenshot and a natural-language command. Takes a GUI screenshot image and a text instructi...
Found 2 models (showing 1-2)
Automate GUI actions from a screenshot and a natural-language command. Takes a GUI screenshot image and a text instructi...
Segment objects in a video from natural-language instructions. Takes a video and a text prompt (referring expression) an...