w95/tinyclick 📝🖼️ → ❓

▶️ 28 runs 📅 Apr 2025 ⚙️ Cog 0.14.6 🔗 GitHub ⚖️ License
gui-automation image-object-detection image-to-action image-to-text visual-grounding

About

TinyClick: Single-Turn Agent for Empowering GUI Automation

Example Output

Output

{"action":"click","click_point":[133,538],"execution_time_seconds":0.72}

Performance Metrics

0.79s Prediction Time
20.64s Total Time
All Input Parameters
{
  "text": "click on accept and continue button",
  "image": "https://huggingface.co/Samsung/TinyClick/resolve/main/sample.png"
}
Input Parameters
text (required) Type: string
Command to perform on the GUI screenshot
image (required) Type: string
GUI screenshot image
Output Schema

Output

Type: object

Example Execution Logs
/root/.pyenv/versions/3.11.10/lib/python3.11/site-packages/transformers/generation/utils.py:1220: UserWarning: Using the model-agnostic default `max_length` (=20) to control the generation length. We recommend setting `max_new_tokens` to control the maximum length of the generation.
warnings.warn(
Version Details
Version ID
727421b20ac9cddc5ba2b591115edb56e27b5313b988d1e9c560cf47a9d778c4
Version Created
April 21, 2025
Run on Replicate →