pku-yuangroup/llava-cot 🖼️🔢📝 → 📝
About
Let Vision Language Models Reason Step-by-Step

Example Output
"how to make this pastry"
Output
Begin by preparing the nuts. You can use either walnuts, pistachios, or a combination of both. Roast the nuts in a 350°F oven for about 10 minutes until they are fragrant and slightly toasted. Let them cool down.
In a large bowl, combine the cooled nuts with the sugar and cinnamon. Mix well until the nuts are evenly coated with the sugar and cinnamon mixture.
To assemble the baklava, preheat your oven to 350°F. Take a rectangular tray or baking dish and line the bottom with a sheet of phyllo dough. Brush the phyllo with melted butter, then repeat this process until you have about 8-10 layers of phyllo dough.
Once you've assembled the phyllo dough layers, spread the nut mixture evenly over the top layer of phyllo.
Carefully roll the baklava tightly from one end of the tray to the other, applying gentle pressure to ensure it's compact. Use a knife or sharp object to cut the baklava into diamond shapes.
Place the baklava in the preheated oven and bake for about 45 minutes, or until the phyllo is golden brown and crisp.
Remove the baklava from the oven and let it cool for a few minutes. While it's still warm, drizzle the honey syrup evenly over the top of the baklava.
Let the baklava cool completely before serving. This will allow the flavors to meld together and the pastry to absorb the honey syrup.
Enjoy your delicious homemade baklava!
All Input Parameters
{ "image": "https://replicate.delivery/pbxt/M4VFa6E18it1vazUahiTB5RjNjDoajLbHcpMgMFBhJvmgGdh/Baklava%281%29.png", "top_p": 0.95, "prompt": "how to make this pastry", "temperature": 0.9, "max_new_tokens": 1024 }
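A request with these parameters could be sent through the Replicate Python client roughly as follows. This is a minimal sketch, assuming the `replicate` package is installed and a `REPLICATE_API_TOKEN` is set in the environment. `build_input` and `run_llava_cot` are hypothetical helper names introduced for illustration, and the version hash is the one listed under Version Details.

```python
MODEL = "pku-yuangroup/llava-cot"
VERSION = "40c17578a6b98679fc08e1a92fcef1b8000f857bf300a381091a13a67c28754d"


def build_input(image, prompt, top_p=0.95, temperature=0.9, max_new_tokens=1024):
    """Assemble the input payload (hypothetical helper, not part of any API).

    Defaults mirror the example request on this page, not documented model defaults.
    """
    if not image:
        raise ValueError("image is required")
    return {
        "image": image,
        "top_p": top_p,
        "prompt": prompt,
        "temperature": temperature,
        "max_new_tokens": max_new_tokens,
    }


def run_llava_cot(payload):
    """Send the payload to Replicate; needs network access and an API token."""
    import replicate  # imported lazily so build_input works without the package

    output = replicate.run(f"{MODEL}:{VERSION}", input=payload)
    # Assumption: the output is either a string or an iterable of text chunks.
    if isinstance(output, str):
        return output
    return "".join(str(chunk) for chunk in output)


# Build the same payload as the example request above (no network call here).
payload = build_input(
    image="https://replicate.delivery/pbxt/M4VFa6E18it1vazUahiTB5RjNjDoajLbHcpMgMFBhJvmgGdh/Baklava%281%29.png",
    prompt="how to make this pastry",
)
```

Calling `run_llava_cot(payload)` would then submit the request. The return type handling is a guess, since the output schema is not shown on this page.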
Input Parameters
- image (required): Grayscale input image
- top_p: When decoding text, samples from the smallest set of most likely tokens whose cumulative probability reaches p; lower it to ignore less likely tokens. Only used when temperature > 0.
- prompt: Text prompt
- temperature: Adjusts the randomness of outputs. 0 is deterministic and values above 1 are increasingly random; 0.75 is a good starting value.
- max_new_tokens: Maximum number of generated tokens
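To illustrate how `temperature` and `top_p` interact, here is a generic sketch of temperature scaling followed by top-p (nucleus) filtering over a small list of token scores. It is not the decoding code this model actually runs; `sample_token` and its arguments are hypothetical names chosen for the example.

```python
import math
import random


def sample_token(logits, temperature=0.75, top_p=0.95, rng=random):
    """Pick one token index from raw scores using temperature and top-p sampling."""
    if temperature <= 0:
        # Temperature 0 is treated as greedy decoding: always take the best token.
        return max(range(len(logits)), key=lambda i: logits[i])

    # Dividing by a temperature below 1 sharpens the distribution; above 1 flattens it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]  # subtract the max for stability
    total = sum(weights)
    probs = [w / total for w in weights]

    # Top-p: keep the most likely tokens until their cumulative probability reaches top_p.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # random.choices treats the kept probabilities as relative weights.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]
```

With `temperature=0` the function always returns the highest-scoring index, and lowering `top_p` shrinks the set of candidates that can be drawn.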
Output Schema
Output
Version Details
- Version ID: 40c17578a6b98679fc08e1a92fcef1b8000f857bf300a381091a13a67c28754d
- Version Created: December 2, 2024