output multiple coordinate points #961

sms-s · 2025-03-17T13:01:49Z

How should I prompt qwen-2.5-VL to output multiple coordinate points?
With this prompt: prompt = "point to the rolling pin on the far side of the table, output its coordinates in XML format object", it can only output a single coordinate point.

xin-li-67 · 2025-03-19T07:06:50Z

Try using the sample codes in the spatial understanding notebook in the cookbooks folder.

sms-s · 2025-03-19T14:11:09Z

I'm following the format of the cookbooks， but it only outputs a single point. I want multiple coordinate points to point to the object.Do you have any other suggestions?

Try using the sample codes in the spatial understanding notebook in the cookbooks folder.

xin-li-67 · 2025-03-20T01:43:29Z

I'm following the format of the cookbooks， but it only outputs a single point. I want multiple coordinate points to point to the object.Do you have any other suggestions?

Try using the sample codes in the spatial understanding notebook in the cookbooks folder.

Well, I've never tried the points example. For bbox coordinates scenes, I can get multiple returns.

HumanZhong · 2025-03-21T08:13:51Z

@sms-s
Hi, you may try this prompt:

If the model still outputs only one point, you can also try:

add "every/one by one" such kind of words into your prompts.
try using JSON format

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

output multiple coordinate points #961

output multiple coordinate points #961

sms-s commented Mar 17, 2025

xin-li-67 commented Mar 19, 2025

sms-s commented Mar 19, 2025

xin-li-67 commented Mar 20, 2025

HumanZhong commented Mar 21, 2025

output multiple coordinate points #961

output multiple coordinate points #961

Comments

sms-s commented Mar 17, 2025

xin-li-67 commented Mar 19, 2025

sms-s commented Mar 19, 2025

xin-li-67 commented Mar 20, 2025

HumanZhong commented Mar 21, 2025