add ocr vqa images #1458
You are a legend |
It's parquet, not jpg, so it cannot be used for training directly |
You misunderstand my pull request. Please check the changed readme file. The download link for the ocr_vqa images is https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images/resolve/main/ocr_vqa_images_llava_v15.zip?download=true. The link you mentioned is the original release. |
Dude, You are a Legend |
hero! |
Life saver! |
Super hero! |
@Victorwz Hello, is this data no longer available for download? |
Clicking the link should start the download directly. I just tried it and it worked fine. |
Thanks a lot! |
Thanks buddy! |
Most of the image download URLs in the ocr_vqa dataset are no longer available. Anyone who reruns the download script gets only a small portion of the ocr_vqa images used in the LLaVA-v1.5-665k instruction dataset. I packed all images from the original release into a single zip file. Everyone can easily download it and unzip it to their path of
./ocr_vqa/images
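For anyone who prefers to script the download and unzip step described above, here is a minimal Python sketch. The URL and the `./ocr_vqa/images` target are taken from this thread; the helper names `download_zip` and `extract_zip` and the local archive filename are hypothetical. It assumes the archive holds the image files at its top level. If it instead contains a wrapping folder, the extracted files may need to be moved up one level.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL quoted from the conversation above.
ZIP_URL = (
    "https://huggingface.co/datasets/weizhiwang/llava_v15_instruction_images"
    "/resolve/main/ocr_vqa_images_llava_v15.zip?download=true"
)


def download_zip(url: str, zip_path: Path) -> Path:
    """Download the archive to zip_path unless it is already present."""
    zip_path.parent.mkdir(parents=True, exist_ok=True)
    if not zip_path.exists():
        # Write to a temporary name first so an interrupted download
        # is not mistaken for a complete archive on the next run.
        partial = zip_path.with_name(zip_path.name + ".part")
        urllib.request.urlretrieve(url, str(partial))
        partial.replace(zip_path)
    return zip_path


def extract_zip(zip_path: Path, dest_dir: Path) -> int:
    """Extract every member of zip_path into dest_dir; return the file count."""
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
        return sum(1 for info in archive.infolist() if not info.is_dir())


if __name__ == "__main__":
    # Hypothetical local filename; the target directory is the one named above.
    archive_path = download_zip(ZIP_URL, Path("ocr_vqa_images_llava_v15.zip"))
    count = extract_zip(archive_path, Path("./ocr_vqa/images"))
    print(f"extracted {count} files")
```

The archive is kept on disk after extraction, so rerunning the script skips the download and only repeats the unzip step.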