Question about images.csv #4

qimingfeijin · 2019-02-18T12:07:31Z

Running download_images.py fails with the error No such file or directory: 'images.csv'. How can I fix this?

lars76 · 2019-02-19T19:37:48Z

Hi,

Instead of running download_images.py, just use the COCO dataset. It is much smaller, and for OCR you don't actually need that many images. You can download 5K images directly here: http://images.cocodataset.org/zips/val2017.zip. Then you don't need download_images.py at all.

Hope this helps.
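For readers who want to script the suggestion above, here is a minimal sketch that fetches the archive and unpacks only the JPEG files. The helper names (download, extract_images) and the local paths are hypothetical and are not part of this repository; only the URL comes from the comment above.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL taken from the comment above; everything else here is illustrative.
COCO_VAL_URL = "http://images.cocodataset.org/zips/val2017.zip"


def download(url, dest):
    """Download url to dest, skipping the transfer if dest already exists."""
    dest = Path(dest)
    if not dest.exists():
        urllib.request.urlretrieve(url, dest)
    return dest


def extract_images(zip_path, out_dir):
    """Extract only the .jpg members of the archive and return their paths."""
    out_dir = Path(out_dir)
    with zipfile.ZipFile(zip_path) as archive:
        names = [n for n in archive.namelist() if n.lower().endswith(".jpg")]
        archive.extractall(out_dir, members=names)
    return sorted(out_dir / n for n in names)


# Example usage (hypothetical local paths; the download is large):
#   archive = download(COCO_VAL_URL, "val2017.zip")
#   images = extract_images(archive, "data")
#   print(f"extracted {len(images)} images")
```

The resulting image folder can then stand in for whatever download_images.py would have produced; how the training code expects that folder to be named is not stated in this thread, so check the repository's scripts.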

qimingfeijin · 2019-02-20T02:04:17Z

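Thank you for your help and for sharing. I want to do Chinese text detection, so I need some Chinese images for training and testing. Where did you download your Chinese dataset?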

lars76 · 2019-02-20T22:54:24Z

I generated the dataset myself from a subtitle file (srt) and then annotated it manually. I don't think there are any datasets that you can download.
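As a rough illustration of the first step, an srt file can be read into timed cues with the standard library alone. This is only a sketch under assumptions: the function names are hypothetical, and how the frames were actually extracted and annotated is not described in this thread.

```python
import re
from datetime import timedelta

# SRT timestamps look like 00:01:02,345 (some files use a dot instead).
TIMESTAMP = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def parse_timestamp(text):
    """Convert an SRT timestamp string into a timedelta."""
    match = TIMESTAMP.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not an SRT timestamp: {text!r}")
    hours, minutes, seconds, millis = map(int, match.groups())
    return timedelta(hours=hours, minutes=minutes,
                     seconds=seconds, milliseconds=millis)


def parse_srt(content):
    """Return a list of (start, end, text) tuples, one per subtitle cue."""
    content = content.lstrip("\ufeff").replace("\r\n", "\n")
    cues = []
    # Cues are separated by blank lines.
    for block in re.split(r"\n\s*\n", content.strip()):
        lines = block.split("\n")
        # The timing line is the one containing the arrow.
        index = next((i for i, line in enumerate(lines) if "-->" in line), None)
        if index is None:
            continue  # skip malformed blocks
        start, end = lines[index].split("-->", 1)
        # Anything after the end time (position hints) is ignored.
        end = end.split()[0]
        text = "\n".join(lines[index + 1:]).strip()
        cues.append((parse_timestamp(start), parse_timestamp(end), text))
    return cues
```

Each cue gives the text that should be visible and the time window in which it appears, which is presumably where video frames would be sampled before annotating the text boxes by hand.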

Most papers actually generate their own training/test images by rendering random text onto images. Look at this GitHub project: https://github.com/JarveeLee/SynthText_Chinese_version. The corresponding paper is described here: https://blog.csdn.net/u010167269/article/details/52389676. I tried something similar myself, and it produced results equal to or better than those from a real dataset.

qimingfeijin · 2019-02-21T01:38:05Z

I understand. Thank you for sharing.

wushilian · 2019-05-06T06:41:23Z

@lars76 Could you share your method for synthesizing the dataset?
