Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some image links are not extracted #388

Open
NicolasSandoval opened this issue Apr 21, 2021 · 0 comments
Open

Some image links are not extracted #388

NicolasSandoval opened this issue Apr 21, 2021 · 0 comments

Comments

@NicolasSandoval
Copy link
Member

Description

Image links like maps with PNG format are left out.

2021-04-17 17:35:45,509  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:48,371  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:49,997  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:53,665  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,40.4308,-3.7031,266x250.png?lang=es'
2021-04-17 17:35:54,902  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,9,40.67,-73.94,275x250.png?lang=es'
2021-04-17 17:35:55,537  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-34.599722222222,-58.381944444444,300x300.png?lang=es&domain=es.wikipedia.org&title=Buenos+Aires&groups=_1f2405ce1888c1041823dcda962c8595b0609749'
2021-04-17 17:35:55,890  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,701  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:56,702  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:57,699  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:35:59,303  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,48.8565,2.345,266x266.png?lang=es'
2021-04-17 17:36:02,625  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,35.6879633,139.7690085,275x250.png?lang=es'
2021-04-17 17:36:05,072  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/ImageMap/resources/desc-20.png?15600
2021-04-17 17:36:06,057  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,41.388,2.168,266x250.png?lang=es'
2021-04-17 17:36:09,118  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,10,-33.44889,-70.669265,275x250.png?lang=es'
2021-04-17 17:36:11,183  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,12,16.753056,-93.115556,255x255.png?lang=es'
2021-04-17 17:36:14,544  images.extract       WARNING  Unsupported image type. Won't be included: 'https://maps.wikimedia.org/img/osm-intl,11,25.6715,-100.3085,200x200.png?lang=es'

and

2021-04-20 14:04:50,462  images.extract       WARNING  Unsupported image with GET args. Won't be included: extensions/wikihiero/img/hiero_G1.png?4d556

Possible Solution

Links that not start with maps are excluded by ImageParser.replace of the extract.py module, perhaps such links could be included.

Another point to consider is why some links do not pass if they have the ? symbol without having lang after.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant