GTQ scrape HTML descriptions #21

Open
saumier opened this issue Dec 14, 2023 · 0 comments

saumier commented Dec 14, 2023

The descriptions in the JSON-LD on GTQ have no layout, and text from different paragraphs is sometimes run together, with no space at the end of each paragraph.

Instead of using the JSON-LD, the HTML from the event description should be scraped.

See discussion #19

The problem is that line breaks are lost in the description. For example, the passage ...sa génération.À propos de Chansons hivernalesChansons hivernales... needs line breaks.
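A minimal sketch of how the HTML description could be turned into text that keeps its paragraph breaks. It uses only Python's standard `html.parser`. The names `ParagraphExtractor`, `html_to_text`, and `BLOCK_TAGS` are hypothetical, and the set of tags treated as paragraph boundaries is an assumption, since the actual markup of the GTQ event pages is not shown in this issue.

```python
from html.parser import HTMLParser

# Assumption: these tags mark paragraph boundaries in the event description.
# Adjust the set once the real GTQ markup has been inspected.
BLOCK_TAGS = {"p", "div", "br", "li", "h1", "h2", "h3", "h4", "h5", "h6"}


class ParagraphExtractor(HTMLParser):
    """Collects text and starts a new paragraph at every block-level tag."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.paragraphs = []
        self._buffer = []

    def flush(self):
        # Collapse runs of whitespace and keep only non-empty paragraphs.
        text = " ".join("".join(self._buffer).split())
        if text:
            self.paragraphs.append(text)
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        self._buffer.append(data)


def html_to_text(html):
    """Return the description text with a blank line between paragraphs."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    parser.flush()  # keep any trailing text that follows the last block tag
    return "\n\n".join(parser.paragraphs)
```

Called on the scraped description fragment, `html_to_text` would return the paragraphs separated by blank lines instead of running them together. Inline tags such as `<strong>` are not in `BLOCK_TAGS`, so they do not introduce a break.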
