Skip to content

Latest commit

 

History

History
41 lines (27 loc) · 1.98 KB

README.md

File metadata and controls

41 lines (27 loc) · 1.98 KB

Grand Théâtre de Québec Website data pipelines

https://grandtheatre.qc.ca

Mapping to Artsdata event types

This repo contains a manually created taxonomy gtq-event-type-mapping.ttl to map strings from the original grandtheatre.qc.ca website to Artsdata event types. To push the taxonomy to Artsdata you can manually run the Github workflow in this repo.

Check for missing event type mapping using this all keywords or event type mapping. If there is no closeMatch or relatedMatch then one should be added. To look for Artsdasta Event Types you can use this Artsdata event types query. Edit the mappings in gtq-event-type-mapping.ttl and pushed to Artsdata using the automatic workflow in this repo.

Input to Artsdata

This website is crawled by an agent on the Artsdata Huginn platform.

Here is the summary of the GTQ pipeline in Huginn:

  1. Crawl events listed on https://grandtheatre.qc.ca/programmation/
  2. Extract JSON-LD from each webpage
  3. Transform with the following SPARQLs 'remove-blanks', 'fix-schemaorg-https-objects', 'fix-wikidata-uri', 'add-artsdata-uri-using-wikidata-bridge', 'fix-schemaorg-date-datatype', 'create-eventseries', 'copy-subevent-data-to-eventseries', 'fix-isni', 'add-artsdata-uri-using-isni-bridge', 'collapse_duplicate_contact_point_blanknodes'
  4. Load graph into Artsdata using Artsdata Databus API

Compare event images between original website and Artsdata

https://api.artsdata.ca/events?source=http://kg.artsdata.ca/culture-creates/huginn/derived-grandtheatre-qc-ca

https://grandtheatre.qc.ca/programmation/

Output from Artsdata

Nothing is outputed from Artsdata for use by GTQ at this time.