Add JSON-LD from capitol.nb to Artsdata.ca #56

Open
tammy-culture opened this issue Aug 15, 2024 · 14 comments

@tammy-culture

tammy-culture commented Aug 15, 2024

Please grab JSON-LD from the following website (https://capitol.nb.ca/en/tickets-events) and add it to Artsdata.ca.

The reason is that our client ArtscultureNB would like to include these events in their Production CMS.

@saumier
Member

saumier commented Aug 19, 2024

@tammy-culture I took a look and this website will be difficult at the moment because of the way the JSON-LD was added. There is nothing wrong with the way they added their JSON-LD; it's only that it uses an approach we don't support yet. I will assign this to @dev-aravind so he can start working on it.

@dev-aravind Can you design a technique to crawl this site with a headless browser? The JSON-LD is not in the web page source but is added by JavaScript in the browser, so it only appears after the page has loaded and the JavaScript on the page has executed. We can discuss. I also propose you use the Orion repo and make the choice of how to crawl an option.
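As a rough illustration of the technique described above (not the Orion implementation), the sketch below separates the two steps: rendering the page in a headless browser, then pulling the `application/ld+json` script blocks out of the rendered HTML. The function names are hypothetical, and `render_page` assumes the third-party Playwright package is installed; the extraction step uses only the standard library.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def extract_jsonld(html):
    """Return the parsed JSON-LD objects found in an HTML string."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()
    parsed = []
    for raw in collector.blocks:
        try:
            parsed.append(json.loads(raw))
        except json.JSONDecodeError:
            continue  # skip malformed blocks rather than abort the crawl
    return parsed


def render_page(url):
    """Load a page in headless Chromium and return the DOM after scripts ran.

    Assumption: the Playwright package and its Chromium build are installed.
    """
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        # Wait until network activity settles so injected JSON-LD is present.
        page.goto(url, wait_until="networkidle")
        html = page.content()
        browser.close()
    return html
```

A crawl of one page would then be `extract_jsonld(render_page(url))`. Keeping rendering separate from extraction is one way to make the choice of crawler an option, since a plain HTTP fetch could feed the same extractor.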

@saumier saumier assigned dev-aravind and unassigned saumier Aug 19, 2024
@saumier saumier transferred this issue from culturecreates/footlight-app Aug 20, 2024
@saumier saumier changed the title Add the JSON-LD from event website to Artsdata.ca Add JSON-LD from capitol.nb to Artsdata.ca Aug 20, 2024
@saumier
Member

saumier commented Aug 23, 2024

@dev-aravind Please start with only the first page of events.

@dev-aravind
Contributor

@saumier The data is now available in Artsdata under the artifact name capitolnb-ca

@dev-aravind dev-aravind assigned saumier and unassigned dev-aravind Aug 26, 2024
@dev-aravind dev-aravind moved this from In Progress to In Review in Artsdata Aug 26, 2024
@tammy-culture
Author

tammy-culture commented Aug 26, 2024

@saumier Question: Is "capitolnb-ca" ready to be added as an aggregator (acapitolnb-ca) to the ArtscultureNB Calendar in Footlight CMS?

@saumier
Member

saumier commented Aug 26, 2024

@tammy-culture No. Not yet. There are some little things to fix.

@dev-aravind Please work on the following:

  • include the derived from add_derived_from.sparql
  • convert the location to a full location for Capitol Theatre with PostalAddress (this will be tricky, but all events are at the same place, http://kg.artsdata.ca/resource/K11-116, so the location could be passed somehow to the main code as a URI).
  • Fix the name of the artifact to be capitol-nb-ca
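One possible shape for the location step in the list above, as a minimal sketch: the place URI is passed in as a parameter and each event's `location` is replaced by a JSON-LD node reference to it. The function name is hypothetical, and the sketch assumes the Artsdata entity at that URI already carries the full place details, including the PostalAddress.

```python
import copy

# Place URI taken from the comment above.
CAPITOL_THEATRE_URI = "http://kg.artsdata.ca/resource/K11-116"


def set_location_uri(events, place_uri):
    """Return copies of the events whose location points at place_uri."""
    updated = []
    for event in events:
        event = copy.deepcopy(event)  # leave the crawled data untouched
        # A JSON-LD node reference: the Place details live in the graph.
        event["location"] = {"@id": place_uri}
        updated.append(event)
    return updated
```

Called as `set_location_uri(events, CAPITOL_THEATRE_URI)`, this keeps the crawler itself generic, because the venue is supplied by the caller rather than hard-coded.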

@saumier saumier assigned dev-aravind and unassigned saumier Aug 26, 2024
@saumier saumier moved this from In Review to Todo in Artsdata Aug 26, 2024
@dev-aravind dev-aravind moved this from Todo to In Progress in Artsdata Aug 27, 2024
@dev-aravind
Contributor

@saumier These issues are now fixed in artsdata. You can find a sample event here.

@dev-aravind dev-aravind assigned saumier and unassigned dev-aravind Aug 27, 2024
@dev-aravind dev-aravind moved this from In Progress to In Review in Artsdata Aug 27, 2024
@tammy-culture
Author

@saumier Thank you. Please let me know when I can put in a request to add a capitol-nb-ca aggregator.

@saumier
Member

saumier commented Aug 27, 2024

Status:

  • Only the first page of events is crawled (about 12) because of the technical difficulty of changing pages with our crawler. They have about 7 pages. Since we crawl each week, we will get all events over time as they pass through the first page.
  • Only events with single dates have JSON-LD. Their event series don’t yet have JSON-LD (example https://capitol.nb.ca/en/tickets-events/louis-jose-houde-3), so we aren’t loading those.
  • All events are assigned the location Capitol Theatre (http://kg.artsdata.ca/resource/K11-116) even though some events are at Empress Theatre (example: https://capitol.nb.ca/en/tickets-events/lucy-macneil), which we also have as K11-166. Inside Capitol Theatre we have 4 places. To see all 4 you can visit K11-116 and scroll down to “containsPlace”.

@saumier
Member

saumier commented Aug 27, 2024

@tammy-culture There is another thing with importing capitol.nb.ca. The auto-minting only works right now with CMS and Footlight because those events have URIs. The events on capitol.nb.ca do not have URIs (they are temporary and change with each load).

I have turned off the crawl schedule until I can work on auto-minting/linking with this type of website. I will mint the 11 events that are currently there so they can be loaded into CMS.
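To illustrate the problem of events without stable URIs, here is one hypothetical way to derive a repeatable identifier from data that does not change between page loads. This is only a sketch, not how Artsdata mints or links entities: the namespace `BASE`, the function name, and the choice of `url` plus `startDate` as the key are all assumptions.

```python
import hashlib

# Hypothetical namespace, used only for illustration.
BASE = "http://example.org/capitol-nb-ca/event/"


def stable_event_uri(event, base=BASE):
    """Build a deterministic URI from the event's page URL and start date."""
    url = event.get("url", "").strip()
    start = event.get("startDate", "").strip()
    if not url or not start:
        raise ValueError("event needs both 'url' and 'startDate'")
    # The same inputs always hash to the same value, so the identifier
    # survives repeated crawls even when the page's own node IDs change.
    digest = hashlib.sha256(f"{url}|{start}".encode("utf-8")).hexdigest()[:16]
    return base + digest
```

With an identifier like this, the same event seen in two weekly crawls would resolve to one URI, which is the property that linking needs.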

@saumier
Member

saumier commented Aug 27, 2024

@tammy-culture Please request (create an issue) that http://kg.artsdata.ca/culture-creates/artsdata-orion/capitol-nb-ca gets loaded into CMS.

@saumier saumier closed this as completed Aug 27, 2024
@github-project-automation github-project-automation bot moved this from In Review to Done in Artsdata Aug 27, 2024
@saumier
Member

saumier commented Sep 9, 2024

@dev-aravind I am reopening this issue because we need more events than only those on the home page. Can you explore different ways to get all their events? For example, clicking on a month in the calendar or a pagination approach.

I set the priority to high because Tammy's client ArtsCultureNB needs to launch this client by end of October.

@saumier saumier moved this from Done to Todo in Artsdata Sep 9, 2024
@saumier saumier removed their assignment Sep 9, 2024
@dev-aravind dev-aravind reopened this Sep 10, 2024
@github-project-automation github-project-automation bot moved this from In Progress to Todo in Artsdata Sep 10, 2024
@dev-aravind
Contributor

@saumier All the events from this year are crawled and available in artsdata right now here.

@dev-aravind dev-aravind assigned saumier and unassigned dev-aravind Sep 10, 2024
@dev-aravind dev-aravind moved this from In Progress to In Review in Artsdata Sep 10, 2024
@tammy-culture
Author

tammy-culture commented Sep 10, 2024

As per our stand-up meeting this morning, we are able to grab approximately 3 to 4 months of upcoming events. This is progress from the 10 events we were previously able to grab. We can now grab approximately 30 upcoming events out of the approx. 72 total upcoming events. Tammy to mention to client.

@saumier saumier assigned tammy-culture and unassigned saumier Sep 10, 2024
@saumier
Member

saumier commented Dec 18, 2024

@tammy-culture There are 62 events from capitol.nb.ca in Artsdata now. Can you confirm this is all the events on their website and not just part of the events?

BTW, I still have some more work to do so these events with temporary URIs can be minted.

Labels
None yet
Projects
Status: Todo
Development

No branches or pull requests

3 participants