discover the link from wfs:getcapabilities to metadata-link-content #2

Open
pvgenuchten opened this issue Nov 17, 2015 · 0 comments

A WFS GetCapabilities document typically contains a number of links to metadata content. When a crawler accesses a new GetCapabilities document via the WFS proxy, it will discover metadata through those links. That metadata may or may not already be available in a coupled catalogue, if there is one. A process should be able to grab that metadata and make it crawlable for the search engine (convert it to schema.org).

Two approaches:

  • A CSW proxy may convert the ISO 19139 document to schema.org on the fly and expose it to the crawler at a URL of the form http://{csw-proxy}?url={full-csw-request-as-get}
  • A process verifies whether the CSW document is in the coupled catalogue and, if so, exposes that document to the crawler at a URL of the form http://{catalogue}/dataset/{namespace}/{uuid}. If the catalogue doesn't contain the document, the process triggers a harvest of that document into the catalogue (open question: how will the catalogue later synchronise that resource?)
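
To make the two options a bit more concrete, here is a rough Python sketch of the discovery step and of the two URL shapes. It is only an illustration under assumptions: the function names (`metadata_links`, `proxy_url`, `catalogue_url`) are hypothetical, and it assumes the metadata links appear as `MetadataURL` elements, either with an `xlink:href` attribute (WFS 2.0 style) or with the link as element text (WFS 1.1 style). The ISO 19139 to schema.org conversion and the harvest trigger are not covered.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

XLINK_HREF = "{http://www.w3.org/1999/xlink}href"


def metadata_links(capabilities_xml):
    """Return the metadata links found in a GetCapabilities document.

    Assumption: links live in elements whose local name is MetadataURL,
    as an xlink:href attribute or as the element's text content.
    """
    root = ET.fromstring(capabilities_xml)
    links = []
    for el in root.iter():
        # Compare on the local name so any WFS namespace version matches.
        if el.tag.rsplit("}", 1)[-1] != "MetadataURL":
            continue
        href = el.get(XLINK_HREF) or (el.text or "").strip()
        if href and href not in links:
            links.append(href)
    return links


def proxy_url(csw_proxy, metadata_url):
    """Approach 1: http://{csw-proxy}?url={full-csw-request-as-get}."""
    # The full CSW request is percent-encoded into a single query parameter.
    return csw_proxy + "?" + urlencode({"url": metadata_url})


def catalogue_url(catalogue, namespace, uuid):
    """Approach 2: http://{catalogue}/dataset/{namespace}/{uuid}."""
    return f"{catalogue}/dataset/{namespace}/{uuid}"
```

A crawler-facing process could call `metadata_links()` on each new capabilities response and then hand out either `proxy_url(...)` or, once the record is confirmed to be in the catalogue, `catalogue_url(...)`.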