discover the link from wfs:getcapabilities to metadata-link-content #2

pvgenuchten · 2015-11-17T09:28:48Z

A WFS GetCapabilities document typically contains a number of links to metadata content. When a crawler accesses a new GetCapabilities document via the WFS proxy, it will discover metadata via those links. That metadata may or may not already be available in a coupled catalogue, if one exists. A process should be able to fetch that metadata and make it crawlable for the search engine (by converting it to schema.org).
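As a rough illustration of the discovery step, the sketch below pulls the metadata links out of a capabilities document that has already been fetched. It assumes the links sit in `MetadataURL` elements under each `FeatureType`, either as an `xlink:href` attribute (the WFS 2.0 style, as far as I know) or as element text (the WFS 1.1 style). The function names are hypothetical and not part of any existing proxy code.

```python
import xml.etree.ElementTree as ET

# Attribute name as ElementTree reports it for xlink:href.
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def extract_metadata_links(capabilities_xml):
    """Return (feature type name, metadata URL) pairs found in the document.

    Matching is done on local names only, so the same code works for
    different WFS namespace versions (an assumption of this sketch).
    """
    root = ET.fromstring(capabilities_xml)
    links = []
    for element in root.iter():
        if local_name(element.tag) != "FeatureType":
            continue
        name = None
        urls = []
        for child in element:
            kind = local_name(child.tag)
            if kind == "Name":
                name = (child.text or "").strip()
            elif kind == "MetadataURL":
                # Prefer the xlink:href attribute, fall back to element text.
                url = child.get(XLINK_HREF) or (child.text or "").strip()
                if url:
                    urls.append(url)
        links.extend((name, url) for url in urls)
    return links
```

Each returned URL is a candidate metadata record that the crawler or a background process could then fetch and convert.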

Two approaches:

1. A CSW proxy may convert the ISO 19139 document to schema.org on the fly and expose it to the crawler at a URL of the form http://{csw-proxy}?url={full-csw-request-as-get}
2. A process verifies whether the CSW document is in the coupled catalogue. If so, it exposes that document to the crawler at a URL of the form http://{catalogue}/dataset/{namespace}/{uuid}. If the catalogue doesn't contain the document, the process triggers a harvest of that document into the catalogue. (Open question: how will the catalogue later synchronise that resource?)
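To make the two options easier to compare, here is a minimal sketch of how each one might build the address handed to the crawler. Everything here is an assumption for discussion: the function names, the `in_catalogue` and `trigger_harvest` callables, percent-encoding the embedded CSW request, and returning nothing while a harvest is still pending are not decided anywhere in this issue.

```python
from urllib.parse import quote


def proxy_url(csw_proxy, csw_request):
    """Approach 1: wrap the full CSW GET request in a proxy URL.

    The request is percent-encoded so its own '?' and '&' do not clash
    with the proxy's query string (an assumption of this sketch).
    """
    return f"http://{csw_proxy}?url={quote(csw_request, safe='')}"


def catalogue_url(catalogue, namespace, uuid):
    """Approach 2: address of the record inside the coupled catalogue."""
    return f"http://{catalogue}/dataset/{namespace}/{uuid}"


def resolve_via_catalogue(catalogue, namespace, uuid, source_url,
                          in_catalogue, trigger_harvest):
    """Return the catalogue URL if the record exists, else request a harvest.

    in_catalogue(uuid) -> bool and trigger_harvest(source_url) are
    hypothetical callables standing in for the real catalogue API.
    """
    if in_catalogue(uuid):
        return catalogue_url(catalogue, namespace, uuid)
    # Record is missing: ask the catalogue to harvest it. Nothing is
    # exposed to the crawler until the harvest has completed.
    trigger_harvest(source_url)
    return None
```

The first approach needs no catalogue state but produces long, opaque addresses. The second gives stable addresses but leaves the synchronisation question above open.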
