Improve performance of importing labels #92

saumier · 2023-10-03T17:44:00Z

Two SPARQL queries load missing labels for objects, qualifiers and references. Currently the labels are all dumped into the default graph, and the queries are very slow. The "load labels" step launches separate SPARQL queries for objects and for qualifiers/references. They took 20 minutes for 3 productions + tertiary nodes.

I propose saving labels in a shared named graph, and optimizing the SPARQL by removing some filters and adding more performant ones to minimize the calls to Wikidata.

For example, improve the removal of entities that already have labels in the default graph (or another named graph) before doing a federated SPARQL query to Wikidata:

    filter not exists {
        graph <http://www.ontotext.com/explicit> {
            ?o rdfs:label ?b .
        }
    }
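To make the proposal concrete, here is a rough sketch of how the filter above could sit inside a single update that writes into a shared named graph. It is not the project's actual query. The graph IRI `<http://example.org/graph/wikidata-labels>`, the `en`/`fr` language list and the restriction to objects in the Wikidata entity namespace are all assumptions made for illustration.

```sparql
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>

# Assumption: a hypothetical shared named graph that holds imported labels.
INSERT {
  GRAPH <http://example.org/graph/wikidata-labels> {
    ?o rdfs:label ?label .
  }
}
WHERE {
  # Step 1: collect the distinct Wikidata entities that still lack a label locally.
  {
    SELECT DISTINCT ?o WHERE {
      ?s ?p ?o .
      FILTER(ISIRI(?o) && STRSTARTS(STR(?o), "http://www.wikidata.org/entity/"))
      # Filter from the issue text: skip entities that already have an explicit label.
      FILTER NOT EXISTS {
        GRAPH <http://www.ontotext.com/explicit> {
          ?o rdfs:label ?b .
        }
      }
    }
  }
  # Step 2: ask Wikidata for labels of the remaining entities only.
  SERVICE <https://query.wikidata.org/sparql> {
    ?o rdfs:label ?label .
    # Assumption: only these languages are needed.
    FILTER(LANG(?label) IN ("en", "fr"))
  }
}
```

The intent is that the subquery shrinks the candidate set before the federated call is made. How the bindings are actually passed to the `SERVICE` clause depends on the triple store, so the real gain would need to be measured.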


saumier added this to Culture In-Time Oct 3, 2023

saumier converted this from a draft issue Oct 3, 2023

saumier removed the status in Culture In-Time Oct 23, 2023
