Alternative evaluation metrics beyond F1 score and exact match #2

Open
wbcbugfree opened this issue Jun 6, 2024 · 1 comment

wbcbugfree commented Jun 6, 2024

Find and/or develop other possible metrics to evaluate different strategies for converting text to RDF statements. Current metrics such as F1 score and exact match cannot match RDF triples semantically: they treat an RDF triple as a single string (exact match) or as three separate strings (general F1 score). Consequently, a string from an RDF triple is counted as correct only if it is exactly the same as the ground truth; otherwise it is wrong. For example, the concept "Soil Health" is sometimes defined with the URI "ex:SoilHealth" and other times with "ex:HealthySoils". Semantically the two are barely different, but under the current metrics at most one of them can be counted as correct, and any other definition scores no points. This potentially underestimates the performance of zero-shot learning, because it is much less likely to define the URI of a concept consistently.
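To make the failure mode concrete, here is a minimal sketch (not code from this repo; the triples and the `ex:` namespace are made up) of what triple-level exact match does:

```python
# Triple-level exact match treats each (s, p, o) as opaque strings, so the
# semantically equivalent URIs ex:SoilHealth and ex:HealthySoils score zero.
gold = {
    ("ex:SoilHealth", "rdf:type", "ex:Concept"),
    ("ex:SoilHealth", "ex:improvedBy", "ex:CoverCropping"),
}
predicted = {
    ("ex:HealthySoils", "rdf:type", "ex:Concept"),            # same meaning, different URI
    ("ex:HealthySoils", "ex:improvedBy", "ex:CoverCropping"),
}

true_positives = gold & predicted  # exact string comparison of whole triples
precision = len(true_positives) / len(predicted) if predicted else 0.0
recall = len(true_positives) / len(gold) if gold else 0.0
f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0

print(precision, recall, f1)  # 0.0 0.0 0.0, even though the two graphs mean the same
```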

Possible solutions:

  • RDF2vec
  • Convert RDF statements back to plain text, embed them, and compute similarity (see the sketch below)
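A rough sketch of the second bullet, assuming the sentence-transformers package and an arbitrary model choice (all-MiniLM-L6-v2); the naive verbalization and one-to-one triple pairing are placeholders, not a settled design:

```python
# Sketch of "convert back to plain text, embed, compare".
# Assumes: pip install sentence-transformers; the model choice is arbitrary.
from sentence_transformers import SentenceTransformer, util

def verbalize(triple):
    """Naive linearization of an (s, p, o) triple into a short pseudo-sentence."""
    return " ".join(part.split(":")[-1] for part in triple)  # drop "ex:"-style prefixes

gold = ("ex:SoilHealth", "ex:improvedBy", "ex:CoverCropping")
pred = ("ex:HealthySoils", "ex:improvedBy", "ex:CoverCropping")

model = SentenceTransformer("all-MiniLM-L6-v2")
emb = model.encode([verbalize(gold), verbalize(pred)], convert_to_tensor=True)

# A cosine similarity near 1.0 suggests the triples say roughly the same thing,
# even though exact match scores them as completely different.
print(util.cos_sim(emb[0], emb[1]).item())
```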
wbcbugfree self-assigned this Jun 6, 2024
wbcbugfree (Collaborator, Author) commented:

The metrics we currently have are:

  • Vanilla precision, recall and F1 score based on triple-level exact match;
  • Graph BERTScore;
  • BLEU-F1 & ROUGE-F1 (see the sketch below).
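For reference, a minimal sketch of a ROUGE-L F1 between two linearized triples using the rouge-score package; the crude camelCase splitting and the per-triple pairing are assumptions for illustration, not necessarily how our implementation aggregates scores:

```python
# ROUGE-L F1 between a gold triple and a predicted triple after naive linearization.
# Assumes: pip install rouge-score. How triples are paired before scoring is omitted.
import re
from rouge_score import rouge_scorer

def linearize(triple):
    """Split prefixes and camelCase crudely so that tokens can overlap."""
    words = []
    for part in triple:
        local = part.split(":")[-1]
        words.extend(re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", local))
    return " ".join(w.lower() for w in words)

gold = ("ex:SoilHealth", "ex:improvedBy", "ex:CoverCropping")
pred = ("ex:HealthySoils", "ex:improvedBy", "ex:CoverCropping")

scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
scores = scorer.score(linearize(gold), linearize(pred))
print(scores["rougeL"].fmeasure)  # partial credit instead of a hard zero
```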

To-do:

  • Graph Edit Distance;
  • Optimal Edit Paths (see the sketch below for both).
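One possible way to get both is via networkx, whose graph_edit_distance and optimal_edit_paths match the names above; representing the triples as a label-attributed DiGraph and matching on exact labels are assumptions that would need tuning, not a committed design:

```python
# Sketch: Graph Edit Distance / Optimal Edit Paths over RDF-style graphs via networkx.
import networkx as nx

def to_graph(triples):
    """Build a directed graph with URIs as labeled nodes and predicates as edge labels."""
    g = nx.DiGraph()
    for s, p, o in triples:
        g.add_node(s, label=s)
        g.add_node(o, label=o)
        g.add_edge(s, o, label=p)
    return g

gold = to_graph([("ex:SoilHealth", "ex:improvedBy", "ex:CoverCropping")])
pred = to_graph([("ex:HealthySoils", "ex:improvedBy", "ex:CoverCropping")])

def same_label(a, b):
    return a["label"] == b["label"]

# Minimum number of node/edge insertions, deletions and substitutions.
ged = nx.graph_edit_distance(gold, pred, node_match=same_label, edge_match=same_label)

# Optimal edit paths additionally expose which nodes/edges were matched or changed.
paths, cost = nx.optimal_edit_paths(gold, pred, node_match=same_label, edge_match=same_label)
print(ged, cost)  # expect 1 here: only the subject URI differs
```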

What we won't do anymore:

  • Matching S, P and O separately.
