`GeneralizedDiceScore` yields 0 scores when using `per_class=True` for samples where class is not present #2846

nkaenzig · 2024-11-28T09:24:49Z

🐛 Bug

The current implementation of GeneralizedDiceScore yields scores of 0.0 for samples that don't contain a particular class when calculating class-wise metrics via per_class=True.

This leads to very low dice scores, particularly for rare classes and therefore makes the dice scores between classes incomparable.

To Reproduce

The following code sample calculates class-wise scores of tensor([0.2500, 0.2500, 0.0000]), even though all the predictions match the targets:

Code sample

import torch
from torchmetrics.segmentation import GeneralizedDiceScore
from torchmetrics.segmentation import DiceScore

N_SAMPLES = 4
N_CLASSES = 3

target = torch.full((N_SAMPLES, N_CLASSES, 128, 128), 0, dtype=torch.int8)
preds = torch.full((N_SAMPLES, N_CLASSES, 128, 128), 0, dtype=torch.int8)

target[0, 0], preds[0, 0] = 1, 1 
target[2, 1], preds[2, 1]  = 1, 1

generalized_dice = GeneralizedDiceScore(num_classes=3, per_class=True, include_background=True)
print(generalized_dice(preds, target))

Expected behavior

I'd expect the above code sample to return [1.0, 1.0, nan] for the class-wise scores (nan for the third class, given that this class is not present in any of the samples, therefore returning a 1.0 score might also be misleading). Also, samples where the class doesn't occur should not contribute to the dice score of that class.

Environment

TorchMetrics version (if build from source, add commit SHA): 1.6.0
Python & PyTorch Version (e.g., 1.0): 3.11.10
Any other relevant information such as OS (e.g., Linux): macOS 15.1.1 (24B91)

Additional context

Very similar to issue #2850.

The text was updated successfully, but these errors were encountered:

github-actions · 2024-11-28T09:25:12Z

Hi! thanks for your contribution!, great first issue!

nkaenzig added bug / fix Something isn't working help wanted Extra attention is needed labels Nov 28, 2024

Borda assigned SkafteNicki Nov 28, 2024

nkaenzig mentioned this issue Nov 29, 2024

DiceScore yields 1.0 scores when using average="none" for samples where class is not present #2850

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`GeneralizedDiceScore` yields 0 scores when using `per_class=True` for samples where class is not present #2846

`GeneralizedDiceScore` yields 0 scores when using `per_class=True` for samples where class is not present #2846

nkaenzig commented Nov 28, 2024 •

edited

Loading

github-actions bot commented Nov 28, 2024

GeneralizedDiceScore yields 0 scores when using per_class=True for samples where class is not present #2846

GeneralizedDiceScore yields 0 scores when using per_class=True for samples where class is not present #2846

Comments

nkaenzig commented Nov 28, 2024 • edited Loading

🐛 Bug

To Reproduce

Expected behavior

Environment

Additional context

github-actions bot commented Nov 28, 2024

`GeneralizedDiceScore` yields 0 scores when using `per_class=True` for samples where class is not present #2846

`GeneralizedDiceScore` yields 0 scores when using `per_class=True` for samples where class is not present #2846

nkaenzig commented Nov 28, 2024 •

edited

Loading