PatchInferer with AvgMerger and filter_fn leads to NaNs #7898
Replies: 4 comments 2 replies
-
Hi @nicholas-greig, could you please share a small piece of code so that I can reproduce the issue? Thanks.
-
@KumoLiu bump
-
Hi @nicholas-greig, sorry for the late response. After taking a look at your code, I guess the problem is related to what you set at MONAI/monai/inferers/splitter.py, line 105 (commit 15d0771). Hope it helps, thanks.
-
Describe the bug
On current master, when using the PatchInferer class with an AvgMerger (the default Merger class) and a filter_fn, the counts stay zero everywhere the filter_fn filters out a region. When AvgMerger.finalize() is called, the self.values attribute of AvgMerger is divided in place by the self.counts tensor. This is a problem because self.counts is initialised to zero, and division by zero produces NaNs. So everywhere the filter_fn successfully filters out a region, we get NaN outputs.
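The failure mode above can be reproduced in isolation. This is a minimal sketch (the tensor shapes and patch coverage are hypothetical, not taken from AvgMerger's actual internals) showing how dividing an accumulator by zero-initialised counts yields NaNs at uncovered positions:

```python
import torch

# Accumulators as AvgMerger keeps them: running sums and per-position counts.
values = torch.zeros(4)
counts = torch.zeros(4)

# Simulate patches contributing only to the first two positions;
# the last two positions were filtered out by a filter_fn.
values[:2] += torch.tensor([1.0, 2.0])
counts[:2] += 1.0

# finalize() divides values by counts; 0/0 in floating point is NaN.
out = values / counts

print(out)  # positions with counts == 0 come out as NaN
```

Running this, `out[2:]` is NaN while `out[:2]` holds the correct averages, matching the behaviour described in the report.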
A quick in-place assignment to counts (setting the zero entries to 1, for example) would turn these values into zeros after the in-place division, but if the output is supposed to be real-valued/continuous, it might be better to overwrite these positions in place with the smallest representable value (using
torch.finfo(self.values.dtype).min
or something similar). Monkey-patching the outputs of an Inferer isn't a good option either, since a network can produce NaNs due to exploding weights or overflow during training, and masking that by overwriting NaNs with zero would merely obfuscate the problem.
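The suggested fix can be sketched as a masked division. This is not AvgMerger's actual code, just one way to implement the idea: average only where counts are positive, and fill never-covered positions with the smallest representable value as an explicit sentinel instead of NaN or a silent zero:

```python
import torch

def finalize_masked(values: torch.Tensor, counts: torch.Tensor) -> torch.Tensor:
    """Average `values` by `counts`, filling uncovered positions with a sentinel."""
    sentinel = torch.finfo(values.dtype).min
    # clamp avoids 0/0 inside torch.where's eagerly-evaluated branch;
    # those positions are then replaced by the sentinel anyway.
    return torch.where(
        counts > 0,
        values / counts.clamp(min=1.0),
        torch.full_like(values, sentinel),
    )

values = torch.tensor([3.0, 0.0])
counts = torch.tensor([3.0, 0.0])
print(finalize_masked(values, counts))  # covered -> average, uncovered -> finfo min
```

The sentinel keeps filtered regions distinguishable from genuine zero outputs, which matters for real-valued/continuous predictions, while genuine NaNs produced by the network itself still propagate and remain visible.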