Why can't use SAM encoder to get extracted feature? #5

ruizhaoz · 2023-08-10T19:39:33Z

Have you try directly use SAM encoder to extract feature instead use other pretrained model?

yangliu96 · 2023-08-11T07:30:54Z

The features extracted using SAM achieve only around 20 mIoU on fold 0 of COCO-20i. The SAM encoder with weak semantics performs poorly in complex scenes. Here are two reasons for this:

Poor feature matching: SAM's features fail to match multiple instances with similar semantics in complex scenes.
Poor semantic guidance: SAM cannot provide effective semantic guidance for ILM (Instance-Level Matching) to select high-quality mask proposals.

fjchange · 2024-02-20T10:12:22Z

Dinov2 has great ability in instance retrieval / dense matching. The backbone of SAM is pretrained via MAE, whose feature is not that discriminative.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why can't use SAM encoder to get extracted feature? #5

Why can't use SAM encoder to get extracted feature? #5

ruizhaoz commented Aug 10, 2023

yangliu96 commented Aug 11, 2023 •

edited

Loading

fjchange commented Feb 20, 2024

Why can't use SAM encoder to get extracted feature? #5

Why can't use SAM encoder to get extracted feature? #5

Comments

ruizhaoz commented Aug 10, 2023

yangliu96 commented Aug 11, 2023 • edited Loading

fjchange commented Feb 20, 2024

yangliu96 commented Aug 11, 2023 •

edited

Loading