SAM-Zegformer

This project uses the SAM model to generate masks for an image, then selectively merges the regions corresponding to those masks. The merging is supervised by CLIP, which scores the categories it recognizes in each image region, and the procedure is modeled on the selective search algorithm. Because CLIP recognizes small targets effectively, the project can produce a segmentation of the given image, consisting of masks and their corresponding categories. The output can be used to annotate a target segmentation dataset automatically, replacing manual annotation.
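The merging step described above can be sketched roughly as follows. This is a minimal illustration and not the code in auto_gen.py: the names `merge_regions`, `are_adjacent`, and `toy_score` are hypothetical, regions are represented as sets of pixel coordinates rather than SAM mask arrays, and the greedy "merge the adjacent pair whose score improves most" rule is an assumed simplification of selective search. In a real pipeline, `score_fn` would presumably wrap a CLIP call on the image crop for a region; here a toy scorer stands in so the example runs on its own.

```python
from itertools import combinations
from typing import Callable, FrozenSet, List, Tuple

Pixel = Tuple[int, int]                            # (row, col)
Region = FrozenSet[Pixel]                          # one mask as a set of pixels
ScoreFn = Callable[[Region], Tuple[str, float]]    # region -> (label, confidence)


def are_adjacent(a: Region, b: Region) -> bool:
    """Return True if some pixel of `a` is 4-connected to a pixel of `b`."""
    small, large = (a, b) if len(a) <= len(b) else (b, a)
    return any(
        (r + dr, c + dc) in large
        for r, c in small
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
    )


def merge_regions(
    regions: List[Region], score_fn: ScoreFn
) -> List[Tuple[Region, str, float]]:
    """Greedily merge adjacent regions while the merged score improves.

    Assumed rule: a merge is accepted only if the merged region scores
    higher than both of its parts; the pair with the largest gain goes first.
    """
    regions = list(regions)
    while True:
        best = None  # (gain, i, j, merged_region)
        for i, j in combinations(range(len(regions)), 2):
            a, b = regions[i], regions[j]
            if not are_adjacent(a, b):
                continue
            merged = a | b
            gain = score_fn(merged)[1] - max(score_fn(a)[1], score_fn(b)[1])
            if gain > 0 and (best is None or gain > best[0]):
                best = (gain, i, j, merged)
        if best is None:
            break  # no adjacent pair improves the score any further
        _, i, j, merged = best
        regions = [r for k, r in enumerate(regions) if k not in (i, j)]
        regions.append(merged)
    return [(r, *score_fn(r)) for r in regions]


def toy_score(region: Region) -> Tuple[str, float]:
    """Stand-in for CLIP: columns 0-1 are 'cat' pixels, columns 2-3 are 'dog'."""
    counts = {"cat": 0, "dog": 0}
    for _, c in region:
        counts["cat" if c < 2 else "dog"] += 1
    label = max(counts, key=counts.get)
    purity = counts[label] / len(region)   # how much of the region is this class
    coverage = counts[label] / 4           # each toy class covers 4 pixels
    return label, purity * coverage


# A 2x4 toy image over-segmented into four masks (two per object).
initial = [
    frozenset({(0, 0), (0, 1)}),
    frozenset({(1, 0), (1, 1)}),
    frozenset({(0, 2), (0, 3)}),
    frozenset({(1, 2), (1, 3)}),
]
result = merge_regions(initial, toy_score)
for region, label, confidence in result:
    print(label, confidence, sorted(region))
```

With the toy scorer, the two halves of each object are merged because the combined region scores higher than either half, while merging across the object boundary is rejected because it lowers the score.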

On the far left is the original image, in the middle is the automatic annotation produced by this project (categories not shown), and on the right is the manual annotation. The two annotation methods differ, so the results look different, but the recognized regions are very similar.

