I created a project that uses Grounded-DINO, SAM (Segment Anything Model), and tracking algorithms to achieve text-prompt-based object recognition and continuous tracking in videos. This combination allows objects in video content to be identified and tracked precisely and efficiently from textual descriptions.
Objectives:
1. Text-Prompt Based Object Recognition:
• Utilize Grounded-DINO to interpret and understand textual prompts for object identification within video frames.
2. Segmentation and Analysis:
• Implement SAM (Segment Anything Model) to accurately segment and analyze objects in video frames based on the prompts provided.
3. Continuous Object Tracking:
• Apply SAM 2 tracking to maintain and follow the identified objects throughout the video, ensuring consistent and reliable tracking over time (see the sketch after this list).
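Concretely, the three stages chain together roughly as follows. This is a minimal sketch, not the project's actual script: it assumes the groundingdino and sam2 Python packages are installed, and the config/checkpoint filenames, frame paths, and example prompt are placeholders to adapt to your setup.

import torch
from torchvision.ops import box_convert
from groundingdino.util.inference import load_model, load_image, predict
from sam2.build_sam import build_sam2_video_predictor

# 1) Text-prompted detection on the first frame with Grounded-DINO.
#    Config/checkpoint paths below are placeholders.
dino = load_model("GroundingDINO_SwinT_OGC.py", "groundingdino_swint_ogc.pth")
image_source, image = load_image("raw_data/00000.jpg")
boxes, logits, phrases = predict(
    model=dino,
    image=image,
    caption="a red car .",        # free-form text prompt (example)
    box_threshold=0.23,
    text_threshold=0.25,
)

# Grounded-DINO returns normalized cxcywh boxes; convert to absolute xyxy.
h, w, _ = image_source.shape
boxes_xyxy = box_convert(
    boxes * torch.tensor([w, h, w, h]), in_fmt="cxcywh", out_fmt="xyxy"
).numpy()

# 2)+3) Seed SAM 2's video predictor with the detected box. SAM 2 segments
# the object in the first frame and propagates the mask through the video.
tracker = build_sam2_video_predictor("sam2_hiera_l.yaml", "sam2_hiera_large.pt")
state = tracker.init_state(video_path="raw_data")  # directory of frames
tracker.add_new_points_or_box(
    inference_state=state, frame_idx=0, obj_id=1, box=boxes_xyxy[0]
)
for frame_idx, obj_ids, mask_logits in tracker.propagate_in_video(state):
    masks = (mask_logits > 0.0).cpu().numpy()  # binary masks per object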
Benefits:
• Efficiency: Streamline the process of object recognition and tracking by leveraging state-of-the-art models.
• Accuracy: Enhance the precision of object identification and tracking through advanced segmentation techniques.
• Automation: Enable automated monitoring and analysis of video content based on textual descriptions, reducing the need for manual intervention.
This project aims to integrate cutting-edge technologies in computer vision and natural language processing to create a robust system for video content analysis and tracking.
Prepare the image data according to the following rule: place it under /your_own_path/raw_data, where the directory name raw_data is fixed and /your_own_path can be any location.
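For example, assuming the video has already been extracted into numbered image frames (the filenames below are illustrative):

/your_own_path/raw_data/
    00000.jpg
    00001.jpg
    ...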
You can use the provided Dockerfile to build the environment, as in Grounded-SAM.
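A typical build-and-run sequence might look like this (the image tag and mount path are illustrative; the exact flags depend on your Docker and GPU setup):

docker build -t grounded-sam .
docker run --gpus all -it -v /your_path_to_data:/your_path_to_data grounded-sam

Inside the container, change to the project directory: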
cd /home/appuser/Grounded-Segment-Anything
Then run the pipeline, for example:
python grounded_sam_with_sam_tracking.py -i /your_path_to_data/raw_data -o /your_path_to_data/ --box_threshold 0.23
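Here -i points at the raw_data directory, -o sets where outputs are written, and --box_threshold is the minimum confidence for a Grounded-DINO detection to be kept; lowering it recovers more objects at the cost of more false positives.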
If you want to see the pretrained detector's results (the raw images with their predicted boxes), please run:
python draw_raw_image_and_box.py
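This is useful for sanity-checking the detections and tuning --box_threshold before running the full tracking pipeline.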