Change the repository type filter
All
Repositories list
3.8k repositories
MMFuser
PublicThe official implementation of the paper "MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding". MMFuser addresses the limitations of current MLLMs in capturing complex image details by simply yet efficiently integrating multi-layer features from ViTs.jetson-examples
PublicAdver-City
Publict2v-turbo
PublicOccRWKV
PublicVLAD-BuFF
PublicPDF-Embedding
PublicPlugNPlay-Modules
PublicSpaceJAM
PublicSimMAT
PublicXNetv2
PublicVP-LLR
PublicSCNet-IJCV24
Public3D-Speaker
PublicCOCO-UniHuman
PublicAPGCC
PublicABAFnet
Publicmgc
PublicRWKV-CLIP
PublicVisualRWKV
PublicStreamSpeech
Publicin-context-matting
PublicBasicPBC
PublicShadow_R
PublicE2STR
PublicMPCount
PublicPIIP
Publicconv-llava
Public