Skip to content
@dvlab-research

DV Lab

Deep Vision Lab

Popular repositories Loading

  1. MGM MGM Public

    Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

    Python 3.2k 281

  2. LongLoRA LongLoRA Public

    Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

    Python 2.6k 281

  3. LISA LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 2k 137

  4. ControlNeXt ControlNeXt Public

    Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

    Python 1.5k 77

  5. LLaMA-VID LLaMA-VID Public

    LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

    Python 774 45

  6. VoxelNeXt VoxelNeXt Public

    VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)

    Python 761 68

Repositories

Showing 10 of 73 repositories
  • VisionZip Public

    Official repository for VisionZip (CVPR 2025)

    Python 246 Apache-2.0 11 11 0 Updated Feb 27, 2025
  • LISA Public

    Project Page for "LISA: Reasoning Segmentation via Large Language Model"

    Python 2,048 Apache-2.0 137 89 0 Updated Feb 16, 2025
  • Step-DPO Public

    Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

    Python 352 13 18 0 Updated Jan 19, 2025
  • MagicMirror Public

    Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

    108 3 6 0 Updated Jan 13, 2025
  • Lyra Public

    Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

    Python 279 Apache-2.0 43 2 1 Updated Jan 9, 2025
  • LBGAT Public

    Learnable Boundary Guided Adversarial Training (ICCV2021)

    Python 36 MIT 2 3 0 Updated Dec 9, 2024
  • Mr-Ben Public

    This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"

    Python 46 MIT 1 1 0 Updated Oct 31, 2024
  • Parametric-Contrastive-Learning Public

    Parametric Contrastive Learning (ICCV2021) & GPaCo (TPAMI 2023)

    Python 247 MIT 32 7 0 Updated Sep 26, 2024
  • ControlNeXt Public

    Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

    Python 1,525 Apache-2.0 77 47 1 Updated Sep 25, 2024
  • TagCLIP Public
    Python 8 1 1 0 Updated Sep 3, 2024