Skip to content
@DAMO-NLP-SG

Language Technology Lab at Alibaba DAMO Academy

Pinned Loading

  1. DAMO-SeaLLMs Public

    [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

    JavaScript 166 16

  2. VideoLLaMA3 Public

    Frontier Multimodal Foundation Models for Image and Video Understanding

    Jupyter Notebook 747 50

  3. CoI-Agent Public

    Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

    Python 446 27

  4. Inf-CLIP Public

    [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training sc…

    Python 242 11

  5. multimodal_textbook Public

    The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

    Python 152 17

  6. VideoRefer Public

    [CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

    Python 188 10

Repositories

Showing 10 of 53 repositories
  • LLM-Multilingual-Knowledge-Boundaries Public

    Analyzing LLMs Multilingual Knowledge Boundaries through the lens of Internal Representations

    Python 4 MIT 0 1 0 Updated Apr 22, 2025
  • translation-all-you-need Public

    [NAACL 2025] Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Python 1 MIT 0 0 0 Updated Apr 21, 2025
  • SeaLLMs-Audio Public
    HTML 45 4 1 0 Updated Apr 19, 2025
  • VideoLLaMA3 Public

    Frontier Multimodal Foundation Models for Image and Video Understanding

    Jupyter Notebook 747 Apache-2.0 50 48 (2 issues need help) 2 Updated Apr 17, 2025
  • VideoRefer Public

    [CVPR 2025] The code for "VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM"

    Python 188 10 5 0 Updated Apr 1, 2025
  • multimodal_textbook Public

    The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

    Python 152 Apache-2.0 17 4 1 Updated Mar 17, 2025
  • MMR1 Public Forked from LengSicong/MMR1

    MMR1: Advancing the Frontiers of Multimodal Reasoning

    0 Apache-2.0 5 0 0 Updated Mar 12, 2025
  • FineReason Public

    FineReason: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

    Python 5 0 1 0 Updated Mar 3, 2025
  • LongPO Public

    [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization

    Python 33 4 0 0 Updated Feb 27, 2025
  • VideoLLaMA2 Public

    VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Python 1,147 Apache-2.0 77 80 (2 issues need help) 0 Updated Jan 23, 2025