I am currently a master's student in the Information Processing Lab at the University of Washington. I work on video understanding and generation, as well as embodied agents. Have a look at my homepage for more details.
When I am not doing research, I like photography, traveling, and singing.
Updates:
- 12/2024: Two papers accepted to AAAI 2025.
- 10/2024: I started writing a blog. Check it out here.
- 07/2024: Two papers accepted to ACM MM 2024.
- 07/2024: Two papers accepted to ECCV 2024.
- 06/2024: One technical report accepted to CVPR 2024 workshop @ NTIRE.
- 06/2024: MovieChat was selected as a highlight paper (rank 68) of CVPR 2024 in Paper Digest.
- 06/2024: We are working with Pika Lab to develop next-generation video understanding and generation models.
- 05/2024: One paper accepted to CVPR 2024 workshop @ Embodied AI.
- 04/2024: We are hosting CVPR 2024 Long-form Video Understanding Challenge @ LOVEU.
- 04/2024: Invited talk at the AgentX seminar on our STEVE series of works.
- 03/2024: One paper accepted to ICLR 2024 workshop at LLM Agents.
- 02/2024: Two papers accepted to CVPR 2024.
- 02/2024: Invited talk at AAAI 2024 workshop at IMAGEOMICS.
- 12/2023: One paper accepted to ICASSP 2024.
- 12/2023: One paper accepted to AAAI 2024.
- 11/2023: Two papers accepted to WACV 2024 and its workshop at CV4Smalls.
- 09/2023: One paper accepted to ICCV 2023 workshop at TNGCV-DataComp.
- 09/2023: One paper accepted to IEEE T-MM.
- 08/2023: One paper accepted to BMVC 2023.
- 07/2023: Two papers accepted to ACM MM 2023.
- 07/2023: Finished my research internship at Microsoft Research Asia (MSRA), Beijing.
- 07/2023: Two papers accepted to ICCV 2023.