- 👩 I’m Sheng, a PhD student from China, currently studying as a visiting student at the National University of Singapore.
- 🧐 My focus is multimedia learning, especially VQA, and I’m currently exploring multimodal LLMs.
- 💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
- 📫 You can reach me at [email protected]—let’s connect!
🐢
Focusing
VQA & MLLM.
-
Hefei University of Technology
- China
-
08:41
- 8h ahead - https://zhousheng97.github.io/
Pinned Loading
-
EgoTextVQA
EgoTextVQA Public[CVPR 2025] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Python 26
-
Awesome-MLLM-TextVQA
Awesome-MLLM-TextVQA Public✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks
326 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | |||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Contributed to
zhousheng97/zhousheng97.github.io,
zhousheng97/EgoTextVQA,
zhousheng97/EgoTextVQA_page
and 11 other
repositories
Loading
Contribution activity
April 2025
Created 1 commit in 1 repository
Created an issue in LLaVA-VL/LLaVA-NeXT that received 2 comments
For LLaVA-Video-Qwen2-7B, are the results of video inference and multi-frame inference consistent?
Hello, has anyone tested llava-video-qwen2-7b? Is there any difference in the results for the same video content, but only the difference in multi-…
2
comments