- π I'm currently pursuing a PhD in the field of multimodal learning.
- π Iβm currently working on expressing visual content using language (e.g., image captioning, object description).
- π± Iβm looking to collaborate on using high-quality captions to train a diffusion / auto-regressive generation model to generate high-quality visual content (video/image/3D model).
- π« How to reach me: [email protected].
π¬
brain storm
I am currently pursuing a PhD
π in the field of computer vision (CV).
Pinned Loading
-
ControllableObjectDescription
ControllableObjectDescription PublicA training-free pipeline to control dimension details in object description.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.