🔬 brain storm

Organizations

@PRIS-CV

xin-ran-w/README.md

Hi there 👋

  • 🎓 I'm pursuing a PhD in multimodal learning.
  • 🔭 I'm currently working on expressing visual content in language (e.g., image captioning, object description).
  • 🌱 I'm looking to collaborate on using high-quality captions to train diffusion or autoregressive models that generate high-quality visual content (video, images, 3D models).
  • 📫 How to reach me: [email protected].

Pinned

  1. CapAgent (Public)

    From Simple to Professional - A Combinatorial Controllable Image Captioning Agent

    Python 3

  2. ControllableObjectDescription (Public)

    A training-free pipeline to control dimension details in object description.

    Python 1