Organizations

@AI-secure

Pinned

  1. ISR (Public)

    Invariant-feature Subspace Recovery (ISR)

    Python, 23 stars, 5 forks

  2. RLHFlow/RLHF-Reward-Modeling (Public)

    Recipes to train reward model for RLHF.

    Python, 740 stars, 62 forks

  3. AI-secure/multi-task-learning (Public)

    Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation" by Haoxiang Wang, Han Zhao, and Bo Li.

    Python, 68 stars, 9 forks

  4. RLHFlow/Directional-Preference-Alignment (Public)

    Directional Preference Alignment

    47 stars, 3 forks