Support cogvlm.
Optimize cogvlm performance.

Patch cogvlm language part.

Remove some redundant code.

Remove some changes.

Remove some variables.

feat: change infer_ext ops function param order (#2)

feat: support ascend qwen2 and qwen2_moe (#6)

* feat: support ascend qwen2 and qwen2_moe

* fix: fix ascend mixtral

ascend: align attention mask to 32 bytes (#7)

fix attn args (#9)

fix: expand shape of attn_mask (#10)

Fix list.
pdx1989 committed Aug 23, 2024
1 parent b03e086 commit 311eac1
Showing 2 changed files with 320 additions and 5 deletions.