Skip to content

Commit

Permalink
Fix qwen-vl-chat hung (#1824)
Browse files Browse the repository at this point in the history
  • Loading branch information
irexyc authored Jun 21, 2024
1 parent 21533e7 commit 4067cb2
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions lmdeploy/vl/model/qwen.py
Original file line number Diff line number Diff line change
Expand Up @@ -37,10 +37,10 @@ def build_model(self):
max_memory = get_balanced_memory(
model,
dtype=torch.half,
no_split_module_classes=['VisualAttentionBlock'])
no_split_module_classes=['VisualAttentionBlock', 'Resampler'])
device_map = infer_auto_device_map(
model,
no_split_module_classes=['VisualAttentionBlock'],
no_split_module_classes=['VisualAttentionBlock', 'Resampler'],
max_memory=max_memory,
dtype=torch.half)
same_device_keys = [('transformer.visual.conv1',
Expand Down

0 comments on commit 4067cb2

Please sign in to comment.