ChatGLM4模型由于transformers版本问题部分解决 #52

3186218763 · 2024-12-11T04:09:43Z

ChatGLMForConditionalGeneration部分需要额外继承transformers的GenerationMixin，

`class ChatGLMForConditionalGeneration(ChatGLMPreTrainedModel, GenerationMixin):
def init(self, config: ChatGLMConfig, empty_init=True, device=None):
super().init(config)

    self.max_sequence_length = config.max_length  # 最大序列长度
    self.transformer = ChatGLMModel(config, empty_init=empty_init, device=device)  # 使用 ChatGLMModel 类
    self.config = config

def _update_model_kwargs_for_generation(
        self,
        outputs: ModelOutput,
        model_kwargs: Dict[str, Any],
        is_encoder_decoder: bool = False,
        standardize_cache_format: bool = False,
) -> Dict[str, Any]:
    # 更新 past_key_values
    _, model_kwargs["past_key_values"] = self._extract_past_from_model_output(
        outputs
    )

`
self._extract_past_from_model_output方法和之前不一样，传入请删除standardize_cache_format，高版本没有这个参数，返回也变成了cache_name, past_key_values

把model_kwargs["past_key_values"] = self._extract_past_from_model_output(
outputs, standardize_cache_format=standardize_cache_format
)
变成
_, model_kwargs["past_key_values"] = self._extract_past_from_model_output(
outputs
)就欧克了

The text was updated successfully, but these errors were encountered:

HeiBoWang · 2024-12-11T04:10:17Z

你好，我已经收到您的邮件，稍后尽快给你回复。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChatGLM4模型由于transformers版本问题部分解决 #52

ChatGLM4模型由于transformers版本问题部分解决 #52

3186218763 commented Dec 11, 2024

HeiBoWang commented Dec 11, 2024 via email

ChatGLM4模型由于transformers版本问题部分解决 #52

ChatGLM4模型由于transformers版本问题部分解决 #52

Comments

3186218763 commented Dec 11, 2024

HeiBoWang commented Dec 11, 2024 via email