
DPMultiheadAttention is not a full drop-in replacement for nn.MultiheadAttention #596

Closed
jfb54 opened this issue Jul 7, 2023 · 3 comments

jfb54 commented Jul 7, 2023

🐛 Bug

Not only is the API missing the batch_first parameter (#512), it is also missing the in_proj_weight parameter. This makes it impossible to use Opacus with a transformer that initializes that weight.
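A minimal reproduction sketch of both gaps, assuming Opacus 1.4 with DPMultiheadAttention importable from opacus.layers (exact error messages and attribute layout may differ across versions):

```python
import torch.nn as nn
from opacus.layers import DPMultiheadAttention

embed_dim, num_heads = 16, 4

# The stock PyTorch layer accepts batch_first and exposes a fused
# in_proj_weight that user code often initializes directly.
mha = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
nn.init.xavier_uniform_(mha.in_proj_weight)

# The DP replacement (Opacus 1.4) rejects batch_first ...
try:
    DPMultiheadAttention(embed_dim, num_heads, batch_first=True)
except TypeError as err:
    print(err)  # unexpected keyword argument 'batch_first'

# ... and has no fused in_proj_weight attribute to initialize.
dp_mha = DPMultiheadAttention(embed_dim, num_heads)
print(hasattr(dp_mha, "in_proj_weight"))  # expected: False
```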

Expected behavior

Ability to access the in_proj_weight parameter so that it may be initialized.

Environment

Opacus 1.4
PyTorch 2.0.1

@HuanyuZhang (Contributor)

Thanks for reporting the bug. We will fix it, together with #512, to make this work.

@tranvansang

I am facing the same issue. In addition to what is described in the issue description, the forward() call also lacks the is_causal parameter.
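A short sketch of that mismatch, assuming PyTorch 2.0.x (where nn.MultiheadAttention.forward accepts is_causal alongside attn_mask) and Opacus 1.4; the exact exception text is an assumption:

```python
import torch
import torch.nn as nn
from opacus.layers import DPMultiheadAttention

embed_dim, num_heads, seq_len, batch = 16, 4, 8, 2
x = torch.randn(seq_len, batch, embed_dim)  # (seq, batch, embed), since batch_first is unavailable
mask = nn.Transformer.generate_square_subsequent_mask(seq_len)

# PyTorch >= 2.0 accepts the is_causal hint together with a causal attn_mask.
mha = nn.MultiheadAttention(embed_dim, num_heads)
out, _ = mha(x, x, x, attn_mask=mask, is_causal=True)

# The DP replacement's forward() has no is_causal argument, so the same call fails.
dp_mha = DPMultiheadAttention(embed_dim, num_heads)
try:
    dp_mha(x, x, x, attn_mask=mask, is_causal=True)
except TypeError as err:
    print(err)  # forward() got an unexpected keyword argument 'is_causal'
```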


HuanyuZhang commented Dec 6, 2023

Closing this issue, since we landed fixes in PR #598. By the way, it is also feasible to use https://github.com/lxuechen/private-transformers for Hugging Face transformers.
