Process separate q/k/v weights in MHA converter #1020

aakhundov · 2024-08-12T21:15:43Z

Summary:
ATT. The converter was not ready for the not self._qkv_same_embed_dim case here with separate q/k/v weights. Here we cover this case.

Intenral:

This causes a failure in the AIT lowering of the IGCTR MC model. See the post: https://fb.workplace.com/groups/gpuinference/permalink/2872581106223872/ .

Differential Revision: D61155566

Summary: ATT. The converter was not ready for the `not self._qkv_same_embed_dim` case [here](/~https://github.com/pytorch/pytorch/blob/80ed3e9ccdaab20814b4156611a19043aaaaef03/torch/nn/modules/activation.py#L1074) with separate q/k/v weights. Here we cover this case. Intenral: This causes a failure in the AIT lowering of the IGCTR MC model. See the post: https://fb.workplace.com/groups/gpuinference/permalink/2872581106223872/ . Differential Revision: D61155566

facebook-github-bot · 2024-08-12T21:16:03Z

This pull request was exported from Phabricator. Differential Revision: D61155566

facebook-github-bot · 2024-08-13T01:06:36Z

This pull request has been merged in 7b41778.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 12, 2024

facebook-github-bot added the fb-exported label Aug 12, 2024

facebook-github-bot closed this in 7b41778 Aug 13, 2024

facebook-github-bot added the Merged label Aug 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Process separate q/k/v weights in MHA converter #1020

Process separate q/k/v weights in MHA converter #1020

aakhundov commented Aug 12, 2024

facebook-github-bot commented Aug 12, 2024

facebook-github-bot commented Aug 13, 2024

Process separate q/k/v weights in MHA converter #1020

Process separate q/k/v weights in MHA converter #1020

Conversation

aakhundov commented Aug 12, 2024

facebook-github-bot commented Aug 12, 2024

facebook-github-bot commented Aug 13, 2024