Integration flash attention #49869
Conversation
Your PR was submitted successfully. Thank you for contributing to this open-source project!
❌ The PR was not created using the PR template. You can refer to this Demo.
Force-pushed from f691d4c to 5290470
LGTM for pkg size
LGTM for setup
@jeff41404 Xiang, could you take a look at this? Does this interface count as a newly added API?
LGTM for pkg size
LGTM
PR types
New features
PR changes
OPs
Describe
Integrating flash-attention into PaddlePaddle.
Usage
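A minimal usage sketch. The import path, the `flash_attention(query, key, value, dropout, causal, return_softmax)` signature, the `(out, softmax)` return pair, and the `[batch, seq_len, num_heads, head_dim]` fp16 input layout are assumptions based on this PR's description, not confirmed against the merged code:

```python
# Minimal usage sketch -- the import path, signature, and return values
# below are assumptions based on this PR's description, not a confirmed API.
import paddle
from paddle.nn.functional.flash_attention import flash_attention  # assumed path

batch, seq_len, num_heads, head_dim = 2, 1024, 16, 64

# flash-attention kernels expect half-precision inputs laid out as
# [batch, seq_len, num_heads, head_dim] (assumed layout)
q = paddle.randn([batch, seq_len, num_heads, head_dim]).astype("float16")
k = paddle.randn([batch, seq_len, num_heads, head_dim]).astype("float16")
v = paddle.randn([batch, seq_len, num_heads, head_dim]).astype("float16")

# causal masking for GPT-style training; dropout disabled here
out, softmax = flash_attention(q, k, v, dropout=0.0, causal=True,
                               return_softmax=False)
print(out.shape)  # expected: [2, 1024, 16, 64]
```

Note that flash-attention kernels generally require a CUDA-enabled build and fp16/bf16 inputs on a recent NVIDIA GPU.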
Validated with the PaddleFleetX GPT 1.3B model.
Performance impact: 700 ms/step -> 619 ms/step, a ~12% reduction in step time.
Convergence results are shown below.