[CINN] x86 runtime intrinsics for composite reduce #72371

Merged
merged 1 commit into from
Apr 21, 2025

@Enigmatisms (Contributor) commented Apr 19, 2025

PR Category

CINN

PR Types

Improvements

Description

The following PRs together form the x86 support for composite reduce (Welford variance and argmin/argmax):

This PR adds the x86 intrinsic functions that support composite reduce ops. Note that, because of LLVM's ABI optimization, the functions defined in cinn_runtime do essentially the same thing as those in cinn_cuda_runtime_source, but their signatures differ considerably: on x86, most struct inputs and outputs are pointer-typed.

We also removed pd_op.variance, cinn_op.argmax, and cinn_op.argmin from the list of ops denied on CPU, so kernels for these ops can now be generated on x86.

Pcard-89620

paddle-bot bot commented Apr 19, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

@lshpku lshpku merged commit e888198 into PaddlePaddle:develop Apr 21, 2025
38 checks passed
@Enigmatisms Enigmatisms deleted the rt_intrinsic_x86 branch April 21, 2025 06:18
2 participants