Motivation and Context
This PR:
1. Fixes the DIOPI CUDA impl and related code so that it compiles normally.
2. Adds log1p to the CUDA impl, supporting float32 and float64.
3. Adds log1p to the torch impl, supporting almost all input types.
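
For reviewers unfamiliar with the op, the following is a minimal host-side sketch of the elementwise semantics of log1p restricted to float32 and float64. It is not the DIOPI API and not the code in this PR; `log1p_elementwise` is a hypothetical name used only for illustration, and the dtype restriction is modeled here with a `static_assert`.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <type_traits>

// Hypothetical helper (not part of DIOPI): writes log(1 + in[i]) to out[i].
// Only float and double are accepted, mirroring the dtypes listed for the CUDA impl.
template <typename T>
void log1p_elementwise(const T* in, T* out, std::size_t n) {
    static_assert(std::is_same<T, float>::value || std::is_same<T, double>::value,
                  "this sketch only supports float32 and float64");
    assert(n == 0 || (in != nullptr && out != nullptr));
    for (std::size_t i = 0; i < n; ++i) {
        // std::log1p stays accurate for small |x|, where log(1 + x) loses precision.
        out[i] = std::log1p(in[i]);
    }
}
```

A real CUDA kernel would replace the loop with one thread per element; the per-element computation is assumed to be the same.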
Description
Use cases (Optional)
BC-breaking (Optional)
Checklist
Before PR:
[x] I have read and followed the workflow indicated in the Contributors.md to create this PR.
[x] Pre-commit or linting tools indicated in Contributors.md are used to fix the potential lint issues.
[x] Bug fixes are covered by unit tests; the case that causes the bug should be added to the unit tests.
[x] New functionalities are covered by complete unit tests. If not, please add more unit tests to ensure correctness.
[x] The documentation has been modified accordingly, including docstring or example tutorials.
After PR:
[x] CLA has been signed and all committers have signed the CLA in this PR.