[CUDA] Fix NumericLimits #22738

tianleiwu · 2024-11-05T21:56:18Z

Description

Fix NumericLimits<float> that used infinity as max, which is not consistent with std::numeric_limits<float>::max()
In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5.
Rename NumericLimits<T>::Min to Lowest to be consistent with std::numeric_limits
Fix topk implementation: use NumericLimits<CudaT> instead of NumericLimits<T> in kernel. That could avoid defining a confusing defintion of NumericLimits<MLFloat16> that returns half instead of MLFloat16.
Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half.

Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now.

Motivation and Context

### Description * Fix `NumericLimits<float>` that used infinity as max, which is not consistent with `std::numeric_limits<float>::max()` In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5. * Rename `NumericLimits<T>::Min` to Lowest to be consistent with std::numeric_limits * Fix topk implementation: use `NumericLimits<CudaT>` instead of `NumericLimits<T>` in kernel. That could avoid defining a confusing defintion of `NumericLimits<MLFloat16>` that returns half instead of MLFloat16. * Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half. Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now. ### Motivation and Context microsoft#22728

### Description * Fix `NumericLimits<float>` that used infinity as max, which is not consistent with `std::numeric_limits<float>::max()` In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5. * Rename `NumericLimits<T>::Min` to Lowest to be consistent with std::numeric_limits * Fix topk implementation: use `NumericLimits<CudaT>` instead of `NumericLimits<T>` in kernel. That could avoid defining a confusing defintion of `NumericLimits<MLFloat16>` that returns half instead of MLFloat16. * Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half. Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now. ### Motivation and Context #22728

### Description * Fix `NumericLimits<float>` that used infinity as max, which is not consistent with `std::numeric_limits<float>::max()` In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5. * Rename `NumericLimits<T>::Min` to Lowest to be consistent with std::numeric_limits * Fix topk implementation: use `NumericLimits<CudaT>` instead of `NumericLimits<T>` in kernel. That could avoid defining a confusing defintion of `NumericLimits<MLFloat16>` that returns half instead of MLFloat16. * Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half. Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now. ### Motivation and Context microsoft#22728

### Description * Fix `NumericLimits<float>` that used infinity as max, which is not consistent with `std::numeric_limits<float>::max()` In Windows, (float)(1e+300) is used for INFINITY, which causes compiler error in Visual Studio 2022 v17.12 Preview 5. * Rename `NumericLimits<T>::Min` to Lowest to be consistent with std::numeric_limits * Fix topk implementation: use `NumericLimits<CudaT>` instead of `NumericLimits<T>` in kernel. That could avoid defining a confusing defintion of `NumericLimits<MLFloat16>` that returns half instead of MLFloat16. * Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly, which is faster than converting float to half. Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3 and __nv_fp8_e5m2 right now. ### Motivation and Context #22728

### Description - Update ORT version to 1.20.2 - Cherry-pick commits: - #23243 - #22738 - #22868 - #23281 - #22543 - #22566 - #23308 - #23017 - [Main feature] #23368 ### Motivation and Context  --------- Co-authored-by: Changming Sun <chasun@microsoft.com> Co-authored-by: Tianlei Wu <tlwu@microsoft.com> Co-authored-by: Jian Chen <cjian@microsoft.com> Co-authored-by: Yi Zhang <zhanyi@microsoft.com> Co-authored-by: Caroline Zhu <wolfivyaura@gmail.com>

Fix NumericLimits

ce702b4

tianleiwu marked this pull request as draft November 5, 2024 22:16

refine

d27100f

tianleiwu marked this pull request as ready for review November 6, 2024 00:27

tianleiwu requested review from yufenglee and snnn November 6, 2024 01:22

snnn approved these changes Nov 6, 2024

View reviewed changes

tianleiwu merged commit d993ec3 into main Nov 6, 2024
91 checks passed

tianleiwu deleted the tlwu/fix_numeric_limits branch November 6, 2024 17:53

adrianlizarraga mentioned this pull request Feb 4, 2025

[ORT 1.20.2 Release] Cherry pick 1st round #23574

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA] Fix NumericLimits #22738

[CUDA] Fix NumericLimits #22738

tianleiwu commented Nov 5, 2024 •

edited

Loading

[CUDA] Fix NumericLimits #22738

[CUDA] Fix NumericLimits #22738

Conversation

tianleiwu commented Nov 5, 2024 • edited Loading

Description

Motivation and Context

tianleiwu commented Nov 5, 2024 •

edited

Loading