Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ORT 1.20.2 Release] Cherry pick 1st round #23574

Open
wants to merge 12 commits into
base: rel-1.20.2
Choose a base branch
from

Conversation

adrianlizarraga and others added 7 commits February 3, 2025 15:20
### Description
The Web CI pipeline uses three different Windows machine pools:
1. onnxruntime-Win2022-webgpu-A10
2. onnxruntime-Win2022-VS2022-webgpu-A10
3. onnxruntime-Win-CPU-2022-web

This PR merges them together to reduce ongoing maintenance cost.
### Description
* Fix `NumericLimits<float>` that used infinity as max, which is not
consistent with `std::numeric_limits<float>::max()`
In Windows, (float)(1e+300) is used for INFINITY, which causes compiler
error in Visual Studio 2022 v17.12 Preview 5.
* Rename `NumericLimits<T>::Min` to Lowest to be consistent with
std::numeric_limits
* Fix topk implementation: use `NumericLimits<CudaT>` instead of
`NumericLimits<T>` in kernel. That could avoid defining a confusing
defintion of `NumericLimits<MLFloat16>` that returns half instead of
MLFloat16.
* Use CUDART_MAX_NORMAL_FP16 if possible. It sets bits value directly,
which is faster than converting float to half.

Note that NumericLimits does not support __nv_bfloat16 and _nv_fp8_e4m3
and __nv_fp8_e5m2 right now.

### Motivation and Context
#22728
Replace INFINITY by `std::numeric_limits<float>::infinity()` to avoid
build errors with Visual Studio 2022 v17.12 Preview 5

### Motivation and Context
#22728
… default (#23368)

Makes the QNN provider option `offload_graph_io_quantization` enabled by
default. It was previously disabled by default.

Enabling this option significantly decreases inference latency for many
models.
### Description
Add a temporary path to RN 0.69.3 to update the boost url


### Motivation and Context
Fix the React-native CI until we update the RN to 0.70.15 or 0.73.3+
versions
@snnn
Copy link
Member

snnn commented Feb 5, 2025

Please also cherry-pick a25c931 and a910ced so that orttraining-linux-ci-pipeline will pass.

snnn and others added 5 commits February 5, 2025 12:08
Move ORT Training pipeline to github actions and enable CodeQL scan for the code(including inference code).
We will move all pull request pipelines to Github Actions.
### Description
Move Linux Github actions to a dedicated pool. Currently the
"orttraining-linux-ci-pipeline " is too slow.

### Motivation and Context
To speed up the running.
Fix some inconsistency. 

All our iOS build should target iOS 15.1.
All our macOS desktop build should target macOS 13.3 to align with the
changes made in #17361
### Description
<!-- Describe your changes. -->



### Motivation and Context
ESRP code-sign task has supported .net 8, so we can remove macos-12
### Description
Removing net8.0-android target from the MAUI test project. This test is
not currently being run in the pipelines, but build issues is blocking
the patch release.

[Pipeline
run](https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=674060&view=results)
(Still running into unrelated Failure to get the
"onnxruntimepackagetest" image errors)
@adrianlizarraga adrianlizarraga marked this pull request as ready for review February 7, 2025 00:14
@adrianlizarraga adrianlizarraga requested a review from a team as a code owner February 7, 2025 00:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants